
Deci claims breakthrough inference performance on Intel Sapphire Rapids

Israeli startup Deci claims GPU-like performance in running computer vision and natural language processing models on Intel’s fourth-generation Xeon processors

Israeli startup Deci claims to have achieved breakthrough performance in running computer vision and natural language processing (NLP) models on Intel’s latest Xeon processor.

The breakthrough delivers graphics processing unit (GPU)-like artificial intelligence (AI) inference performance and was accomplished with the company’s proprietary Automated Neural Architecture Construction (AutoNAC) technology, which generates custom hardware-aware AI model architectures.

For computer vision, Deci claimed to have delivered more than a three-fold increase in throughput, as well as a 1% boost in accuracy, when compared with an INT8 version of a ResNet50 convolutional neural network running on Intel’s fourth-generation Xeon processor, codenamed Sapphire Rapids.
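For context, throughput in such computer vision benchmarks is usually reported as images processed per second. The following minimal sketch shows how that figure might be measured for a stock torchvision ResNet50 on a CPU; the batch size, iteration counts and timing method are illustrative assumptions, not Deci’s benchmark harness.

```python
import time
import torch
import torchvision

# Baseline FP32 ResNet50 from torchvision; Deci's comparison point was an
# INT8 build on Sapphire Rapids, so any numbers here are purely illustrative.
model = torchvision.models.resnet50(weights=None).eval()

batch = torch.randn(32, 3, 224, 224)  # assumed batch size for illustration

with torch.no_grad():
    for _ in range(10):  # warm-up iterations so timings exclude start-up cost
        model(batch)
    iters = 50
    start = time.perf_counter()
    for _ in range(iters):
        model(batch)
    elapsed = time.perf_counter() - start

print(f"Throughput: {iters * batch.shape[0] / elapsed:.1f} images/sec")
```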

For NLP, Deci also delivered a more than three-fold speed-up, with improved accuracy, compared with the INT8 version of the Bert language model on Intel Sapphire Rapids. The models were compiled and quantised to INT8 with Intel’s Advanced Matrix Extensions (AMX) and the Intel extension for PyTorch.
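For illustration, below is a minimal sketch of the static INT8 quantisation flow that the Intel extension for PyTorch (IPEX) documents for models such as Bert. The checkpoint, calibration step and API names (prepare, convert, default_static_qconfig) follow IPEX’s public examples, but exact names vary between IPEX releases, and this is not Deci’s benchmark code.

```python
import torch
import intel_extension_for_pytorch as ipex
from intel_extension_for_pytorch.quantization import prepare, convert
from transformers import AutoModel, AutoTokenizer

# Hypothetical example: quantising a Bert-base encoder to INT8 with IPEX.
# torchscript=True makes the Hugging Face model return traceable tuples.
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased", torchscript=True).eval()

example = tokenizer("Sapphire Rapids inference test", return_tensors="pt")
inputs = (example["input_ids"], example["attention_mask"])

# API as of IPEX 1.13; newer releases expose a qconfig mapping instead.
qconfig = ipex.quantization.default_static_qconfig
prepared = prepare(model, qconfig, example_inputs=inputs)

# Calibration passes over representative data would go here.
prepared(*inputs)

quantised = convert(prepared)
with torch.no_grad():
    traced = torch.jit.trace(quantised, inputs, strict=False)
    traced = torch.jit.freeze(traced)
```

On fourth-generation Xeon processors, the INT8 operators in the frozen model should be dispatched to AMX instructions automatically where the underlying libraries support them.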

Since 2019, Deci and Intel have been working together under the latter’s Ignite programme for early-stage startups to optimise deep learning inference on Intel chips, with inference results improving over generations of Xeon processors.

Deci, which raised $25m in Series B funding in July 2022, is also a member of the Intel Disruptor programme aimed at fostering innovation in AI and data-centric use cases and has collaborated with Intel on multiple MLPerf submissions.

Yonatan Geifman, Deci’s CEO and co-founder, said the latest performance breakthrough marks “another chapter in the Deci-Intel partnership which empowers AI developers to achieve unparalleled accuracy and inference performance with hardware-aware model architectures”.


Asked about the implications of Deci’s breakthrough for the hardware decisions of organisations running AI models, Geifman noted that selecting inference hardware is an intricate process with many factors to consider, such as where the workload is deployed (cloud or edge), cost, power consumption and performance requirements.

“The combination of Deci and Intel Sapphire Rapids opens up new possibilities for computationally demanding deep learning applications to be deployed on CPUs [central processing units] that otherwise wouldn’t be considered, such as deploying large language models like Bert, and delivers a compelling solution from both cost-efficiency and performance perspectives,” he said.

Deci has also used AutoNAC to optimise the performance of AI models running on Nvidia GPUs. In August 2022, Deci joined Nvidia’s Metropolis, an application framework and set of developer tools to help organisations build AI applications.
