Intel Achieves First, Only Full NPU Support in MLPerf Client v0.6 Benchmark

May 5, 2025 Published

Results show Intel Core Ultra Series 2 processors offer unprecedented AI compute performance spanning the CPU, GPU and NPU.

In this article:

What’s New: Intel today announced that it is the only company to achieve full neural processing unit (NPU) support in the newly released MLPerf Client v0.6 benchmark. The result marks the industry’s first standardized evaluation of large language model (LLM) performance on client NPUs. Intel’s measurements of MLPerf Client v0.6 show Intel® Core™ Ultra Series 2 processors can produce output on both the graphics processing unit (GPU) and the NPU much faster than a typical human can read.

“We are proud to lead the industry in enabling full NPU acceleration and industry-leading GPU performance for AI workloads on client PC platforms. This success reflects Intel’s deep hardware-software co-optimization and commitment to democratizing AI for PCs everywhere."
–Daniel Rogers, Intel vice president and general manager of PC Product Marketing

Why It Matters: With its Intel Core Ultra Series 2 processors, Intel is at the forefront of the AI PC evolution, offering unprecedented AI compute performance spanning the central processing unit (CPU), GPU and NPU.

MLPerf Client v0.6 measures four content generation and summarization use cases based on the Llama 2 7B model. Intel demonstrated leading performance across NPU and built-in Intel® Arc™ GPU.

Intel achieved the fastest NPU response time, generating the first word in just 1.09 seconds (first token latency), meaning it begins answering almost immediately after receiving a prompt. It also delivered the highest NPU throughput at 18.55 tokens per second, referring to how quickly the system can generate each additional piece of text, enabling seamless real-time AI interaction. Additionally, compared to competition, Intel showed GPU leadership in time to first token, starting faster than the competition and reinforcing its NPU and GPU end-to-end AI acceleration advantage.

About NPU Benchmarking on MLPerf: Developed collaboratively by MLCommons consortium members — including Intel, AMD, Microsoft, Nvidia and Qualcomm — MLPerf Client v0.6 extends beyond previous GPU-centric tests to now include dedicated NPU benchmarking.

Driven by close collaboration between Intel's NPU hardware and OpenVINO™ software teams, Intel Core Ultra processors remain the only NPU to achieve complete NPU compliance in the final benchmark.

More: Press Kit: Intel Core Ultra Processors (Series 2)

Editor's Note: GPU results and CPU model number were updated after original publication on May 5, 2025.

The Small Print:

Testing Configuration

	AMD	Intel
OEM Platform	ASUS Zenbook S 16	ASUS Zenbook S 14
OEM Model Number	UM5606WA	UX5406SA
CPU Model	AMD Ryzen AI HX 370	Intel® Core™ Ultra 9 288V Processor
BIOS Date	March 21, 2025	February 26, 2025
BIOS Version	UM5606WA.317	UX5406SA.306
Total Memory	32GB LPDDR5, 7500 MHz	32GB LPDDR5, 8533 MHz
Graphics Brand	AMD Radeon 890M	Intel Arc 140V
Storage Memory	1TB	1TB
OS	Windows 11 Pro x64	Windows 11 Pro x64
Power Source	AC	AC
Power Plan	Balanced	Balanced
Power Mode	Best Performance	Best Performance
OEM Power Setting	myASUS: FullSpeed	myASUS: FullSpeed

* All data measured as of April 28, 2025. See press kit for workload and configuration details.

Notices & Disclaimers

Performance varies by use, configuration and other factors. Learn more at intel.com/performance index. Performance results are based on testing as of dates shown in configurations and may not reflect all publicly available updates.  See backup for configuration details.  No product or component can be absolutely secure. Your costs and results may vary.

Artificial Intelligence

Argonne National Laboratory Celebrates Aurora Exascale Computer

Intel, HPE and DOE leaders and researchers gather to celebrate the collaboration, which is already revolutionizing how scientists use artificial intelligence and simulations.

July 18, 2025

Artificial Intelligence

AI PC Global Report

The Innovation Imperative: Unlocking Business Potential with Intel-powered AI PCs

July 17, 2025

Artificial Intelligence

Intel and Weizmann Institute Speed AI with Speculative Decoding Advance

A new method to handle AI acceleration algorithms delivers up to 2.8 times faster LLM inference, enabling vendor-agnostic AI. It is available on Hugging Face.

July 16, 2025

Data Center

From Circuits to Scale: Intel’s Path to Exascale

As the Aurora Supercomputer reaches its ceremonial opening, three of Intel's team members reflect on the unique scope and challenges of the project.

July 15, 2025

Results show Intel Core Ultra Series 2 processors offer unprecedented AI compute performance spanning the CPU, GPU and NPU.

Related Posts

Argonne National Laboratory Celebrates Aurora Exascale Computer

AI PC Global Report

Intel and Weizmann Institute Speed AI with Speculative Decoding Advance

From Circuits to Scale: Intel’s Path to Exascale