Intel Achieves First, Only Full NPU Support in MLPerf Client v0.6 Benchmark

Results show Intel Core Ultra Series 2 processors offer unprecedented AI compute performance spanning the CPU, GPU and NPU.

What’s New: Intel today announced that it is the only company to achieve full neural processing unit (NPU) support in the newly released MLPerf Client v0.6 benchmark. The result marks the industry’s first standardized evaluation of large language model (LLM) performance on client NPUs. Intel’s measurements of MLPerf Client v0.6 show Intel® Core™ Ultra Series 2 processors can produce output on both the graphics processing unit (GPU) and the NPU much faster than a typical human can read.

“We are proud to lead the industry in enabling full NPU acceleration and industry-leading GPU performance for AI workloads on client PC platforms. This success reflects Intel’s deep hardware-software co-optimization and commitment to democratizing AI for PCs everywhere."
–Daniel Rogers, Intel vice president and general manager of PC Product Marketing

Why It Matters: With its Intel Core Ultra Series 2 processors, Intel is at the forefront of the AI PC evolution, offering unprecedented AI compute performance spanning the central processing unit (CPU), GPU and NPU.

MLPerf Client v0.6 measures four content generation and summarization use cases based on the Llama 2 7B model. Intel demonstrated leading performance across NPU and built-in Intel® Arc™ GPU.

Intel achieved the fastest NPU response time, generating the first word in just 1.09 seconds (first token latency), meaning it begins answering almost immediately after receiving a prompt. It also delivered the highest NPU throughput at 18.55 tokens per second, referring to how quickly the system can generate each additional piece of text, enabling seamless real-time AI interaction. Additionally, compared to competition, Intel showed GPU leadership in time to first token, starting faster than the competition and reinforcing its NPU and GPU end-to-end AI acceleration advantage.

About NPU Benchmarking on MLPerf: Developed collaboratively by MLCommons consortium members — including Intel, AMD, Microsoft, Nvidia and Qualcomm — MLPerf Client v0.6 extends beyond previous GPU-centric tests to now include dedicated NPU benchmarking.

Driven by close collaboration between Intel's NPU hardware and OpenVINO™ software teams, Intel Core Ultra processors remain the only NPU to achieve complete NPU compliance in the final benchmark.

More: Press Kit: Intel Core Ultra Processors (Series 2)

Editor's Note: GPU results and CPU model number were updated after original publication on May 5, 2025.

The Small Print:

Testing Configuration

AMD Intel
OEM Platform ASUS Zenbook S 16 ASUS Zenbook S 14
OEM Model Number UM5606WA UX5406SA
CPU Model AMD Ryzen AI HX 370 Intel® Core™ Ultra 9 288V Processor
BIOS Date March 21, 2025 February 26, 2025
BIOS Version UM5606WA.317 UX5406SA.306
Total Memory 32GB LPDDR5, 7500 MHz 32GB LPDDR5, 8533 MHz
Graphics Brand AMD Radeon 890M Intel Arc 140V
Storage Memory 1TB 1TB
OS Windows 11 Pro x64 Windows 11 Pro x64
Power Source AC AC
Power Plan Balanced Balanced
Power Mode Best Performance Best Performance
OEM Power Setting myASUS: FullSpeed myASUS: FullSpeed

* All data measured as of April 28, 2025. See press kit for workload and configuration details.

Notices & Disclaimers

Performance varies by use, configuration and other factors. Learn more at intel.com/performance index. Performance results are based on testing as of dates shown in configurations and may not reflect all publicly available updates.  See backup for configuration details.  No product or component can be absolutely secure. Your costs and results may vary.