Sony Pixel Power calrec Sony

NVIDIA Takes Inference to New Heights Across MLPerf Tests

05/04/2023

MLPerf remains the definitive measurement for AI performance as an independent, third-party benchmark. NVIDIA's AI platform has consistently shown leadership across both training and inference since the inception of MLPerf, including the MLPerf Inference 3.0 benchmarks released today.

Three years ago when we introduced A100, the AI world was dominated by computer vision. Generative AI has arrived, said NVIDIA founder and CEO Jensen Huang.

This is exactly why we built Hopper, specifically optimized for GPT with the Transformer Engine. Today's MLPerf 3.0 highlights Hopper delivering 4x more performance than A100.

The next level of Generative AI requires new AI infrastructure to train large language models with great energy efficiency. Customers are ramping Hopper at scale, building AI infrastructure with tens of thousands of Hopper GPUs connected by NVIDIA NVLink and InfiniBand.

The industry is working hard on new advances in safe and trustworthy Generative AI. Hopper is enabling this essential work, he said.

The latest MLPerf results show NVIDIA taking AI inference to new levels of performance and efficiency from the cloud to the edge.

Specifically, NVIDIA H100 Tensor Core GPUs running in DGX H100 systems delivered the highest performance in every test of AI inference, the job of running neural networks in production. Thanks to software optimizations, the GPUs delivered up to 54% performance gains from their debut in September.

In healthcare, H100 GPUs delivered a 31% performance increase since September on 3D-UNet, the MLPerf benchmark for medical imaging.

Powered by its Transformer Engine, the H100 GPU, based on the Hopper architecture, excelled on BERT, a transformer-based large language model that paved the way for today's broad use of generative AI.

Generative AI lets users quickly create text, images, 3D models and more. It's a capability companies from startups to cloud service providers are rapidly adopting to enable new business models and accelerate existing ones.

Hundreds of millions of people are now using generative AI tools like ChatGPT - also a transformer model - expecting instant responses.

At this iPhone moment of AI, performance on inference is vital. Deep learning is now being deployed nearly everywhere, driving an insatiable need for inference performance from factory floors to online recommendation systems.

L4 GPUs Speed Out of the Gate NVIDIA L4 Tensor Core GPUs made their debut in the MLPerf tests at over 3x the speed of prior-generation T4 GPUs. Packaged in a low-profile form factor, these accelerators are designed to deliver high throughput and low latency in almost any server.

L4 GPUs ran all MLPerf workloads. Thanks to their support for the key FP8 format, their results were particularly stunning on the performance-hungry BERT model.

In addition to stellar AI performance, L4 GPUs deliver up to 10x faster image decode, up to 3.2x faster video processing and over 4x faster graphics and real-time rendering performance.

Announced two weeks ago at GTC, these accelerators are already available from major systems makers and cloud service providers. L4 GPUs are the latest addition to NVIDIA's portfolio of AI inference platforms launched at GTC.

Software, Networks Shine in System Test NVIDIA's full-stack AI platform showed its leadership in a new MLPerf test.

The so-called network-division benchmark streams data to a remote inference server. It reflects the popular scenario of enterprise users running AI jobs in the cloud with data stored behind corporate firewalls.

On BERT, remote NVIDIA DGX A100 systems delivered up to 96% of their maximum local performance, slowed in part because they needed to wait for CPUs to complete some tasks. On the ResNet-50 test for computer vision, handled solely by GPUs, they hit the full 100%.

Both results are thanks, in large part, to NVIDIA Quantum Infiniband networking, NVIDIA ConnectX SmartNICs and software such as NVIDIA GPUDirect.

Orin Shows 3.2x Gains at the Edge Separately, the NVIDIA Jetson AGX Orin system-on-module delivered gains of up to 63% in energy efficiency and 81% in performance compared with its results a year ago. Jetson AGX Orin supplies inference when AI is needed in confined spaces at low power levels, including on systems powered by batteries.

For applications needing even smaller modules drawing less power, the Jetson Orin NX 16G shined in its debut in the benchmarks. It delivered up to 3.2x the performance of the prior-generation Jetson Xavier NX processor.

A Broad NVIDIA AI Ecosystem The MLPerf results show NVIDIA AI is backed by the industry's broadest ecosystem in machine learning.

Ten companies submitted results on the NVIDIA platform in this round. They came from the Microsoft Azure cloud service and system makers including ASUS, Dell Technologies, GIGABYTE, H3C, Lenovo, Nettrix, Supermicro and xFusion.

Their work shows users can get great performance with NVIDIA AI both in the cloud and in servers running in their own data centers.

NVIDIA partners participate in MLPerf because they know it's a valuable tool for customers evaluating AI platforms and vendors. Results in the latest round demonstrate that the performance they deliver today will grow with the NVIDIA platform.

Users Need Versatile Performance NVIDIA AI is the only platform to run all MLPerf inference workloads and scenarios in data center and edge computing. Its versatile performance and efficiency make users the real winners.

Real-world applications typically employ many neural networks of different kinds that often need to deliver answers in real time.

For example, an AI application may need to understand a user's spoken request, classify an image, make a recommendation and then deliver a response as a spoken message in a human-sounding voice. Each step requires a different type
LINK: https://blogs.nvidia.com/blog/2023/04/05/inference-mlperf-ai/...
See more stories from nvidia

Most recent headlines

04/09/2025

Monumental Sports & Entertainment and Dalet Win Prestigious 2025 NAB Show Project of the Year Award

Monumental Sports & Entertainment (MSE), in collaboration with Dalet, has been a...

28/04/2025

From Audio to Video, Spotify's $100 Million Payout Fuels Creator Success Stories

Podcasts have become a cornerstone of the Spotify experience, evolving from a ni...

28/04/2025

Spotify Lends a Helping Hand to NYC Neighbors, the 9/11 Memorial & Museum

Each April, runners and walkers of all stripes gather together in Lower Manhattan for the 9/11 Memorial & Museum 5K. This race remembers those killed on Septemb...

28/04/2025

L3Harris to Present at Three Upcoming Investor Conferences

MELBOURNE, Fla., April 28, 2025 - L3Harris Technologies (NYSE: LHX) Chief Financial Officer and Aerojet Rocketdyne President Ken Bedingfield will present at Bar...

28/04/2025

LiveU Acquires Actus Digital

HACKENSACK, NJ LiveU, a global provider of live IP-video contribution, production and distribution solutions, has signed a definitive agreement to acquire Actus...

28/04/2025

LiveU Signs Definitive Agreement to Acquire the Business...

LiveU, the global leader in live IP-video contribution, production and distribution solutions, has signed a definitive agreement to acquire Actus Digital's ...

28/04/2025

Ikegami to Demonstrate IPX-100 Compact IP Base Station an...

Ikegami Electronics (Europe) will promote the latest additions to its range of broadcast-quality television production equipment at Broadcast Innovation Day (BI...

28/04/2025

Intinor Partners with Zest Technologies to Present Advanc...

Intinor is once again set to collaborate with its UK partner Zest Technologies at MPTS 2025 (Olympia, 14-15 May 2025). Following the recent launch of key update...

28/04/2025

CJP Broadcast Highlights Scalable Studio Solutions at MPT...

CJP Broadcast, the UK-based systems integrator specialising in virtual production and broadcast studio design, installation, commissioning and support, will ret...

28/04/2025

Interra Systems to Bring Comprehensive Suite of Video QC...

The media landscape in the Middle East continues to see a rise in OTT platforms, regional content creation, and expanding viewer expectations, which means the d...

28/04/2025

StreamPort Media Appointed as Official Distributor for Cl...

Clear-Com has announced the appointment of StreamPort Media as its authorized distributor in the Middle East. This partnership will expand access to Clear-Com&...

28/04/2025

nxtedition to Highlight AI Agents and Intuitive Productio...

nxtedition will showcase its unified production platform at CABSAT 2025, featuring advanced AI automation, open-source integrations and a seamless approach to l...

28/04/2025

nxtedition to Showcase Faster, Smarter, Seamless Storytel...

nxtedition will demonstrate its story-first production platform at MPTS 2025, highlighting integrated AI Agents, open-source language models, and frictionless c...

28/04/2025

Keepit and leading B2B platform company Ingram Micro anno...

Keepit has teamed with Ingram Micro, a leading business-to-business platform company for the global technology ecosystem, to expand access to Keepit's vendo...

28/04/2025

Ross Video to Showcase Hyperconverged Live Production Sol...

Ross Video, a global leader in video production technology, is participating in CABSAT 2025, taking place at the Dubai World Trade Center. CABSAT is the leadi...

28/04/2025

Disguise Appoints Media and Entertainment Leader Jake Sto...

Disguise, the leading platform and solutions provider driving the next generation of visual experiences, has appointed Jake Stone as its Senior Vice President o...

28/04/2025

Lightware Brings its Latest USB-C Innovation and Integrat...

Lightware, a global leader in connectivity and signal management solutions, is set to return to InfoComm 2025 with a dynamic showcase of industry-first innovati...

28/04/2025

EASY IP from arkona technologies Wins Futures Best of Sho...

arkona technologies GmbH, provider of cutting-edge IP core infrastructure solutions has announced that its EASY-IP platform is the recipient of Future's Bes...

28/04/2025

TMT Insights Wins Two Project of the Year Awards at NAB...

TMT Insights wrapped up an award-winning NAB 2025 with two of its high-profile customer engagements recognized with Project of the Year Awards: Content Acquisit...

28/04/2025

Fix Format Issues & Enhance Videos 80% Faster with VideoProc AI - Major Update

Fix Format Issues & Enhance Videos 80% Faster with VideoProc AI - Major Update Brie Clayton April 25, 2025 0 Comments VideoProc Converter AI just got ...

28/04/2025

New in Premiere Pro and After Effects at NAB 2025 - Larry Jordan with Kylee Pea of Adobe

New in Premiere Pro and After Effects at NAB 2025 - Larry Jordan with Kylee Pe a...

28/04/2025

The Text Selector Expression is arguably the most elusive Adobe After Effects Feature and yet it's its most powerful Text Feature

The Text Selector Expression is arguably the most elusive Adobe After Effects Fe...

28/04/2025

Master your music for free with a new desktop app from Brainworx

Master your music for free with a new desktop app from Brainworx Brie Clayton April 27, 2025 0 Comments bx_mastering studio promises free streaming-re...

28/04/2025

LiveU Inks Deal to Acquire Actus Digital, Boost Video Monitoring and Analytics Capabilities

LiveU Inks Deal to Acquire Actus Digital, Boost Video Monitoring and Analytics C...

28/04/2025

Small Van, Big Story: Changing the Game in Sports Broadcasting with Obvious C

Small Van, Big Story: Changing the Game in Sports Broadcasting with Obvious C By SVG Staff Monday, April 28, 2025 - 10:51 am Print This Story | Subscribe ...

28/04/2025

SVG Sit-Down: ESPN's Chris Calcinari on New Flagship Mobile Unit, Cloud and REMI Production, Early Prep for Super Bowl LXI

SVG Sit-Down: ESPN's Chris Calcinari on New Flagship Mobile Unit, Cloud and ...

28/04/2025

A Swiss Soccer Summer: Previewing UEFA Women's Euros 2025 with BBC Sport and Sunset+Vine

A Swiss Soccer Summer: Previewing UEFA Women's Euros 2025 with BBC Sport and...

28/04/2025

New research from Sky Sports looks at the role of Womens sport fandom in the future of sports

Monday 28 April 2025 New research from Sky Sports, released today, shows that w...

28/04/2025

Official trailer released for Sky Documentaries three-part series, Bibaa & Nicole: Murder in the Park, airing 11 May

Monday 28 April 2025 To view this content, please enable our use of cookies. To...

28/04/2025

Victory lap for A League of Their Own

After 20 Legendary Seasons Sky is Hanging Up The Boots of its BAFTA-winning Sports Show. Production of farewell series tees off this summerMonday 28 April 2025 ...

28/04/2025

Netflix Celebrates the Creative Tapestry of APAC Films at Tokyo Showcase

Back to All News Netflix Celebrates the Creative Tapestry of APAC Films at Tokyo Showcase Entertainment 28 April 2025 GlobalJapanSouth KoreaIndiaThailandInd...

28/04/2025

Get Ready for Netflix Tudum 2025: The Live Event! Watch the Trailer for Our Must-See Celebration

Back to All News Get Ready for Netflix Tudum 2025: The Live Event! Watch the Tr...

28/04/2025

NVIDIA Brings Cybersecurity to Every AI Factory

As enterprises increasingly adopt AI, securing AI factories - where complex, agentic workflows are executed - has never been more critical. NVIDIA is bringing ...

28/04/2025

How Agentic AI Enables the Next Leap in Cybersecurity

Agentic AI is redefining the cybersecurity landscape - introducing new opportunities that demand rethinking how to secure AI while offering the keys to addressi...

28/04/2025

Oracle Cloud Infrastructure Deploys Thousands of NVIDIA Blackwell GPUs for Agentic AI and Reasoning Models

Oracle has stood up and optimized its first wave of liquid-cooled NVIDIA GB200 N...

27/04/2025

A Tribute to Bruce Logan, Written By Steve Weiss

It's with deep regret that we share the passing of our dear friend Bruce Logan, ASC. Bruce was not just a collaborator-he was family. He worked with us at ...

27/04/2025

KAULITZ & KAULITZ - Launch date and first look for season 2

Back to All News KAULITZ & KAULITZ - Launch date and first look for season 2 Entertainment 27 April 2025 GlobalGermany Link copied to clipboard KAULITZ & ...

27/04/2025

'Senna' Wins Best Series Creator Category At The 2025 PLATINO Awards

Back to All News Senna Wins Best Series Creator Category At The 2025 PLATINO Awards Entertainment 27 April 2025 GlobalBrazil Link copied to clipboard This...

27/04/2025

Masterclass With Creative Team Behind 'Adolescence' Takes Filmmakers and Emerging Talent Behind the Scenes of Hit Show

Back to All News Masterclass With Creative Team Behind Adolescence Takes Filmma...

26/04/2025

Samsung Ads Launches New Interactive Ad Format

NEW YORK Samsung Ads has debuted a new interactive advertising format, Creative Canvas, that helps automate and deliver interactive ads....

26/04/2025

Sinclair Names Vincent J. Sollecito VP/GM of WPEC

WEST PALM BEACH, Fla. Sinclair has appointed Vincent J. Sollecito vice president and general manager of WPEC, serving the West Palm Beach, Florida market....

26/04/2025

Comcast Technology Solutions, AD-ID Join Forces to Advance Ad Standards

NEW YORK and DENVER AD-ID and Comcast Technology Solutions (CTS) have announced that they are working together to promote the adoption of industry standards in ...

26/04/2025

Cobalt Scores a Trifecta of Awards at NAB 2025

Cobalt Scores a Trifecta of Awards at NAB 2025 Brie Clayton April 25, 2025 0 Comments Company adds another Best of Show and two Product of the Year tr...

26/04/2025

FilmLight Colour Awards welcomes 2025 entries

FilmLight Colour Awards welcomes 2025 entries Brie Clayton April 25, 2025 0 Comments Entries open from 1 May 31 July to colourists on any grading pl...

26/04/2025

Blackmagic's Latest Products - Larry Jordan Guest Spots with Dan May at NAB Las Vegas 2025

Blackmagic's Latest Products - Larry Jordan Guest Spots with Dan May at NAB ...

25/04/2025

The Legend of Ochi Takes Families on an Adventure

Emily Watson, Isaiah Saxon, Helena Zengel, and Finn Wolfhard at The Legend of Ochi premiere (photo by Soul Brother/Shutterstock for Sundance Film Festival)...

25/04/2025

Interoperability and Networked Electronic Warfare: Critical for the Modern Warfighter

Using its CORVUS portfolio, L3Harris created a highly adaptable battle network t...

25/04/2025

Agile Content and Tiscali team up to launch the tv service Linkem My TI-VI in Italy Agile TV

Agile Content joins forces with Tiscali to introduce a new platform in Italy, of...

25/04/2025

Agile Content is Now AgileTV Agile TV

The company positions Agile TV as the commercial brand focused on delivering TVaaS solutions for operators and media. The chosen tagline TVaaS your way enca...

25/04/2025

Agile TV, United Teleports, and BNS Launch a More Profitable, Hassle-Free TV Service Agile TV

A Fully Managed Solution That Maximizes Revenue for North American Operators Whi...