
Speed Demon: NVIDIA Blackwell Takes Pole Position in Latest MLPerf Inference Results

02/04/2025

In the latest MLPerf Inference v5.0 benchmarks, which reflect some of the most challenging inference scenarios, the NVIDIA Blackwell platform set records - and marked NVIDIA's first MLPerf submission using the NVIDIA GB200 NVL72 system, a rack-scale solution designed for AI reasoning.

Delivering on the promise of cutting-edge AI takes a new kind of compute infrastructure, called AI factories. Unlike traditional data centers, AI factories do more than store and process data - they manufacture intelligence at scale by transforming raw data into real-time insights. The goal for AI factories is simple: deliver accurate answers to queries quickly, at the lowest cost and to as many users as possible.

The complexity of pulling this off is significant and takes place behind the scenes. As AI models grow to billions and trillions of parameters to deliver smarter replies, the compute required to generate each token increases. This requirement reduces the number of tokens that an AI factory can generate and increases cost per token. Keeping inference throughput high and cost per token low requires rapid innovation across every layer of the technology stack, spanning silicon, network systems and software.
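The link between throughput and cost per token can be made concrete with a little arithmetic. The sketch below is illustrative only: the hourly cost and the token rates are hypothetical placeholders, not figures from the MLPerf results, and the function name is invented for this example.

```python
def cost_per_million_tokens(hourly_cost_usd: float, tokens_per_second: float) -> float:
    """Return the cost in USD of generating one million tokens.

    Both inputs are hypothetical; real deployments have many more cost terms.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    tokens_per_hour = tokens_per_second * 3600
    return hourly_cost_usd * 1_000_000 / tokens_per_hour


# Hypothetical system: 36 USD per hour, 10,000 tokens per second.
baseline = cost_per_million_tokens(36.0, 10_000.0)

# Same system serving a larger model that needs twice the compute per token,
# so throughput is assumed to halve while the hourly cost stays fixed.
larger_model = cost_per_million_tokens(36.0, 5_000.0)

print(f"baseline:     {baseline:.2f} USD per million tokens")
print(f"larger model: {larger_model:.2f} USD per million tokens")
```

With the hourly cost held fixed, halving throughput doubles the cost per token, which is why throughput gains translate directly into lower serving cost.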

The latest updates to MLPerf Inference, a peer-reviewed industry benchmark of inference performance, include the addition of Llama 3.1 405B, one of the largest and most challenging-to-run open-weight models. The new Llama 2 70B Interactive benchmark features much stricter latency requirements compared with the original Llama 2 70B benchmark, better reflecting the constraints of production deployments in delivering the best possible user experiences.

In addition to the Blackwell platform, the NVIDIA Hopper platform demonstrated exceptional performance across the board, with performance increasing significantly over the last year on Llama 2 70B thanks to full-stack optimizations.

NVIDIA Blackwell Sets New Records

The GB200 NVL72 system - connecting 72 NVIDIA Blackwell GPUs to act as a single, massive GPU - delivered up to 30x higher throughput on the Llama 3.1 405B benchmark over the NVIDIA H200 NVL8 submission this round. This feat was achieved through more than triple the performance per GPU and a 9x larger NVIDIA NVLink interconnect domain.
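The two factors quoted above can be checked against each other. The sketch below assumes the system-level gain splits cleanly into a GPU-count ratio times a per-GPU gain, which is a simplification, and it treats "up to 30x" as an upper bound rather than a measured average. The GPU counts (72 and 8) are read from the system names.

```python
# GPU counts taken from the system names in the article.
GB200_NVL72_GPUS = 72
H200_NVL8_GPUS = 8

# "Up to 30x" system-level throughput gain quoted in the article (upper bound).
SYSTEM_SPEEDUP = 30.0

# Ratio of NVLink domain sizes.
domain_ratio = GB200_NVL72_GPUS / H200_NVL8_GPUS

# Assumption: system speedup = domain ratio * per-GPU speedup.
per_gpu_speedup = SYSTEM_SPEEDUP / domain_ratio

print(f"NVLink domain ratio: {domain_ratio:.0f}x")
print(f"Implied per-GPU speedup: {per_gpu_speedup:.2f}x")
```

Under that assumption the implied per-GPU gain is a little over 3x, which is consistent with the article's "more than triple."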

While many companies run MLPerf benchmarks on their hardware to gauge performance, only NVIDIA and its partners submitted and published results on the Llama 3.1 405B benchmark.

Production inference deployments often have latency constraints on two key metrics. The first is time to first token (TTFT), or how long it takes for a user to begin seeing a response to a query given to a large language model. The second is time per output token (TPOT), or how quickly tokens are delivered to the user.
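As a rough illustration of these two metrics, the sketch below computes them from the arrival times of output tokens for a single request. The function and class names are hypothetical, and averaging TPOT over the tokens after the first is one common convention; the exact measurement rules used by MLPerf may differ.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class LatencyMetrics:
    ttft_s: float  # time to first token, in seconds
    tpot_s: float  # mean time per output token after the first, in seconds


def latency_metrics(request_time_s: float, token_times_s: Sequence[float]) -> LatencyMetrics:
    """Compute TTFT and TPOT for one request from token arrival timestamps."""
    if not token_times_s:
        raise ValueError("at least one output token is required")
    ttft = token_times_s[0] - request_time_s
    if len(token_times_s) == 1:
        # With a single token there is no inter-token gap to average.
        tpot = 0.0
    else:
        tpot = (token_times_s[-1] - token_times_s[0]) / (len(token_times_s) - 1)
    return LatencyMetrics(ttft_s=ttft, tpot_s=tpot)


# Made-up timestamps: request sent at t=0, five tokens arrive afterwards.
example = latency_metrics(0.0, [0.5, 0.75, 1.0, 1.25, 1.5])
print(example)
```

In this made-up trace the user waits 0.5 seconds for the first token and then receives one token every 0.25 seconds.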

The new Llama 2 70B Interactive benchmark has a 5x shorter TPOT and 4.4x lower TTFT - modeling a more responsive user experience. On this test, NVIDIA's submission using an NVIDIA DGX B200 system with eight Blackwell GPUs tripled performance over using eight NVIDIA H200 GPUs, setting a high bar for this more challenging version of the Llama 2 70B benchmark.

Combining the Blackwell architecture and its optimized software stack delivers new levels of inference performance, paving the way for AI factories to deliver higher intelligence, increased throughput and faster token rates.

NVIDIA Hopper AI Factory Value Continues Increasing

The NVIDIA Hopper architecture, introduced in 2022, powers many of today's AI inference factories, and continues to power model training. Through ongoing software optimization, NVIDIA increases the throughput of Hopper-based AI factories, leading to greater value.

On the Llama 2 70B benchmark, first introduced a year ago in MLPerf Inference v4.0, H100 GPU throughput has increased by 1.5x. The H200 GPU, based on the same Hopper GPU architecture with larger and faster GPU memory, extends that increase to 1.6x.

Hopper also ran every benchmark, including the newly added Llama 3.1 405B, Llama 2 70B Interactive and graph neural network tests. This versatility means Hopper can run a wide range of workloads and keep pace as models and usage scenarios grow more challenging.

It Takes an Ecosystem

This MLPerf round, 15 partners submitted stellar results on the NVIDIA platform, including ASUS, Cisco, CoreWeave, Dell Technologies, Fujitsu, Giga Computing, Google Cloud, Hewlett Packard Enterprise, Lambda, Lenovo, Oracle Cloud Infrastructure, Quanta Cloud Technology, Supermicro, Sustainable Metal Cloud and VMware.

The breadth of submissions reflects the reach of the NVIDIA platform, which is available across all cloud service providers and server makers worldwide.

MLCommons' work to continuously evolve the MLPerf Inference benchmark suite to keep pace with the latest AI developments and provide the ecosystem with rigorous, peer-reviewed performance data is vital to helping IT decision makers select optimal AI infrastructure.

Learn more about MLPerf.

Images and video taken at an Equinix data center in Silicon Valley.
LINK: https://blogs.nvidia.com/blog/blackwell-mlperf-inference/...
