Peak Training: Blackwell Delivers Next-Level MLPerf Training Performance
13/11/2024
In MLPerf Training 4.1 industry benchmarks, the NVIDIA Blackwell platform delivered impressive results on workloads across all tests, with up to 2.2x more performance per GPU on LLM benchmarks, including Llama 2 70B fine-tuning and GPT-3 175B pretraining.
In addition, NVIDIA's submissions on the NVIDIA Hopper platform continued to hold at-scale records on all benchmarks, including a submission with 11,616 Hopper GPUs on the GPT-3 175B benchmark.
Leaps and Bounds With Blackwell
The first Blackwell training submission to the MLCommons Consortium, which creates standardized, unbiased and rigorously peer-reviewed testing for industry participants, highlights how the architecture is advancing generative AI training performance.
For instance, the architecture includes new kernels that make more efficient use of Tensor Cores. Kernels are optimized, purpose-built math operations like matrix-multiplies that are at the heart of many deep learning algorithms.
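To make the term concrete, the following is a minimal, unoptimized sketch of the matrix-multiply operation that such kernels compute. It is illustrative only: it is plain Python rather than GPU code, it does not use Tensor Cores, and the function name `matmul` is a hypothetical helper, not part of any NVIDIA library.

```python
def matmul(a, b):
    """Multiply matrix a (m x k) by matrix b (k x n); both are lists of rows."""
    k = len(b)
    n = len(b[0])
    if any(len(row) != k for row in a):
        raise ValueError("inner dimensions must match")
    # Each output element is the dot product of one row of a and one column of b.
    return [
        [sum(row[p] * b[p][j] for p in range(k)) for j in range(n)]
        for row in a
    ]


if __name__ == "__main__":
    print(matmul([[1, 2], [3, 4]], [[5, 6], [7, 8]]))  # [[19, 22], [43, 50]]
```

An optimized GPU kernel produces the same result but splits the work into tiles that are computed in parallel, which is where efficient use of specialized hardware matters.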
Blackwell's higher per-GPU compute throughput and significantly larger and faster high-bandwidth memory allow it to run the GPT-3 175B benchmark on fewer GPUs while achieving excellent per-GPU performance.
Taking advantage of larger, higher-bandwidth HBM3e memory, just 64 Blackwell GPUs were able to run the GPT-3 LLM benchmark without compromising per-GPU performance. The same benchmark run using Hopper needed 256 GPUs, four times as many.
The Blackwell training results follow an earlier submission to MLPerf Inference 4.1, where Blackwell delivered up to 4x more LLM inference performance versus the Hopper generation. Taking advantage of the Blackwell architecture's FP4 precision, along with the NVIDIA QUASAR Quantization System, the submission revealed powerful performance while meeting the benchmark's accuracy requirements.
Relentless Optimization
NVIDIA platforms undergo continuous software development, racking up performance and feature improvements in training and inference for a wide variety of frameworks, models and applications.
In this round of MLPerf training submissions, Hopper delivered a 1.3x improvement on GPT-3 175B per-GPU training performance since the introduction of the benchmark.
NVIDIA also submitted large-scale results on the GPT-3 175B benchmark using 11,616 Hopper GPUs connected with NVIDIA NVLink and NVSwitch high-bandwidth GPU-to-GPU communication and NVIDIA Quantum-2 InfiniBand networking.
NVIDIA Hopper GPUs have more than tripled scale and performance on the GPT-3 175B benchmark since last year. In addition, on the Llama 2 70B LoRA fine-tuning benchmark, NVIDIA increased performance by 26% using the same number of Hopper GPUs, reflecting continued software enhancements.
NVIDIA's ongoing work on optimizing its accelerated computing platforms enables continued improvements in MLPerf test results, driving performance up in containerized software, bringing more powerful computing to partners and customers on existing platforms and delivering more return on their platform investment.
Partnering Up
NVIDIA partners, including system makers and cloud service providers like ASUSTek, Azure, Cisco, Dell, Fujitsu, Giga Computing, Lambda Labs, Lenovo, Oracle Cloud, Quanta Cloud Technology and Supermicro, also submitted impressive results to MLPerf in this latest round.
A founding member of MLCommons, NVIDIA sees the role of industry-standard benchmarks and benchmarking best practices in AI computing as vital. With access to peer-reviewed, streamlined comparisons of AI and HPC platforms, companies can keep pace with the latest AI computing innovations and access crucial data that can help guide important platform investment decisions.
Learn more about the latest MLPerf results on the NVIDIA Technical Blog.