How GeForce RTX 50 Series GPUs Are Built to Supercharge Generative AI on PCs
05/02/2025
These GPUs were built to accelerate the latest generative AI workloads, delivering up to 3,352 AI trillion operations per second (TOPS), enabling incredible experiences for AI enthusiasts, gamers, creators and developers.
To help AI developers and enthusiasts harness these capabilities, NVIDIA at the CES trade show last month unveiled NVIDIA NIM and AI Blueprints for RTX. NVIDIA NIM microservices are prepackaged generative AI models that let developers and enthusiasts easily get started with generative AI, iterate quickly and harness the power of RTX for accelerating AI on Windows PCs. NVIDIA AI Blueprints are reference projects that show developers how to use NIM microservices to build the next generation of AI experiences.
NIM and AI Blueprints are optimized for GeForce RTX 50 Series GPUs. These technologies work together seamlessly to help developers and enthusiasts build, iterate and deliver cutting-edge AI experiences on AI PCs.
NVIDIA NIM Accelerates Generative AI on PCs While AI model development is rapidly advancing, bringing these innovations to PCs remains a challenge for many people. Models posted on platforms like Hugging Face must be curated, adapted and quantized to run on PC. They also need to be integrated into new AI application programming interfaces (APIs) to ensure compatibility with existing tools, and converted to optimized inference backends for peak performance.
NVIDIA NIM microservices for RTX AI PCs and workstations can ease the complexity of this process by providing access to community-driven and NVIDIA-developed AI models. These microservices are easy to download and connect to via industry-standard APIs and span the key modalities essential for AI PCs. They are also compatible with a wide range of AI tools and offer flexible deployment options, whether on PCs, in data centers, or in the cloud.
NIM microservices include everything needed to run optimized models on PCs with RTX GPUs, including prebuilt engines for specific GPUs, the NVIDIA TensorRT software development kit (SDK), the open-source NVIDIA TensorRT-LLM library for accelerated inference using Tensor Cores, and more.
Microsoft and NVIDIA worked together to enable NIM microservices and AI Blueprints for RTX in Windows Subsystem for Linux (WSL2). With WSL2, the same AI containers that run on data center GPUs can now run efficiently on RTX PCs, making it easier for developers to build, test and deploy AI models across platforms.
In addition, NIM and AI Blueprints harness key innovations of the Blackwell architecture that the GeForce RTX 50 series is built on, including fifth-generation Tensor Cores and support for FP4 precision.
Tensor Cores Drive Next-Gen AI Performance AI calculations are incredibly demanding and require vast amounts of processing power. Whether generating images and videos or understanding language and making real-time decisions, AI models rely on hundreds of trillions of mathematical operations to be completed every second. To keep up, computers need specialized hardware built specifically for AI.
NVIDIA GeForce RTX desktop GPUs deliver up to 3,352 AI TOPS for unmatched speed and efficiency in AI-powered workflows. In 2018, NVIDIA GeForce RTX GPUs changed the game by introducing Tensor Cores - dedicated AI processors designed to handle these intensive workloads. Unlike traditional computing cores, Tensor Cores are built to accelerate AI by performing calculations faster and more efficiently. This breakthrough helped bring AI-powered gaming, creative tools and productivity applications into the mainstream.
Blackwell architecture takes AI acceleration to the next level. The fifth-generation Tensor Cores in Blackwell GPUs deliver up to 3,352 AI TOPS to handle even more demanding AI tasks and simultaneously run multiple AI models. This means faster AI-driven experiences, from real-time rendering to intelligent assistants, that pave the way for greater innovation in gaming, content creation and beyond.
FP4 - Smaller Models, Bigger Performance Another way to optimize AI performance is through quantization, a technique that reduces model sizes, enabling the models to run faster while reducing the memory requirements.
Enter FP4 - an advanced quantization format that allows AI models to run faster and leaner without compromising output quality. Compared with FP16, it reduces model size by up to 60% and more than doubles performance, with minimal degradation.
For example, Black Forest Labs' FLUX.1 [dev] model at FP16 requires over 23GB of VRAM, meaning it can only be supported by the GeForce RTX 4090 and professional GPUs. With FP4, FLUX.1 [dev] requires less than 10GB, so it can run locally on more GeForce RTX GPUs.
On a GeForce RTX 4090 with FP16, the FLUX.1 [dev] model can generate images in 15 seconds with just 30 steps. With a GeForce RTX 5090 with FP4, images can be generated in just over five seconds.
FP4 is natively supported by the Blackwell architecture, making it easier than ever to deploy high-performance AI on local PCs. It's also integrated into NIM microservices, effectively optimizing models that were previously difficult to quantize. By enabling more efficient AI processing, FP4 helps to bring faster, smarter AI experiences for content creation.
AI Blueprints Power Advanced AI Workflows on RTX PCs NVIDIA AI Blueprints, built on NIM microservices, provide prepackaged, optimized reference implementations that make it easier to develop advanced AI-powered projects - whether for digital humans, podcast generators or application assistants.
At CES, NVIDIA demonstrated PDF to Podcast
LINK: | https://blogs.nvidia.com/blog/rtx-ai-garage-blackwell-nim-blueprints-p... |
See more stories from nvidia |
Most recent headlines
05/02/2025
Inside IBC's innovation boom: what's powering the future of media?
Speaking exclusively to TVBEurope, four of its experts weigh in on the IBCs role in fostering innovation, the technological shifts on the horizon, and why sport...
05/02/2025
Meet the director of product marketing
Vincent Noyer, director of product marketing at LYNX Technik, tells TVBEurope how remaining engaged and proactive leads to opportunities for growth By Matthew ...
05/02/2025
ESPN Viewing Hits Decade-Long Highs in January
ESPN is reporting that its NFL playoffs and College Football Playoff games in January boosted audiences to levels not seen at the network in years....
05/02/2025
Roku Remains Top U.S. Streaming Device
A new study from Pixalate indicates that Roku remains by far the dominant player among streaming devices in North America where its market share was more than d...
05/02/2025
Panasonic To Ship New 4K 60p 10-Bit Camcorders In March
NEWARK, N.J. Panasonic next month will release new 4K 60p 10-bit professional camcorder models, including the AG-CX20, HC-X1200 and HC-X2100 for video productio...
05/02/2025
Tegna Shuts Down National Fact-Checking Operation
WASHINGTON Tegna has shut down Verify, the station group's national fact-checking operation, laying off around 18 journalists, producers, researchers and ot...
05/02/2025
FCC Chair Brendan Carr Fills More Key Staff Positions
WASHINGTON, D.C. Federal Communications Commission chair Brendan Carr continues to build out his leadership team with the announcement of more staff appointment...
05/02/2025
Sanctuary Pictures Unveils Punk-Horror Feature Penny Lane Is Dead
04 02 2025 - Media release Sanctuary Pictures Unveils Punk-Horror Feature Penny Lane Is Dead Writer/Director of Penny Lane Is Dead, Mia'Kate Russell Sanc...
05/02/2025
Faculty Notes: Fall 2024
Faculty Notes: Fall 2024 Recent accomplishments, releases, and events by Berklee faculty. February 4, 2025 Professor Tomo Fujita played the John Coltrane In...
05/02/2025
Other Voices makes its return to RT this Spring with an incredible lineup
Laura Marling, Lisa O'Neill, James Dean Bradfield, Jacob Alon, Bashy, Morgana and more to feature in Other Voices Series 23 this Spring RT 2 & RT Player |...
05/02/2025
AI Pays Off: Survey Reveals Financial Industry's Latest Technological Trends
The financial services industry is reaching an important milestone with AI, as organizations move beyond testing and experimentation to successful AI implementa...
05/02/2025
How GeForce RTX 50 Series GPUs Are Built to Supercharge Generative AI on PCs
NVIDIA's GeForce RTX 5090 and 5080 GPUs - which are based on the groundbreaking NVIDIA Blackwell architecture -offer up to 8x faster frame rates with NVIDIA...
04/02/2025
Spotify Reports Fourth Quarter 2024 Earnings
Today, we announced our fourth quarter 2024 earnings, closing Q4 stronger than ever by outperforming across key metrics and celebrating our first full year of p...
04/02/2025
Spotify rapporterar intkter fr fjrde kvartalet 2024
Idag rapporterar vi int kter f r fj rde kvartalet 2024. Vi avslutade Q4 starkare n n gonsin genom att vertr ffa f rv ntningarna p v ra nyckeltal och kan d rm...
04/02/2025
SGL Carbon opts for green electricity at its German sites
As a technology-based company and one of the worlds leading companies in the development and production of carbon-based solutions, SGL Carbon opts for innovativ...
04/02/2025
ST Engineering iDirect Names Sridhar Kuppanna as Chief Technology Officer
Ground segment technology innovator appoints new CTO to execute bold technological vision Herndon, Va., February 4, 2025 ST Engineering iDirect, global leade...
04/02/2025
L3Harris Signs Multi-Year Pilot Training Agreement With Thai Airways
L3Harris has signed a two-year agreement with Thai Airways International to provide training service on its A320 Full Flight Simulator (FFS). This significant a...
04/02/2025
US Air Force Completes First Flight of L3Harris Viper Shield Electronic Warfare System
L3Harris' all-digital electronic warfare suite, Viper Shield , completed its...
04/02/2025
Radio Botswana chooses Calrec's IP-native Type R mixing system
The shift from analogue to IP was driven by a desire for greater flexibility in our operations. IP simplifies connectivity, reduces the physical footprint of th...
04/02/2025
Simplifying Gray Media News Operations with Calrec's Type R
Streamline, standardise and save: how Gray Media has simplified news operations At TVNewsCheck's News Tech Forum 2024, Gray Media's Peter Gogas and Calr...
04/02/2025
Bending Spoons closes $233 million acquisition of Brightcove
Boston, MA-February 4, 2025 | Bending Spoons, the Italy-based technology company, completed its previously announced acquisition of US-based streaming technolog...
04/02/2025
PARAMOUNT AND NIELSEN SIGN MULTI-YEAR MEASUREMENT AND ANALYTICS DEAL ACROSS PARAMOUNT'S LEADING BROADCAST, CABLE AND STREAMING PLATFORMS
Nielsen Reports Major Recent Ratings Milestones for CBS and Paramount Series N...
04/02/2025
Grup Mediapro to Collaborate with Google Cloud on Gen AI
BARCELONA Grup Mediapro and Google Cloud have expanded their collaboration to create an innovation lab focused on generative AI to develop solutions for the med...
04/02/2025
EditShare Receives SOCE 2 Type II Certification
WATERTOWN, Mass. EditShare this week said it has received SOC 2 Type II certification, an independently audited evaluation of an organization's information ...
04/02/2025
Executive Creative Director Halle Petro Named Partner of Sonic Union
Executive Creative Director Halle Petro Named Partner of Sonic Union Brie Clayton February 4, 2025 0 Comments Sonic Union is excited to announce Execu...
04/02/2025
Blackmagic Design Announces Blackmagic Camera for Android 2.0 Update
Blackmagic Design Announces Blackmagic Camera for Android 2.0 Update Brie Clayton February 4, 2025 0 Comments New update adds support for Xiaomi Pad 6...
04/02/2025
CETA Software Launches Artist Access: The Time-Tracking Tool for Creative Teams
CETA Software Launches Artist Access: The Time-Tracking Tool for Creative Teams Brie Clayton February 4, 2025 0 Comments CETA Software, creators of p...
04/02/2025
OWC Announces General Availability Launch of OWC Dock Ejector 2.0
OWC Announces General Availability Launch of OWC Dock Ejector 2.0 Brie Clayton February 4, 2025 0 Comments The Ultimate Tool for Efficiently and Safel...
04/02/2025
Colourist Claudio Del Bravo on grading Queer
Explaining the process to TVBEurope, Del Bravo said the films look was inspired by the Technicolor three-strip' process, evoking the rich colours of early ...
04/02/2025
Paramount, Nielsen Sign Multiyear Measurement and Analytics Deal
NEW YORK Paramount Global and Nielsen have inked a new, multiyear deal that will provide measurement for all of the company's platforms, including national ...
04/02/2025
2d Animated Short Concerning a Project for Schools
2d Animated Short Concerning a Project for Schools Brie Clayton February 3, 2025 0 Comments 2d animated short concerning a project for schools Febru...
04/02/2025
Step by step guide to using 3D Models in After Effects
Step by step guide to using 3D Models in After Effects Graham Quince February 3, 2025 0 Comments Since 2024, Adobe After Effects has had native suppor...
04/02/2025
Powerful Premiere Automation with new Excalibur Update
Powerful Premiere Automation with new Excalibur Update Colin Smith February 3, 2025 0 Comments This tutorial takes you through the new update for auto...
04/02/2025
Cinematography of A Complete Unknown: Shooting 12,800 iso Sony Venice 2 to create a 1960's era film
Cinematography of A Complete Unknown: Shooting 12,800 iso Sony Venice 2 to creat...
04/02/2025
DIY to DA: Ela Minus Breaks Through
DIY to D A: Ela Minus Breaks Through The electronic artist and producer tells Rolling Stone about her new album, D A, and how shes forged a career outside the...
04/02/2025
NVIDIA Blackwell Now Generally Available in the Cloud
AI reasoning models and agents are set to transform industries, but delivering their full potential at scale requires massive compute and optimized software. Th...
04/02/2025
The Future of Football? Technology and Entertainment Merge in the Kings World Cup Nations
The future of football? Technology and entertainment merge in the Kings World Cu...
04/02/2025
Virtual Production and AR Graphics: Demystifying the Tools, Technologies, and Trends
Virtual Production and AR Graphics: Demystifying the Tools, Technologies, and Tr...
04/02/2025
SVG All-Stars: Russell Fink, Senior Director, Programming and Content Analytics, SNY
SVG All-Stars: Russell Fink, Senior Director, Programming and Content Analytics,...
04/02/2025
SVG New Sponsor Spotlight: farmerswife's Jodi Clifford on Organizing Your Productions Like a Professional
SVG New Sponsor Spotlight: farmerswife's Jodi Clifford on Organizing Your Pr...
04/02/2025
EA Acquires TRACAB Technologies as It Looks to Move Beyond Games
EA Acquires TRACAB Technologies as It Looks to Move Beyond Games EA believes TRACABs sports tracking/analysis technology will help to make the EA SPORTS App the...
04/02/2025
Kingdom Come: Alamiya Media on Bringing the Supercoppa Italiana and Supercopa de Espaa to Saudi Arabia
Kingdom come: Alamiya Media on bringing the Supercoppa Italiana and Supercopa de...
04/02/2025
Alamiya Media at 50: Preparing for Rapid Change, an International Broadcast Center and the FIFA World Cup
Alamiya Media at 50: Preparing for rapid change, an international broadcast cent...
04/02/2025
An update on our TV and broadband prices
An update on our TV and broadband pricesTuesday 4 February 2025 An update on our TV and broadband prices Devesh Raj, Chief Operating Officer, Sky This April,...
04/02/2025
Sky extends partnership with the PDC to remain the home of darts until 2030
Sky extends partnership with the PDC to remain the home of darts until 2030Tuesday 4 February 2025 Following another record-breaking PDC World Darts Championsh...
04/02/2025
Frankfurt is the world's first airport to regularly use walk-through scanners from Rohde & Schwarz for passengers
Frankfurt is the world's first airport to regularly use walk-through scanner...
04/02/2025
Riedel Unveils Next Generation of StageLink Edge Devices
Wuppertal February 4, 2025 Riedel Unveils Next Generation of StageLink Edge DevicesRiedel Communications today announced the launch of its StageLink family of...
04/02/2025
Clara Galle, Claudia Salas and Paula Usero Star in 'That Night,' the New Netflix Series Based on the Bestselling Novel by Gillian McAllister
Back to All News Clara Galle, Claudia Salas and Paula Usero Star in That Night,...
04/02/2025
Fox Corporation Reports Second Quarter Fiscal 2025 Financial Results
Fox Corporation Reports Second Quarter Fiscal 2025 Financial Results NEW YORK, NY, February 4, 2025 - Fox Corporation (Nasdaq: FOXA, FOX; FOX or the Compan...
04/02/2025
Introducing our fully digital, true diversity wideband wireless mic solution
DPA Microphones is moving into the wireless market with the release of its new N-Series Digital Wireless System at ISE 2025 (Stand 7P600). A fully digital, true...