How GeForce RTX 50 Series GPUs Are Built to Supercharge Generative AI on PCs
05/02/2025
These GPUs were built to accelerate the latest generative AI workloads, delivering up to 3,352 AI trillion operations per second (TOPS), enabling incredible experiences for AI enthusiasts, gamers, creators and developers.
To help AI developers and enthusiasts harness these capabilities, NVIDIA at the CES trade show last month unveiled NVIDIA NIM and AI Blueprints for RTX. NVIDIA NIM microservices are prepackaged generative AI models that let developers and enthusiasts easily get started with generative AI, iterate quickly and harness the power of RTX for accelerating AI on Windows PCs. NVIDIA AI Blueprints are reference projects that show developers how to use NIM microservices to build the next generation of AI experiences.
NIM and AI Blueprints are optimized for GeForce RTX 50 Series GPUs. These technologies work together seamlessly to help developers and enthusiasts build, iterate and deliver cutting-edge AI experiences on AI PCs.
NVIDIA NIM Accelerates Generative AI on PCs While AI model development is rapidly advancing, bringing these innovations to PCs remains a challenge for many people. Models posted on platforms like Hugging Face must be curated, adapted and quantized to run on PC. They also need to be integrated into new AI application programming interfaces (APIs) to ensure compatibility with existing tools, and converted to optimized inference backends for peak performance.
NVIDIA NIM microservices for RTX AI PCs and workstations can ease the complexity of this process by providing access to community-driven and NVIDIA-developed AI models. These microservices are easy to download and connect to via industry-standard APIs and span the key modalities essential for AI PCs. They are also compatible with a wide range of AI tools and offer flexible deployment options, whether on PCs, in data centers, or in the cloud.
NIM microservices include everything needed to run optimized models on PCs with RTX GPUs, including prebuilt engines for specific GPUs, the NVIDIA TensorRT software development kit (SDK), the open-source NVIDIA TensorRT-LLM library for accelerated inference using Tensor Cores, and more.
Microsoft and NVIDIA worked together to enable NIM microservices and AI Blueprints for RTX in Windows Subsystem for Linux (WSL2). With WSL2, the same AI containers that run on data center GPUs can now run efficiently on RTX PCs, making it easier for developers to build, test and deploy AI models across platforms.
In addition, NIM and AI Blueprints harness key innovations of the Blackwell architecture that the GeForce RTX 50 series is built on, including fifth-generation Tensor Cores and support for FP4 precision.
Tensor Cores Drive Next-Gen AI Performance AI calculations are incredibly demanding and require vast amounts of processing power. Whether generating images and videos or understanding language and making real-time decisions, AI models rely on hundreds of trillions of mathematical operations to be completed every second. To keep up, computers need specialized hardware built specifically for AI.
NVIDIA GeForce RTX desktop GPUs deliver up to 3,352 AI TOPS for unmatched speed and efficiency in AI-powered workflows. In 2018, NVIDIA GeForce RTX GPUs changed the game by introducing Tensor Cores - dedicated AI processors designed to handle these intensive workloads. Unlike traditional computing cores, Tensor Cores are built to accelerate AI by performing calculations faster and more efficiently. This breakthrough helped bring AI-powered gaming, creative tools and productivity applications into the mainstream.
Blackwell architecture takes AI acceleration to the next level. The fifth-generation Tensor Cores in Blackwell GPUs deliver up to 3,352 AI TOPS to handle even more demanding AI tasks and simultaneously run multiple AI models. This means faster AI-driven experiences, from real-time rendering to intelligent assistants, that pave the way for greater innovation in gaming, content creation and beyond.
FP4 - Smaller Models, Bigger Performance Another way to optimize AI performance is through quantization, a technique that reduces model sizes, enabling the models to run faster while reducing the memory requirements.
Enter FP4 - an advanced quantization format that allows AI models to run faster and leaner without compromising output quality. Compared with FP16, it reduces model size by up to 60% and more than doubles performance, with minimal degradation.
For example, Black Forest Labs' FLUX.1 [dev] model at FP16 requires over 23GB of VRAM, meaning it can only be supported by the GeForce RTX 4090 and professional GPUs. With FP4, FLUX.1 [dev] requires less than 10GB, so it can run locally on more GeForce RTX GPUs.
On a GeForce RTX 4090 with FP16, the FLUX.1 [dev] model can generate images in 15 seconds with just 30 steps. With a GeForce RTX 5090 with FP4, images can be generated in just over five seconds.
FP4 is natively supported by the Blackwell architecture, making it easier than ever to deploy high-performance AI on local PCs. It's also integrated into NIM microservices, effectively optimizing models that were previously difficult to quantize. By enabling more efficient AI processing, FP4 helps to bring faster, smarter AI experiences for content creation.
AI Blueprints Power Advanced AI Workflows on RTX PCs NVIDIA AI Blueprints, built on NIM microservices, provide prepackaged, optimized reference implementations that make it easier to develop advanced AI-powered projects - whether for digital humans, podcast generators or application assistants.
At CES, NVIDIA demonstrated PDF to Podcast
LINK: | https://blogs.nvidia.com/blog/rtx-ai-garage-blackwell-nim-blueprints-p... |
See more stories from nvidia |
North America Stories
05/02/2025
ESPN Viewing Hits Decade-Long Highs in January
ESPN is reporting that its NFL playoffs and College Football Playoff games in January boosted audiences to levels not seen at the network in years....
05/02/2025
Roku Remains Top U.S. Streaming Device
A new study from Pixalate indicates that Roku remains by far the dominant player among streaming devices in North America where its market share was more than d...
05/02/2025
Panasonic To Ship New 4K 60p 10-Bit Camcorders In March
NEWARK, N.J. Panasonic next month will release new 4K 60p 10-bit professional camcorder models, including the AG-CX20, HC-X1200 and HC-X2100 for video productio...
05/02/2025
Tegna Shuts Down National Fact-Checking Operation
WASHINGTON Tegna has shut down Verify, the station group's national fact-checking operation, laying off around 18 journalists, producers, researchers and ot...
05/02/2025
FCC Chair Brendan Carr Fills More Key Staff Positions
WASHINGTON, D.C. Federal Communications Commission chair Brendan Carr continues to build out his leadership team with the announcement of more staff appointment...
05/02/2025
Faculty Notes: Fall 2024
Faculty Notes: Fall 2024 Recent accomplishments, releases, and events by Berklee faculty. February 4, 2025 Professor Tomo Fujita played the John Coltrane In...
05/02/2025
AI Pays Off: Survey Reveals Financial Industry's Latest Technological Trends
The financial services industry is reaching an important milestone with AI, as organizations move beyond testing and experimentation to successful AI implementa...
05/02/2025
How GeForce RTX 50 Series GPUs Are Built to Supercharge Generative AI on PCs
NVIDIA's GeForce RTX 5090 and 5080 GPUs - which are based on the groundbreaking NVIDIA Blackwell architecture -offer up to 8x faster frame rates with NVIDIA...
04/02/2025
L3Harris Signs Multi-Year Pilot Training Agreement With Thai Airways
L3Harris has signed a two-year agreement with Thai Airways International to provide training service on its A320 Full Flight Simulator (FFS). This significant a...
04/02/2025
US Air Force Completes First Flight of L3Harris Viper Shield Electronic Warfare System
L3Harris' all-digital electronic warfare suite, Viper Shield , completed its...
04/02/2025
PARAMOUNT AND NIELSEN SIGN MULTI-YEAR MEASUREMENT AND ANALYTICS DEAL ACROSS PARAMOUNT'S LEADING BROADCAST, CABLE AND STREAMING PLATFORMS
Nielsen Reports Major Recent Ratings Milestones for CBS and Paramount Series N...
04/02/2025
Grup Mediapro to Collaborate with Google Cloud on Gen AI
BARCELONA Grup Mediapro and Google Cloud have expanded their collaboration to create an innovation lab focused on generative AI to develop solutions for the med...
04/02/2025
EditShare Receives SOCE 2 Type II Certification
WATERTOWN, Mass. EditShare this week said it has received SOC 2 Type II certification, an independently audited evaluation of an organization's information ...
04/02/2025
Executive Creative Director Halle Petro Named Partner of Sonic Union
Executive Creative Director Halle Petro Named Partner of Sonic Union Brie Clayton February 4, 2025 0 Comments Sonic Union is excited to announce Execu...
04/02/2025
Blackmagic Design Announces Blackmagic Camera for Android 2.0 Update
Blackmagic Design Announces Blackmagic Camera for Android 2.0 Update Brie Clayton February 4, 2025 0 Comments New update adds support for Xiaomi Pad 6...
04/02/2025
CETA Software Launches Artist Access: The Time-Tracking Tool for Creative Teams
CETA Software Launches Artist Access: The Time-Tracking Tool for Creative Teams Brie Clayton February 4, 2025 0 Comments CETA Software, creators of p...
04/02/2025
OWC Announces General Availability Launch of OWC Dock Ejector 2.0
OWC Announces General Availability Launch of OWC Dock Ejector 2.0 Brie Clayton February 4, 2025 0 Comments The Ultimate Tool for Efficiently and Safel...
04/02/2025
Paramount, Nielsen Sign Multiyear Measurement and Analytics Deal
NEW YORK Paramount Global and Nielsen have inked a new, multiyear deal that will provide measurement for all of the company's platforms, including national ...
04/02/2025
2d Animated Short Concerning a Project for Schools
2d Animated Short Concerning a Project for Schools Brie Clayton February 3, 2025 0 Comments 2d animated short concerning a project for schools Febru...
04/02/2025
Step by step guide to using 3D Models in After Effects
Step by step guide to using 3D Models in After Effects Graham Quince February 3, 2025 0 Comments Since 2024, Adobe After Effects has had native suppor...
04/02/2025
Powerful Premiere Automation with new Excalibur Update
Powerful Premiere Automation with new Excalibur Update Colin Smith February 3, 2025 0 Comments This tutorial takes you through the new update for auto...
04/02/2025
Cinematography of A Complete Unknown: Shooting 12,800 iso Sony Venice 2 to create a 1960's era film
Cinematography of A Complete Unknown: Shooting 12,800 iso Sony Venice 2 to creat...
04/02/2025
DIY to DA: Ela Minus Breaks Through
DIY to D A: Ela Minus Breaks Through The electronic artist and producer tells Rolling Stone about her new album, D A, and how shes forged a career outside the...
04/02/2025
NVIDIA Blackwell Now Generally Available in the Cloud
AI reasoning models and agents are set to transform industries, but delivering their full potential at scale requires massive compute and optimized software. Th...
04/02/2025
The Future of Football? Technology and Entertainment Merge in the Kings World Cup Nations
The future of football? Technology and entertainment merge in the Kings World Cu...
04/02/2025
Virtual Production and AR Graphics: Demystifying the Tools, Technologies, and Trends
Virtual Production and AR Graphics: Demystifying the Tools, Technologies, and Tr...
04/02/2025
SVG All-Stars: Russell Fink, Senior Director, Programming and Content Analytics, SNY
SVG All-Stars: Russell Fink, Senior Director, Programming and Content Analytics,...
04/02/2025
SVG New Sponsor Spotlight: farmerswife's Jodi Clifford on Organizing Your Productions Like a Professional
SVG New Sponsor Spotlight: farmerswife's Jodi Clifford on Organizing Your Pr...
04/02/2025
EA Acquires TRACAB Technologies as It Looks to Move Beyond Games
EA Acquires TRACAB Technologies as It Looks to Move Beyond Games EA believes TRACABs sports tracking/analysis technology will help to make the EA SPORTS App the...
04/02/2025
Kingdom Come: Alamiya Media on Bringing the Supercoppa Italiana and Supercopa de Espaa to Saudi Arabia
Kingdom come: Alamiya Media on bringing the Supercoppa Italiana and Supercopa de...
04/02/2025
Alamiya Media at 50: Preparing for Rapid Change, an International Broadcast Center and the FIFA World Cup
Alamiya Media at 50: Preparing for rapid change, an international broadcast cent...
04/02/2025
Clara Galle, Claudia Salas and Paula Usero Star in 'That Night,' the New Netflix Series Based on the Bestselling Novel by Gillian McAllister
Back to All News Clara Galle, Claudia Salas and Paula Usero Star in That Night,...
03/02/2025
L3Harris Technology Enhances US Torpedo Capability
The L3Harris IPLCS is a fiber-optic tether connecting a torpedo to the origin vessel, providing data in real time. Credit: L3Harris...
03/02/2025
VidTrans 2025 to Focus on Security, Dynamic Media Production
BOTHELL, Wash. Video Services Forum (VSF) today announced that the VidTrans 2025 conference and exposition will take place Feb. 25-27 at the Marina del Rey Marr...
03/02/2025
Legislation Proposed to Require Refunds During TV Blackouts
WASHINGTON Last week Rep. Pat Ryan (D-N.Y.) and Sen. Chris Murphy (D-Conn.) introduced the Stop Sports Blackouts Act to make cable and satellite companies ref...
03/02/2025
New Vendors Gain Amazon Prime Video Preferred Certification
Amazon Prime Video has added more companies to its Preferred Vendor Services Program....
03/02/2025
Grand Slam Track Inks Media Rights Deal with The CW, NBC Sports
BURBANK, Calif. The CW, NBC Sports and Grand Slam Track, a new global track competition, have announced a media rights deal that makes The CW the exclusive U.S....
03/02/2025
CJP building virtual production studio for BNU
Adding LED wall to green screen studio expands creative options CJP Broadcast Service Solutions, systems integration, production and content digitisation speci...
03/02/2025
CJP Broadcast Becomes Certified QuickLink Gold Partner to...
CJP Broadcast Service Solutions, a systems integration and content digitisation specialist, has been appointed as a certified QuickLink Gold Partner. This strat...
03/02/2025
2d Animated Short Concerned a Project for Schools
2d Animated Short Concerned a Project for Schools Brie Clayton February 3, 2025 0 Comments 2d animated short concerning a project for schools Februa...
03/02/2025
Academy Award-Winning Production Company Caviar Signs Director Dawit N.M.
Academy Award-Winning Production Company Caviar Signs Director Dawit N.M. Brie Clayton February 3, 2025 0 Comments Academy Award-winning independent p...
03/02/2025
Blackmagic Design Announces Customizable Blackmagic URSA Cine 12K Body
Blackmagic Design Announces Customizable Blackmagic URSA Cine 12K Body Brie Clayton February 3, 2025 0 Comments New body only model of Blackmagic URSA...
03/02/2025
Berklee Alumni Recognized at the 2025 Grammy Awards
Berklee Alumni Recognized at the 2025 Grammy Awards Winners took home trophies in 12 categories, including Album of the Year, Best Rock Song, and Songwriter o...
03/02/2025
KVM Advances Simplify Complex Workflows, With More on the Way
For media production companies, the drive for increased efficiency without extensive incremental costs or added complexity is always near the top of the priorit...
03/02/2025
Hybrid Uses of Virtual Production Take Hold Industry-Wide
Any lingering resistance to virtual production involving next-generation elements like LED walls, in-camera visual effects (ICVFX) and mixed reality (MR) is rap...
03/02/2025
Audio Consoles: Surface Still Matters
The physical function of audio mixing remains relatively unchanged today despite technology's onward march. There are now many ways to mix studio and outsid...
03/02/2025
Lightbridge 40 Percent Off CRLS Reflectors
Lightbridge, creators of the renowned Precision Reflectors for motion picture lighting, celebrates 8 years of innovation with a 40% discount on all current CRLS...
03/02/2025
Camera Corps Streamlines Live HDR Sports Production with...
High dynamic range (HDR) has transformed the sports fan experience, enabling broadcasters and OTT providers to deliver live events with remarkably lifelike colo...
03/02/2025
MainConcept Unveils Enhanced Easy Video API with JPEG XS...
MainConcept, the leading provider of video and audio codecs, announces the latest additions to its MainConcept EVA (Easy Video API) technology, set to greatly e...
03/02/2025
Last Samurai Standing' Unveils 14 New Cast Members Ahead of November Debut
Back to All News Last Samurai Standing' Unveils 14 New Cast Members Ahead of November Debut Entertainment 03 February 2025 GlobalJapan Link copied to ...