Sony Pixel Power calrec Sony

How GeForce RTX 50 Series GPUs Are Built to Supercharge Generative AI on PCs

05/02/2025

NVIDIA's GeForce RTX 5090 and 5080 GPUs - which are based on the groundbreaking NVIDIA Blackwell architecture -offer up to 8x faster frame rates with NVIDIA DLSS 4 technology, lower latency with NVIDIA Reflex 2 and enhanced graphical fidelity with NVIDIA RTX neural shaders.

These GPUs were built to accelerate the latest generative AI workloads, delivering up to 3,352 AI trillion operations per second (TOPS), enabling incredible experiences for AI enthusiasts, gamers, creators and developers.

To help AI developers and enthusiasts harness these capabilities, NVIDIA at the CES trade show last month unveiled NVIDIA NIM and AI Blueprints for RTX. NVIDIA NIM microservices are prepackaged generative AI models that let developers and enthusiasts easily get started with generative AI, iterate quickly and harness the power of RTX for accelerating AI on Windows PCs. NVIDIA AI Blueprints are reference projects that show developers how to use NIM microservices to build the next generation of AI experiences.

NIM and AI Blueprints are optimized for GeForce RTX 50 Series GPUs. These technologies work together seamlessly to help developers and enthusiasts build, iterate and deliver cutting-edge AI experiences on AI PCs.

NVIDIA NIM Accelerates Generative AI on PCs While AI model development is rapidly advancing, bringing these innovations to PCs remains a challenge for many people. Models posted on platforms like Hugging Face must be curated, adapted and quantized to run on PC. They also need to be integrated into new AI application programming interfaces (APIs) to ensure compatibility with existing tools, and converted to optimized inference backends for peak performance.

NVIDIA NIM microservices for RTX AI PCs and workstations can ease the complexity of this process by providing access to community-driven and NVIDIA-developed AI models. These microservices are easy to download and connect to via industry-standard APIs and span the key modalities essential for AI PCs. They are also compatible with a wide range of AI tools and offer flexible deployment options, whether on PCs, in data centers, or in the cloud.

NIM microservices include everything needed to run optimized models on PCs with RTX GPUs, including prebuilt engines for specific GPUs, the NVIDIA TensorRT software development kit (SDK), the open-source NVIDIA TensorRT-LLM library for accelerated inference using Tensor Cores, and more.

Microsoft and NVIDIA worked together to enable NIM microservices and AI Blueprints for RTX in Windows Subsystem for Linux (WSL2). With WSL2, the same AI containers that run on data center GPUs can now run efficiently on RTX PCs, making it easier for developers to build, test and deploy AI models across platforms.

In addition, NIM and AI Blueprints harness key innovations of the Blackwell architecture that the GeForce RTX 50 series is built on, including fifth-generation Tensor Cores and support for FP4 precision.

Tensor Cores Drive Next-Gen AI Performance AI calculations are incredibly demanding and require vast amounts of processing power. Whether generating images and videos or understanding language and making real-time decisions, AI models rely on hundreds of trillions of mathematical operations to be completed every second. To keep up, computers need specialized hardware built specifically for AI.

NVIDIA GeForce RTX desktop GPUs deliver up to 3,352 AI TOPS for unmatched speed and efficiency in AI-powered workflows. In 2018, NVIDIA GeForce RTX GPUs changed the game by introducing Tensor Cores - dedicated AI processors designed to handle these intensive workloads. Unlike traditional computing cores, Tensor Cores are built to accelerate AI by performing calculations faster and more efficiently. This breakthrough helped bring AI-powered gaming, creative tools and productivity applications into the mainstream.

Blackwell architecture takes AI acceleration to the next level. The fifth-generation Tensor Cores in Blackwell GPUs deliver up to 3,352 AI TOPS to handle even more demanding AI tasks and simultaneously run multiple AI models. This means faster AI-driven experiences, from real-time rendering to intelligent assistants, that pave the way for greater innovation in gaming, content creation and beyond.

FP4 - Smaller Models, Bigger Performance Another way to optimize AI performance is through quantization, a technique that reduces model sizes, enabling the models to run faster while reducing the memory requirements.

Enter FP4 - an advanced quantization format that allows AI models to run faster and leaner without compromising output quality. Compared with FP16, it reduces model size by up to 60% and more than doubles performance, with minimal degradation.

For example, Black Forest Labs' FLUX.1 [dev] model at FP16 requires over 23GB of VRAM, meaning it can only be supported by the GeForce RTX 4090 and professional GPUs. With FP4, FLUX.1 [dev] requires less than 10GB, so it can run locally on more GeForce RTX GPUs.

On a GeForce RTX 4090 with FP16, the FLUX.1 [dev] model can generate images in 15 seconds with just 30 steps. With a GeForce RTX 5090 with FP4, images can be generated in just over five seconds.

FP4 is natively supported by the Blackwell architecture, making it easier than ever to deploy high-performance AI on local PCs. It's also integrated into NIM microservices, effectively optimizing models that were previously difficult to quantize. By enabling more efficient AI processing, FP4 helps to bring faster, smarter AI experiences for content creation.

AI Blueprints Power Advanced AI Workflows on RTX PCs NVIDIA AI Blueprints, built on NIM microservices, provide prepackaged, optimized reference implementations that make it easier to develop advanced AI-powered projects - whether for digital humans, podcast generators or application assistants.

At CES, NVIDIA demonstrated PDF to Podcast
LINK: https://blogs.nvidia.com/blog/rtx-ai-garage-blackwell-nim-blueprints-p...
See more stories from nvidia

North America Stories

05/02/2025

ESPN Viewing Hits Decade-Long Highs in January

ESPN is reporting that its NFL playoffs and College Football Playoff games in January boosted audiences to levels not seen at the network in years....

05/02/2025

Roku Remains Top U.S. Streaming Device

A new study from Pixalate indicates that Roku remains by far the dominant player among streaming devices in North America where its market share was more than d...

05/02/2025

Panasonic To Ship New 4K 60p 10-Bit Camcorders In March

NEWARK, N.J. Panasonic next month will release new 4K 60p 10-bit professional camcorder models, including the AG-CX20, HC-X1200 and HC-X2100 for video productio...

05/02/2025

Tegna Shuts Down National Fact-Checking Operation

WASHINGTON Tegna has shut down Verify, the station group's national fact-checking operation, laying off around 18 journalists, producers, researchers and ot...

05/02/2025

FCC Chair Brendan Carr Fills More Key Staff Positions

WASHINGTON, D.C. Federal Communications Commission chair Brendan Carr continues to build out his leadership team with the announcement of more staff appointment...

05/02/2025

Faculty Notes: Fall 2024

Faculty Notes: Fall 2024 Recent accomplishments, releases, and events by Berklee faculty. February 4, 2025 Professor Tomo Fujita played the John Coltrane In...

05/02/2025

AI Pays Off: Survey Reveals Financial Industry's Latest Technological Trends

The financial services industry is reaching an important milestone with AI, as organizations move beyond testing and experimentation to successful AI implementa...

05/02/2025

How GeForce RTX 50 Series GPUs Are Built to Supercharge Generative AI on PCs

NVIDIA's GeForce RTX 5090 and 5080 GPUs - which are based on the groundbreaking NVIDIA Blackwell architecture -offer up to 8x faster frame rates with NVIDIA...

04/02/2025

L3Harris Signs Multi-Year Pilot Training Agreement With Thai Airways

L3Harris has signed a two-year agreement with Thai Airways International to provide training service on its A320 Full Flight Simulator (FFS). This significant a...

04/02/2025

US Air Force Completes First Flight of L3Harris Viper Shield Electronic Warfare System

L3Harris' all-digital electronic warfare suite, Viper Shield , completed its...

04/02/2025

Grup Mediapro to Collaborate with Google Cloud on Gen AI

BARCELONA Grup Mediapro and Google Cloud have expanded their collaboration to create an innovation lab focused on generative AI to develop solutions for the med...

04/02/2025

EditShare Receives SOCE 2 Type II Certification

WATERTOWN, Mass. EditShare this week said it has received SOC 2 Type II certification, an independently audited evaluation of an organization's information ...

04/02/2025

Executive Creative Director Halle Petro Named Partner of Sonic Union

Executive Creative Director Halle Petro Named Partner of Sonic Union Brie Clayton February 4, 2025 0 Comments Sonic Union is excited to announce Execu...

04/02/2025

Blackmagic Design Announces Blackmagic Camera for Android 2.0 Update

Blackmagic Design Announces Blackmagic Camera for Android 2.0 Update Brie Clayton February 4, 2025 0 Comments New update adds support for Xiaomi Pad 6...

04/02/2025

CETA Software Launches Artist Access: The Time-Tracking Tool for Creative Teams

CETA Software Launches Artist Access: The Time-Tracking Tool for Creative Teams Brie Clayton February 4, 2025 0 Comments CETA Software, creators of p...

04/02/2025

OWC Announces General Availability Launch of OWC Dock Ejector 2.0

OWC Announces General Availability Launch of OWC Dock Ejector 2.0 Brie Clayton February 4, 2025 0 Comments The Ultimate Tool for Efficiently and Safel...

04/02/2025

Paramount, Nielsen Sign Multiyear Measurement and Analytics Deal

NEW YORK Paramount Global and Nielsen have inked a new, multiyear deal that will provide measurement for all of the company's platforms, including national ...

04/02/2025

2d Animated Short Concerning a Project for Schools

2d Animated Short Concerning a Project for Schools Brie Clayton February 3, 2025 0 Comments 2d animated short concerning a project for schools Febru...

04/02/2025

Step by step guide to using 3D Models in After Effects

Step by step guide to using 3D Models in After Effects Graham Quince February 3, 2025 0 Comments Since 2024, Adobe After Effects has had native suppor...

04/02/2025

Powerful Premiere Automation with new Excalibur Update

Powerful Premiere Automation with new Excalibur Update Colin Smith February 3, 2025 0 Comments This tutorial takes you through the new update for auto...

04/02/2025

Cinematography of A Complete Unknown: Shooting 12,800 iso Sony Venice 2 to create a 1960's era film

Cinematography of A Complete Unknown: Shooting 12,800 iso Sony Venice 2 to creat...

04/02/2025

DIY to DA: Ela Minus Breaks Through

DIY to D A: Ela Minus Breaks Through The electronic artist and producer tells Rolling Stone about her new album, D A, and how shes forged a career outside the...

04/02/2025

NVIDIA Blackwell Now Generally Available in the Cloud

AI reasoning models and agents are set to transform industries, but delivering their full potential at scale requires massive compute and optimized software. Th...

04/02/2025

The Future of Football? Technology and Entertainment Merge in the Kings World Cup Nations

The future of football? Technology and entertainment merge in the Kings World Cu...

04/02/2025

Virtual Production and AR Graphics: Demystifying the Tools, Technologies, and Trends

Virtual Production and AR Graphics: Demystifying the Tools, Technologies, and Tr...

04/02/2025

SVG All-Stars: Russell Fink, Senior Director, Programming and Content Analytics, SNY

SVG All-Stars: Russell Fink, Senior Director, Programming and Content Analytics,...

04/02/2025

SVG New Sponsor Spotlight: farmerswife's Jodi Clifford on Organizing Your Productions Like a Professional

SVG New Sponsor Spotlight: farmerswife's Jodi Clifford on Organizing Your Pr...

04/02/2025

EA Acquires TRACAB Technologies as It Looks to Move Beyond Games

EA Acquires TRACAB Technologies as It Looks to Move Beyond Games EA believes TRACABs sports tracking/analysis technology will help to make the EA SPORTS App the...

04/02/2025

Kingdom Come: Alamiya Media on Bringing the Supercoppa Italiana and Supercopa de Espaa to Saudi Arabia

Kingdom come: Alamiya Media on bringing the Supercoppa Italiana and Supercopa de...

04/02/2025

Alamiya Media at 50: Preparing for Rapid Change, an International Broadcast Center and the FIFA World Cup

Alamiya Media at 50: Preparing for rapid change, an international broadcast cent...

03/02/2025

L3Harris Technology Enhances US Torpedo Capability

The L3Harris IPLCS is a fiber-optic tether connecting a torpedo to the origin vessel, providing data in real time. Credit: L3Harris...

03/02/2025

VidTrans 2025 to Focus on Security, Dynamic Media Production

BOTHELL, Wash. Video Services Forum (VSF) today announced that the VidTrans 2025 conference and exposition will take place Feb. 25-27 at the Marina del Rey Marr...

03/02/2025

Legislation Proposed to Require Refunds During TV Blackouts

WASHINGTON Last week Rep. Pat Ryan (D-N.Y.) and Sen. Chris Murphy (D-Conn.) introduced the Stop Sports Blackouts Act to make cable and satellite companies ref...

03/02/2025

New Vendors Gain Amazon Prime Video Preferred Certification

Amazon Prime Video has added more companies to its Preferred Vendor Services Program....

03/02/2025

Grand Slam Track Inks Media Rights Deal with The CW, NBC Sports

BURBANK, Calif. The CW, NBC Sports and Grand Slam Track, a new global track competition, have announced a media rights deal that makes The CW the exclusive U.S....

03/02/2025

CJP building virtual production studio for BNU

Adding LED wall to green screen studio expands creative options CJP Broadcast Service Solutions, systems integration, production and content digitisation speci...

03/02/2025

CJP Broadcast Becomes Certified QuickLink Gold Partner to...

CJP Broadcast Service Solutions, a systems integration and content digitisation specialist, has been appointed as a certified QuickLink Gold Partner. This strat...

03/02/2025

2d Animated Short Concerned a Project for Schools

2d Animated Short Concerned a Project for Schools Brie Clayton February 3, 2025 0 Comments 2d animated short concerning a project for schools Februa...

03/02/2025

Academy Award-Winning Production Company Caviar Signs Director Dawit N.M.

Academy Award-Winning Production Company Caviar Signs Director Dawit N.M. Brie Clayton February 3, 2025 0 Comments Academy Award-winning independent p...

03/02/2025

Blackmagic Design Announces Customizable Blackmagic URSA Cine 12K Body

Blackmagic Design Announces Customizable Blackmagic URSA Cine 12K Body Brie Clayton February 3, 2025 0 Comments New body only model of Blackmagic URSA...

03/02/2025

Berklee Alumni Recognized at the 2025 Grammy Awards

Berklee Alumni Recognized at the 2025 Grammy Awards Winners took home trophies in 12 categories, including Album of the Year, Best Rock Song, and Songwriter o...

03/02/2025

KVM Advances Simplify Complex Workflows, With More on the Way

For media production companies, the drive for increased efficiency without extensive incremental costs or added complexity is always near the top of the priorit...

03/02/2025

Hybrid Uses of Virtual Production Take Hold Industry-Wide

Any lingering resistance to virtual production involving next-generation elements like LED walls, in-camera visual effects (ICVFX) and mixed reality (MR) is rap...

03/02/2025

Audio Consoles: Surface Still Matters

The physical function of audio mixing remains relatively unchanged today despite technology's onward march. There are now many ways to mix studio and outsid...

03/02/2025

Lightbridge 40 Percent Off CRLS Reflectors

Lightbridge, creators of the renowned Precision Reflectors for motion picture lighting, celebrates 8 years of innovation with a 40% discount on all current CRLS...

03/02/2025

Camera Corps Streamlines Live HDR Sports Production with...

High dynamic range (HDR) has transformed the sports fan experience, enabling broadcasters and OTT providers to deliver live events with remarkably lifelike colo...

03/02/2025

MainConcept Unveils Enhanced Easy Video API with JPEG XS...

MainConcept, the leading provider of video and audio codecs, announces the latest additions to its MainConcept EVA (Easy Video API) technology, set to greatly e...

03/02/2025

Last Samurai Standing' Unveils 14 New Cast Members Ahead of November Debut

Back to All News Last Samurai Standing' Unveils 14 New Cast Members Ahead of November Debut Entertainment 03 February 2025 GlobalJapan Link copied to ...