Sony Pixel Power calrec Sony

How GeForce RTX 50 Series GPUs Are Built to Supercharge Generative AI on PCs

05/02/2025

NVIDIA's GeForce RTX 5090 and 5080 GPUs - which are based on the groundbreaking NVIDIA Blackwell architecture -offer up to 8x faster frame rates with NVIDIA DLSS 4 technology, lower latency with NVIDIA Reflex 2 and enhanced graphical fidelity with NVIDIA RTX neural shaders.

These GPUs were built to accelerate the latest generative AI workloads, delivering up to 3,352 AI trillion operations per second (TOPS), enabling incredible experiences for AI enthusiasts, gamers, creators and developers.

To help AI developers and enthusiasts harness these capabilities, NVIDIA at the CES trade show last month unveiled NVIDIA NIM and AI Blueprints for RTX. NVIDIA NIM microservices are prepackaged generative AI models that let developers and enthusiasts easily get started with generative AI, iterate quickly and harness the power of RTX for accelerating AI on Windows PCs. NVIDIA AI Blueprints are reference projects that show developers how to use NIM microservices to build the next generation of AI experiences.

NIM and AI Blueprints are optimized for GeForce RTX 50 Series GPUs. These technologies work together seamlessly to help developers and enthusiasts build, iterate and deliver cutting-edge AI experiences on AI PCs.

NVIDIA NIM Accelerates Generative AI on PCs While AI model development is rapidly advancing, bringing these innovations to PCs remains a challenge for many people. Models posted on platforms like Hugging Face must be curated, adapted and quantized to run on PC. They also need to be integrated into new AI application programming interfaces (APIs) to ensure compatibility with existing tools, and converted to optimized inference backends for peak performance.

NVIDIA NIM microservices for RTX AI PCs and workstations can ease the complexity of this process by providing access to community-driven and NVIDIA-developed AI models. These microservices are easy to download and connect to via industry-standard APIs and span the key modalities essential for AI PCs. They are also compatible with a wide range of AI tools and offer flexible deployment options, whether on PCs, in data centers, or in the cloud.

NIM microservices include everything needed to run optimized models on PCs with RTX GPUs, including prebuilt engines for specific GPUs, the NVIDIA TensorRT software development kit (SDK), the open-source NVIDIA TensorRT-LLM library for accelerated inference using Tensor Cores, and more.

Microsoft and NVIDIA worked together to enable NIM microservices and AI Blueprints for RTX in Windows Subsystem for Linux (WSL2). With WSL2, the same AI containers that run on data center GPUs can now run efficiently on RTX PCs, making it easier for developers to build, test and deploy AI models across platforms.

In addition, NIM and AI Blueprints harness key innovations of the Blackwell architecture that the GeForce RTX 50 series is built on, including fifth-generation Tensor Cores and support for FP4 precision.

Tensor Cores Drive Next-Gen AI Performance AI calculations are incredibly demanding and require vast amounts of processing power. Whether generating images and videos or understanding language and making real-time decisions, AI models rely on hundreds of trillions of mathematical operations to be completed every second. To keep up, computers need specialized hardware built specifically for AI.

NVIDIA GeForce RTX desktop GPUs deliver up to 3,352 AI TOPS for unmatched speed and efficiency in AI-powered workflows. In 2018, NVIDIA GeForce RTX GPUs changed the game by introducing Tensor Cores - dedicated AI processors designed to handle these intensive workloads. Unlike traditional computing cores, Tensor Cores are built to accelerate AI by performing calculations faster and more efficiently. This breakthrough helped bring AI-powered gaming, creative tools and productivity applications into the mainstream.

Blackwell architecture takes AI acceleration to the next level. The fifth-generation Tensor Cores in Blackwell GPUs deliver up to 3,352 AI TOPS to handle even more demanding AI tasks and simultaneously run multiple AI models. This means faster AI-driven experiences, from real-time rendering to intelligent assistants, that pave the way for greater innovation in gaming, content creation and beyond.

FP4 - Smaller Models, Bigger Performance Another way to optimize AI performance is through quantization, a technique that reduces model sizes, enabling the models to run faster while reducing the memory requirements.

Enter FP4 - an advanced quantization format that allows AI models to run faster and leaner without compromising output quality. Compared with FP16, it reduces model size by up to 60% and more than doubles performance, with minimal degradation.

For example, Black Forest Labs' FLUX.1 [dev] model at FP16 requires over 23GB of VRAM, meaning it can only be supported by the GeForce RTX 4090 and professional GPUs. With FP4, FLUX.1 [dev] requires less than 10GB, so it can run locally on more GeForce RTX GPUs.

On a GeForce RTX 4090 with FP16, the FLUX.1 [dev] model can generate images in 15 seconds with just 30 steps. With a GeForce RTX 5090 with FP4, images can be generated in just over five seconds.

FP4 is natively supported by the Blackwell architecture, making it easier than ever to deploy high-performance AI on local PCs. It's also integrated into NIM microservices, effectively optimizing models that were previously difficult to quantize. By enabling more efficient AI processing, FP4 helps to bring faster, smarter AI experiences for content creation.

AI Blueprints Power Advanced AI Workflows on RTX PCs NVIDIA AI Blueprints, built on NIM microservices, provide prepackaged, optimized reference implementations that make it easier to develop advanced AI-powered projects - whether for digital humans, podcast generators or application assistants.

At CES, NVIDIA demonstrated PDF to Podcast
LINK: https://blogs.nvidia.com/blog/rtx-ai-garage-blackwell-nim-blueprints-p...
See more stories from nvidia

Most recent headlines

05/02/2025

Inside IBC's innovation boom: what's powering the future of media?

Speaking exclusively to TVBEurope, four of its experts weigh in on the IBCs role in fostering innovation, the technological shifts on the horizon, and why sport...

05/02/2025

Meet the director of product marketing

Vincent Noyer, director of product marketing at LYNX Technik, tells TVBEurope how remaining engaged and proactive leads to opportunities for growth By Matthew ...

05/02/2025

ESPN Viewing Hits Decade-Long Highs in January

ESPN is reporting that its NFL playoffs and College Football Playoff games in January boosted audiences to levels not seen at the network in years....

05/02/2025

Roku Remains Top U.S. Streaming Device

A new study from Pixalate indicates that Roku remains by far the dominant player among streaming devices in North America where its market share was more than d...

05/02/2025

Panasonic To Ship New 4K 60p 10-Bit Camcorders In March

NEWARK, N.J. Panasonic next month will release new 4K 60p 10-bit professional camcorder models, including the AG-CX20, HC-X1200 and HC-X2100 for video productio...

05/02/2025

Tegna Shuts Down National Fact-Checking Operation

WASHINGTON Tegna has shut down Verify, the station group's national fact-checking operation, laying off around 18 journalists, producers, researchers and ot...

05/02/2025

FCC Chair Brendan Carr Fills More Key Staff Positions

WASHINGTON, D.C. Federal Communications Commission chair Brendan Carr continues to build out his leadership team with the announcement of more staff appointment...

05/02/2025

Sanctuary Pictures Unveils Punk-Horror Feature Penny Lane Is Dead

04 02 2025 - Media release Sanctuary Pictures Unveils Punk-Horror Feature Penny Lane Is Dead Writer/Director of Penny Lane Is Dead, Mia'Kate Russell Sanc...

05/02/2025

Faculty Notes: Fall 2024

Faculty Notes: Fall 2024 Recent accomplishments, releases, and events by Berklee faculty. February 4, 2025 Professor Tomo Fujita played the John Coltrane In...

05/02/2025

Other Voices makes its return to RT this Spring with an incredible lineup

Laura Marling, Lisa O'Neill, James Dean Bradfield, Jacob Alon, Bashy, Morgana and more to feature in Other Voices Series 23 this Spring RT 2 & RT Player |...

05/02/2025

AI Pays Off: Survey Reveals Financial Industry's Latest Technological Trends

The financial services industry is reaching an important milestone with AI, as organizations move beyond testing and experimentation to successful AI implementa...

05/02/2025

How GeForce RTX 50 Series GPUs Are Built to Supercharge Generative AI on PCs

NVIDIA's GeForce RTX 5090 and 5080 GPUs - which are based on the groundbreaking NVIDIA Blackwell architecture -offer up to 8x faster frame rates with NVIDIA...

04/02/2025

Spotify Reports Fourth Quarter 2024 Earnings

Today, we announced our fourth quarter 2024 earnings, closing Q4 stronger than ever by outperforming across key metrics and celebrating our first full year of p...

04/02/2025

Spotify rapporterar intkter fr fjrde kvartalet 2024

Idag rapporterar vi int kter f r fj rde kvartalet 2024. Vi avslutade Q4 starkare n n gonsin genom att vertr ffa f rv ntningarna p v ra nyckeltal och kan d rm...

04/02/2025

SGL Carbon opts for green electricity at its German sites

As a technology-based company and one of the worlds leading companies in the development and production of carbon-based solutions, SGL Carbon opts for innovativ...

04/02/2025

ST Engineering iDirect Names Sridhar Kuppanna as Chief Technology Officer

Ground segment technology innovator appoints new CTO to execute bold technological vision Herndon, Va., February 4, 2025 ST Engineering iDirect, global leade...

04/02/2025

L3Harris Signs Multi-Year Pilot Training Agreement With Thai Airways

L3Harris has signed a two-year agreement with Thai Airways International to provide training service on its A320 Full Flight Simulator (FFS). This significant a...

04/02/2025

US Air Force Completes First Flight of L3Harris Viper Shield Electronic Warfare System

L3Harris' all-digital electronic warfare suite, Viper Shield , completed its...

04/02/2025

Radio Botswana chooses Calrec's IP-native Type R mixing system

The shift from analogue to IP was driven by a desire for greater flexibility in our operations. IP simplifies connectivity, reduces the physical footprint of th...

04/02/2025

Simplifying Gray Media News Operations with Calrec's Type R

Streamline, standardise and save: how Gray Media has simplified news operations At TVNewsCheck's News Tech Forum 2024, Gray Media's Peter Gogas and Calr...

04/02/2025

Bending Spoons closes $233 million acquisition of Brightcove

Boston, MA-February 4, 2025 | Bending Spoons, the Italy-based technology company, completed its previously announced acquisition of US-based streaming technolog...

04/02/2025

Grup Mediapro to Collaborate with Google Cloud on Gen AI

BARCELONA Grup Mediapro and Google Cloud have expanded their collaboration to create an innovation lab focused on generative AI to develop solutions for the med...

04/02/2025

EditShare Receives SOCE 2 Type II Certification

WATERTOWN, Mass. EditShare this week said it has received SOC 2 Type II certification, an independently audited evaluation of an organization's information ...

04/02/2025

Executive Creative Director Halle Petro Named Partner of Sonic Union

Executive Creative Director Halle Petro Named Partner of Sonic Union Brie Clayton February 4, 2025 0 Comments Sonic Union is excited to announce Execu...

04/02/2025

Blackmagic Design Announces Blackmagic Camera for Android 2.0 Update

Blackmagic Design Announces Blackmagic Camera for Android 2.0 Update Brie Clayton February 4, 2025 0 Comments New update adds support for Xiaomi Pad 6...

04/02/2025

CETA Software Launches Artist Access: The Time-Tracking Tool for Creative Teams

CETA Software Launches Artist Access: The Time-Tracking Tool for Creative Teams Brie Clayton February 4, 2025 0 Comments CETA Software, creators of p...

04/02/2025

OWC Announces General Availability Launch of OWC Dock Ejector 2.0

OWC Announces General Availability Launch of OWC Dock Ejector 2.0 Brie Clayton February 4, 2025 0 Comments The Ultimate Tool for Efficiently and Safel...

04/02/2025

Colourist Claudio Del Bravo on grading Queer

Explaining the process to TVBEurope, Del Bravo said the films look was inspired by the Technicolor three-strip' process, evoking the rich colours of early ...

04/02/2025

Paramount, Nielsen Sign Multiyear Measurement and Analytics Deal

NEW YORK Paramount Global and Nielsen have inked a new, multiyear deal that will provide measurement for all of the company's platforms, including national ...

04/02/2025

2d Animated Short Concerning a Project for Schools

2d Animated Short Concerning a Project for Schools Brie Clayton February 3, 2025 0 Comments 2d animated short concerning a project for schools Febru...

04/02/2025

Step by step guide to using 3D Models in After Effects

Step by step guide to using 3D Models in After Effects Graham Quince February 3, 2025 0 Comments Since 2024, Adobe After Effects has had native suppor...

04/02/2025

Powerful Premiere Automation with new Excalibur Update

Powerful Premiere Automation with new Excalibur Update Colin Smith February 3, 2025 0 Comments This tutorial takes you through the new update for auto...

04/02/2025

Cinematography of A Complete Unknown: Shooting 12,800 iso Sony Venice 2 to create a 1960's era film

Cinematography of A Complete Unknown: Shooting 12,800 iso Sony Venice 2 to creat...

04/02/2025

DIY to DA: Ela Minus Breaks Through

DIY to D A: Ela Minus Breaks Through The electronic artist and producer tells Rolling Stone about her new album, D A, and how shes forged a career outside the...

04/02/2025

NVIDIA Blackwell Now Generally Available in the Cloud

AI reasoning models and agents are set to transform industries, but delivering their full potential at scale requires massive compute and optimized software. Th...

04/02/2025

The Future of Football? Technology and Entertainment Merge in the Kings World Cup Nations

The future of football? Technology and entertainment merge in the Kings World Cu...

04/02/2025

Virtual Production and AR Graphics: Demystifying the Tools, Technologies, and Trends

Virtual Production and AR Graphics: Demystifying the Tools, Technologies, and Tr...

04/02/2025

SVG All-Stars: Russell Fink, Senior Director, Programming and Content Analytics, SNY

SVG All-Stars: Russell Fink, Senior Director, Programming and Content Analytics,...

04/02/2025

SVG New Sponsor Spotlight: farmerswife's Jodi Clifford on Organizing Your Productions Like a Professional

SVG New Sponsor Spotlight: farmerswife's Jodi Clifford on Organizing Your Pr...

04/02/2025

EA Acquires TRACAB Technologies as It Looks to Move Beyond Games

EA Acquires TRACAB Technologies as It Looks to Move Beyond Games EA believes TRACABs sports tracking/analysis technology will help to make the EA SPORTS App the...

04/02/2025

Kingdom Come: Alamiya Media on Bringing the Supercoppa Italiana and Supercopa de Espaa to Saudi Arabia

Kingdom come: Alamiya Media on bringing the Supercoppa Italiana and Supercopa de...

04/02/2025

Alamiya Media at 50: Preparing for Rapid Change, an International Broadcast Center and the FIFA World Cup

Alamiya Media at 50: Preparing for rapid change, an international broadcast cent...

04/02/2025

An update on our TV and broadband prices

An update on our TV and broadband pricesTuesday 4 February 2025 An update on our TV and broadband prices Devesh Raj, Chief Operating Officer, Sky This April,...

04/02/2025

Sky extends partnership with the PDC to remain the home of darts until 2030

Sky extends partnership with the PDC to remain the home of darts until 2030Tuesday 4 February 2025 Following another record-breaking PDC World Darts Championsh...

04/02/2025

Frankfurt is the world's first airport to regularly use walk-through scanners from Rohde & Schwarz for passengers

Frankfurt is the world's first airport to regularly use walk-through scanner...

04/02/2025

Riedel Unveils Next Generation of StageLink Edge Devices

Wuppertal February 4, 2025 Riedel Unveils Next Generation of StageLink Edge DevicesRiedel Communications today announced the launch of its StageLink family of...

04/02/2025

Fox Corporation Reports Second Quarter Fiscal 2025 Financial Results

Fox Corporation Reports Second Quarter Fiscal 2025 Financial Results NEW YORK, NY, February 4, 2025 - Fox Corporation (Nasdaq: FOXA, FOX; FOX or the Compan...

04/02/2025

Introducing our fully digital, true diversity wideband wireless mic solution

DPA Microphones is moving into the wireless market with the release of its new N-Series Digital Wireless System at ISE 2025 (Stand 7P600). A fully digital, true...