
Generative AI is redefining computing, unlocking new ways to build, train and optimize AI models on PCs and workstations. From content creation and large and small language models to software development, AI-powered PCs and workstations are transforming workflows and enhancing productivity.
At GTC 2025, running March 17-21 in the San Jose Convention Center, experts from across the AI ecosystem will share insights on deploying AI locally, optimizing models and harnessing cutting-edge hardware and software to enhance AI workloads - highlighting key advancements in RTX AI PCs and workstations.
Develop and Deploy on RTX RTX GPUs are built with specialized AI hardware called Tensor Cores that provide the compute performance needed to run the latest and most demanding AI models. These high-performance GPUs can help build digital humans, chatbots, AI-generated podcasts and more.
With more than 100 million GeForce RTX and NVIDIA RTX GPUs users, developers have a large audience to target when new AI apps and features are deployed. In the session Build Digital Humans, Chatbots, and AI-Generated Podcasts for RTX PCs and Workstations, Annamalai Chockalingam, senior product manager at NVIDIA, will showcase the end-to-end suite of tools developers can use to streamline development and deploy incredibly fast AI-enabled applications.
Model Behavior Large language models (LLMs) can be used for an abundance of use cases - and scale to tackle complex tasks like writing code or translating Japanese into Greek. But since they're typically trained with a wide spectrum of knowledge for broad applications, they may not be the right fit for specific tasks, like nonplayer character dialog generation in a video game. In contrast, small language models balance need with reduced size, maintaining accuracy while running locally on more devices.
In the session Watch Your Language: Create Small Language Models That Run On-Device, Oluwatobi Olabiyi, senior engineering manager at NVIDIA, will present tools and techniques that developers and enthusiasts can use to generate, curate and distill a dataset - then train a small language model that can perform tasks designed for it.
Maximizing AI Performance on Windows Workstations Optimizing AI inference and model execution on Windows-based workstations requires strategic software and hardware tuning due to diverse hardware configurations and software environments. The session Optimizing AI Workloads on Windows Workstations: Strategies and Best Practices, will explore best practices for AI optimization, including model quantization, inference pipeline enhancements and hardware-aware tuning.
A team of NVIDIA software engineers will also cover hardware-aware optimizations for ONNX Runtime, NVIDIA TensorRT and llama.cpp, helping developers maximize AI efficiency across GPUs, CPUs and NPUs.
Advancing Local AI Development Building, testing and deploying AI models on local infrastructure ensures security and performance even without a connection to cloud-based services. Accelerated with NVIDIA RTX GPUs, both Dell Pro Max AI and Z by HP solutions provide powerful tools for on-prem AI development, helping professionals maintain control over data and IP while optimizing performance.
Learn more by attending the following sessions:
Dell Pro Max and NVIDIA: Unleashing the Future of AI Development: This session introduces Dell Pro Max PCs, performance laptops and desktops for professionals, powered by NVIDIA RTX GPUs. Discover how this powerful duo can help jumpstart AI initiatives and transform the way AI developers, data scientists, creators and power users innovate.
Develop and Observe Gen AI On-Prem With Z by HP GenAI Lab and AI Studio: This session demonstrates how Z by HP solutions simplify local model training and deployment, harnessing models in the NVIDIA NGC catalog and Galileo evaluation technology to refine generative AI projects securely and efficiently.
Supercharge Gen AI Development With Z by HP GenAI Lab and AI Studio: This session explores how Z by HP's GenAI Lab and AI Studio enable on-premises LLM development while maintaining complete data security and control. Learn how these tools streamline the entire AI lifecycle, from experimentation to deployment, while integrating models available in the NVIDIA NGC catalog for collaboration and workflow efficiency.
Developers and enthusiasts can get started with AI development on RTX AI PCs and workstations using NVIDIA NIM microservices. Rolling out today, the initial public beta release includes the Llama 3.1 LLM, NVIDIA Riva Parakeet for automatic speech recognition (ASR), and YOLOX for computer vision.
NIM microservices are optimized, prepackaged models for generative AI. They span modalities important for PC development, and are easy to download and connect to via industry-standard application programming interfaces.
Attend GTC 2025 From the keynote by NVIDIA founder and CEO Jensen Huang to over 1,000 inspiring sessions, 300+ exhibits, technical hands-on training and tons of unique networking events - GTC is set to put a spotlight on AI and all its benefits.
Follow NVIDIA AI PC on Facebook, Instagram, TikTok and X - and stay informed by subscribing to the RTX AI PC newsletter.
More from Nvidia
10/03/2025
A new AI education initiative in the State of Utah, developed in collaboration with NVIDIA, is set to advance the state's commitment to workforce training a...
06/03/2025
For the past 16 years, NVIDIA technologies have been working behind the scenes o...
06/03/2025
Time for a roaring-good time with Capcom's hit Monster Hunter Wilds. GeForce NOW members can hunt even the largest, most daunting monsters with the sharpest...
05/03/2025
Ninety percent of information transmitted to the human brain is visual. The importance of sight in understanding the world makes computer vision essential for A...
03/03/2025
From Seattle, Washington, to Cape Town, South Africa - and everywhere around and between - AI is helping conserve the wild plants and animals that make up the i...
28/02/2025
Editor's note: This is the next topic in our new CUDA Accelerated news series, which showcases the latest software libraries, NVIDIA NIM microservices and t...
27/02/2025
Norway's first sustainable and secure AI cloud service demonstrates how coun...
27/02/2025
From improving customer experiences to boosting operational efficiency, agentic AI - advanced AI systems designed to autonomously reason, plan and execute compl...
27/02/2025
GeForce NOW is blooming further with an array of 14 new titles in March.
A garden of gaming delights will have members marching straight into action and advent...
26/02/2025
Generative AI is redefining computing, unlocking new ways to build, train and op...
24/02/2025
To better prepare communities for extreme weather, forecasters first need to see...
20/02/2025
American Sign Language is the third most prevalent language in the United States...
20/02/2025
The NVIDIA GeForce RTX 5070 Ti graphics cards - built on the NVIDIA Blackwell ar...
20/02/2025
Editor's note: This post is part of Into the Omniverse, a series focused on ...
20/02/2025
Wield magic and steel as GeForce NOW's fifth-anniversary celebration summons Obsidian Entertainment's highly anticipated Avowed to the cloud.
This firs...
19/02/2025
In financial services, AI has traditionally been used primarily for fraud detect...
19/02/2025
Scientists everywhere can now access Evo 2, a powerful new foundation model that...
19/02/2025
The telecom industry's efforts to drive efficiencies with AI are beginning to show fruit.
An increasing focus on deploying AI into radio access networks (R...
13/02/2025
Asteroids were responsible for extinction events hundreds of millions of years a...
13/02/2025
Asteroids were responsible for extinction events hundreds of millions of years a...
13/02/2025
It's a match made in heaven - GeForce NOW and Warner Bros. Games are collabo...
12/02/2025
Just as there are widely understood empirical laws of nature - for example, what goes up must come down, or every action has an equal and opposite reaction - th...
12/02/2025
The rapid evolution of generative AI has created countless opportunities for inn...
11/02/2025
NVIDIA's contributions to accelerating medical imaging, genomics, computatio...
11/02/2025
Tara Chklovski has spent much of her career inspiring young women to take on som...
07/02/2025
Every year, venomous snakes kill over 100,000 people and leave 300,000 more with devastating injuries - amputations, paralysis and permanent disabilities. The v...
06/02/2025
AI built for speech is now decoding the language of earthquakes.
A team of researchers from the Earth and environmental sciences division at Los Alamos Nationa...
06/02/2025
GeForce NOW celebrates its fifth anniversary this February with a lineup of five major releases. The month kicks off with Kingdom Come: Deliverance II. Prepare ...
05/02/2025
When non-technical users can create and deploy reliable AI workflows, organizations can do more to serve their clientele
Platforms for developing no- and low-c...
05/02/2025
The financial services industry is reaching an important milestone with AI, as organizations move beyond testing and experimentation to successful AI implementa...
05/02/2025
NVIDIA's GeForce RTX 5090 and 5080 GPUs - which are based on the groundbreaking NVIDIA Blackwell architecture -offer up to 8x faster frame rates with NVIDIA...
04/02/2025
AI reasoning models and agents are set to transform industries, but delivering their full potential at scale requires massive compute and optimized software. Th...
31/01/2025
The recently released DeepSeek-R1 model family has brought a new wave of excitement to the AI community, allowing enthusiasts and developers to run state-of-the...
30/01/2025
DeepSeek-R1 is an open model with state-of-the-art reasoning capabilities. Instead of offering direct responses, AI models like DeepSeek-R1 perform reasoning th...
30/01/2025
GeForce NOW turns five this February. Five incredible years of high-performance gaming have been made possible thanks to the members who've joined the cloud...
30/01/2025
New GeForce RTX 5090 and RTX 5080 GPUs - built on the NVIDIA Blackwell architect...
29/01/2025
AI agents with advanced perception and cognition capabilities are making digital experiences more dynamic and personalized across retail, finance, entertainment...
27/01/2025
Named after Greek mythology's goddess of the sea, France-based startup Amphi...
23/01/2025
Businesses across every industry are rolling out AI services this year. For Microsoft, Oracle, Perplexity, Snap and hundreds of other leading companies, using t...
23/01/2025
GeForce NOW is expanding mod support for hit game Baldur's Gate 3 in collaboration with Larian Studios and mod.io for Ultimate and Performance members.
Thi...
22/01/2025
Companies and organizations are increasingly using AI to protect their customers and thwart the efforts of fraudsters around the world.
Voice security company ...
22/01/2025
Editor's note: This post is part of Into the Omniverse, a series focused on ...
22/01/2025
AI agents - which can understand, adapt to and support each user's unique journey - are making online shopping and digital marketing more efficient and pers...
21/01/2025
More than 90 million new vehicles are introduced to roads across the globe every...
16/01/2025
Time to suit up, members. The multiverse is about to get a whole lot cloudier as GeForce NOW opens a portal to the first season of hit game Marvel Rivals from N...
16/01/2025
AI agents are poised to transform productivity for the world's billion knowledge workers with knowledge robots that can accomplish a variety of tasks. To ...
15/01/2025
Troves of unwatched surgical video footage are finding new life, fueling AI tools that help make surgery safer and enhance surgical education. The Surgical Data...
14/01/2025
AI is making inroads across the entire healthcare industry - from genomic research to drug discovery, clinical trial workflows and patient care.
In a fireside ...
14/01/2025
Quantum computing is one of the most exciting areas in computer science, promising progress in accelerated computing beyond what's considered possible today...
13/01/2025
For decades, leadership in computing and software ecosystems has been a cornerst...