Sony Pixel Power calrec Sony

Microsoft and NVIDIA Supercharge AI Development on RTX AI PCs

19/11/2024

Generative AI-powered laptops and PCs are unlocking advancements in gaming, content creation, productivity and development. Today, over 600 Windows apps and games are already running AI locally on more than 100 million GeForce RTX AI PCs worldwide, delivering fast, reliable and low-latency performance.

At Microsoft Ignite, NVIDIA and Microsoft announced tools to help Windows developers quickly build and optimize AI-powered apps on RTX AI PCs, making local AI more accessible. These new tools enable application and game developers to harness powerful RTX GPUs to accelerate complex AI workflows for applications such as AI agents, app assistants and digital humans.

RTX AI PCs Power Digital Humans With Multimodal Small Language Models Meet James, an interactive digital human knowledgeable about NVIDIA and its products. James uses a collection of NVIDIA NIM microservices, NVIDIA ACE and ElevenLabs digital human technologies to provide natural and immersive responses. NVIDIA ACE is a suite of digital human technologies that brings life to agents, assistants and avatars. To achieve a higher level of understanding so that they can respond with greater context-awareness, digital humans must be able to visually perceive the world like humans do.

Enhancing digital human interactions with greater realism demands technology that enables perception and understanding of their surroundings with greater nuance. To achieve this, NVIDIA developed multimodal small language models that can process both text and imagery, excel in role-playing and are optimized for rapid response times.

The NVIDIA Nemovision-4B-Instruct model, soon to be available, uses the latest NVIDIA VILA and NVIDIA NeMo framework for distilling, pruning and quantizing to become small enough to perform on RTX GPUs with the accuracy developers need.

The model enables digital humans to understand visual imagery in the real world and on the screen to deliver relevant responses. Multimodality serves as the foundation for agentic workflows and offers a sneak peek into a future where digital humans can reason and take action with minimal assistance from a user.

NVIDIA is also introducing the Mistral NeMo Minitron 128k Instruct family, a suite of large-context small language models designed for optimized, efficient digital human interactions, coming soon. Available in 8B-, 4B- and 2B-parameter versions, these models offer flexible options for balancing speed, memory usage and accuracy on RTX AI PCs. They can handle large datasets in a single pass, eliminating the need for data segmentation and reassembly. Built in the GGUF format, these models enhance efficiency on low-power devices and support compatibility with multiple programming languages.

Turbocharge Gen AI With NVIDIA TensorRT Model Optimizer for Windows When bringing models to PC environments, developers face the challenge of limited memory and compute resources for running AI locally. And they want to make models available to as many people as possible, with minimal accuracy loss.

Today, NVIDIA announced updates to NVIDIA TensorRT Model Optimizer (ModelOpt) to offer Windows developers an improved way to optimize models for ONNX Runtime deployment.

With the latest updates, TensorRT ModelOpt enables models to be optimized into an ONNX checkpoint for deploying the model within ONNX runtime environments - using GPU execution providers such as CUDA, TensorRT and DirectML.

TensorRT-ModelOpt includes advanced quantization algorithms, such as INT4-Activation Aware Weight Quantization. Compared to other tools such as Olive, the new method reduces the memory footprint of the model and improves throughput performance on RTX GPUs.

During deployment, the models can have up to 2.6x reduced memory footprint compared to FP16 models. This results in faster throughput, with minimal accuracy degradation, allowing them to run on a wider range of PCs.

Learn more about how developers on Microsoft systems, from Windows RTX AI PCs to NVIDIA Blackwell-powered Azure servers, are transforming how users interact with AI on a daily basis.
LINK: https://blogs.nvidia.com/blog/ai-decoded-microsoft-ignite-rtx/...
See more stories from nvidia

Most recent headlines

05/02/2025

ESPN Viewing Hits Decade-Long Highs in January

ESPN is reporting that its NFL playoffs and College Football Playoff games in January boosted audiences to levels not seen at the network in years....

05/02/2025

Roku Remains Top U.S. Streaming Device

A new study from Pixalate indicates that Roku remains by far the dominant player among streaming devices in North America where its market share was more than d...

05/02/2025

Panasonic To Ship New 4K 60p 10-Bit Camcorders In March

NEWARK, N.J. Panasonic next month will release new 4K 60p 10-bit professional camcorder models, including the AG-CX20, HC-X1200 and HC-X2100 for video productio...

05/02/2025

Tegna Shuts Down National Fact-Checking Operation

WASHINGTON Tegna has shut down Verify, the station group's national fact-checking operation, laying off around 18 journalists, producers, researchers and ot...

05/02/2025

FCC Chair Brendan Carr Fills More Key Staff Positions

WASHINGTON, D.C. Federal Communications Commission chair Brendan Carr continues to build out his leadership team with the announcement of more staff appointment...

05/02/2025

Sanctuary Pictures Unveils Punk-Horror Feature Penny Lane Is Dead

04 02 2025 - Media release Sanctuary Pictures Unveils Punk-Horror Feature Penny Lane Is Dead Writer/Director of Penny Lane Is Dead, Mia'Kate Russell Sanc...

05/02/2025

Faculty Notes: Fall 2024

Faculty Notes: Fall 2024 Recent accomplishments, releases, and events by Berklee faculty. February 4, 2025 Professor Tomo Fujita played the John Coltrane In...

05/02/2025

Other Voices makes its return to RT this Spring with an incredible lineup

Laura Marling, Lisa O'Neill, James Dean Bradfield, Jacob Alon, Bashy, Morgana and more to feature in Other Voices Series 23 this Spring RT 2 & RT Player |...

05/02/2025

AI Pays Off: Survey Reveals Financial Industry's Latest Technological Trends

The financial services industry is reaching an important milestone with AI, as organizations move beyond testing and experimentation to successful AI implementa...

05/02/2025

How GeForce RTX 50 Series GPUs Are Built to Supercharge Generative AI on PCs

NVIDIA's GeForce RTX 5090 and 5080 GPUs - which are based on the groundbreaking NVIDIA Blackwell architecture -offer up to 8x faster frame rates with NVIDIA...

04/02/2025

Spotify Reports Fourth Quarter 2024 Earnings

Today, we announced our fourth quarter 2024 earnings, closing Q4 stronger than ever by outperforming across key metrics and celebrating our first full year of p...

04/02/2025

Spotify rapporterar intkter fr fjrde kvartalet 2024

Idag rapporterar vi int kter f r fj rde kvartalet 2024. Vi avslutade Q4 starkare n n gonsin genom att vertr ffa f rv ntningarna p v ra nyckeltal och kan d rm...

04/02/2025

SGL Carbon opts for green electricity at its German sites

As a technology-based company and one of the worlds leading companies in the development and production of carbon-based solutions, SGL Carbon opts for innovativ...

04/02/2025

ST Engineering iDirect Names Sridhar Kuppanna as Chief Technology Officer

Ground segment technology innovator appoints new CTO to execute bold technological vision Herndon, Va., February 4, 2025 ST Engineering iDirect, global leade...

04/02/2025

L3Harris Signs Multi-Year Pilot Training Agreement With Thai Airways

L3Harris has signed a two-year agreement with Thai Airways International to provide training service on its A320 Full Flight Simulator (FFS). This significant a...

04/02/2025

US Air Force Completes First Flight of L3Harris Viper Shield Electronic Warfare System

L3Harris' all-digital electronic warfare suite, Viper Shield , completed its...

04/02/2025

Radio Botswana chooses Calrec's IP-native Type R mixing system

The shift from analogue to IP was driven by a desire for greater flexibility in our operations. IP simplifies connectivity, reduces the physical footprint of th...

04/02/2025

Simplifying Gray Media News Operations with Calrec's Type R

Streamline, standardise and save: how Gray Media has simplified news operations At TVNewsCheck's News Tech Forum 2024, Gray Media's Peter Gogas and Calr...

04/02/2025

Bending Spoons closes $233 million acquisition of Brightcove

Boston, MA-February 4, 2025 | Bending Spoons, the Italy-based technology company, completed its previously announced acquisition of US-based streaming technolog...

04/02/2025

Grup Mediapro to Collaborate with Google Cloud on Gen AI

BARCELONA Grup Mediapro and Google Cloud have expanded their collaboration to create an innovation lab focused on generative AI to develop solutions for the med...

04/02/2025

EditShare Receives SOCE 2 Type II Certification

WATERTOWN, Mass. EditShare this week said it has received SOC 2 Type II certification, an independently audited evaluation of an organization's information ...

04/02/2025

Executive Creative Director Halle Petro Named Partner of Sonic Union

Executive Creative Director Halle Petro Named Partner of Sonic Union Brie Clayton February 4, 2025 0 Comments Sonic Union is excited to announce Execu...

04/02/2025

Blackmagic Design Announces Blackmagic Camera for Android 2.0 Update

Blackmagic Design Announces Blackmagic Camera for Android 2.0 Update Brie Clayton February 4, 2025 0 Comments New update adds support for Xiaomi Pad 6...

04/02/2025

CETA Software Launches Artist Access: The Time-Tracking Tool for Creative Teams

CETA Software Launches Artist Access: The Time-Tracking Tool for Creative Teams Brie Clayton February 4, 2025 0 Comments CETA Software, creators of p...

04/02/2025

OWC Announces General Availability Launch of OWC Dock Ejector 2.0

OWC Announces General Availability Launch of OWC Dock Ejector 2.0 Brie Clayton February 4, 2025 0 Comments The Ultimate Tool for Efficiently and Safel...

04/02/2025

Colourist Claudio Del Bravo on grading Queer

Explaining the process to TVBEurope, Del Bravo said the films look was inspired by the Technicolor three-strip' process, evoking the rich colours of early ...

04/02/2025

Paramount, Nielsen Sign Multiyear Measurement and Analytics Deal

NEW YORK Paramount Global and Nielsen have inked a new, multiyear deal that will provide measurement for all of the company's platforms, including national ...

04/02/2025

2d Animated Short Concerning a Project for Schools

2d Animated Short Concerning a Project for Schools Brie Clayton February 3, 2025 0 Comments 2d animated short concerning a project for schools Febru...

04/02/2025

Step by step guide to using 3D Models in After Effects

Step by step guide to using 3D Models in After Effects Graham Quince February 3, 2025 0 Comments Since 2024, Adobe After Effects has had native suppor...

04/02/2025

Powerful Premiere Automation with new Excalibur Update

Powerful Premiere Automation with new Excalibur Update Colin Smith February 3, 2025 0 Comments This tutorial takes you through the new update for auto...

04/02/2025

Cinematography of A Complete Unknown: Shooting 12,800 iso Sony Venice 2 to create a 1960's era film

Cinematography of A Complete Unknown: Shooting 12,800 iso Sony Venice 2 to creat...

04/02/2025

DIY to DA: Ela Minus Breaks Through

DIY to D A: Ela Minus Breaks Through The electronic artist and producer tells Rolling Stone about her new album, D A, and how shes forged a career outside the...

04/02/2025

NVIDIA Blackwell Now Generally Available in the Cloud

AI reasoning models and agents are set to transform industries, but delivering their full potential at scale requires massive compute and optimized software. Th...

04/02/2025

The Future of Football? Technology and Entertainment Merge in the Kings World Cup Nations

The future of football? Technology and entertainment merge in the Kings World Cu...

04/02/2025

Virtual Production and AR Graphics: Demystifying the Tools, Technologies, and Trends

Virtual Production and AR Graphics: Demystifying the Tools, Technologies, and Tr...

04/02/2025

SVG All-Stars: Russell Fink, Senior Director, Programming and Content Analytics, SNY

SVG All-Stars: Russell Fink, Senior Director, Programming and Content Analytics,...

04/02/2025

SVG New Sponsor Spotlight: farmerswife's Jodi Clifford on Organizing Your Productions Like a Professional

SVG New Sponsor Spotlight: farmerswife's Jodi Clifford on Organizing Your Pr...

04/02/2025

EA Acquires TRACAB Technologies as It Looks to Move Beyond Games

EA Acquires TRACAB Technologies as It Looks to Move Beyond Games EA believes TRACABs sports tracking/analysis technology will help to make the EA SPORTS App the...

04/02/2025

Kingdom Come: Alamiya Media on Bringing the Supercoppa Italiana and Supercopa de Espaa to Saudi Arabia

Kingdom come: Alamiya Media on bringing the Supercoppa Italiana and Supercopa de...

04/02/2025

Alamiya Media at 50: Preparing for Rapid Change, an International Broadcast Center and the FIFA World Cup

Alamiya Media at 50: Preparing for rapid change, an international broadcast cent...

04/02/2025

An update on our TV and broadband prices

An update on our TV and broadband pricesTuesday 4 February 2025 An update on our TV and broadband prices Devesh Raj, Chief Operating Officer, Sky This April,...

04/02/2025

Sky extends partnership with the PDC to remain the home of darts until 2030

Sky extends partnership with the PDC to remain the home of darts until 2030Tuesday 4 February 2025 Following another record-breaking PDC World Darts Championsh...

04/02/2025

Frankfurt is the world's first airport to regularly use walk-through scanners from Rohde & Schwarz for passengers

Frankfurt is the world's first airport to regularly use walk-through scanner...

04/02/2025

Riedel Unveils Next Generation of StageLink Edge Devices

Wuppertal February 4, 2025 Riedel Unveils Next Generation of StageLink Edge DevicesRiedel Communications today announced the launch of its StageLink family of...

04/02/2025

Fox Corporation Reports Second Quarter Fiscal 2025 Financial Results

Fox Corporation Reports Second Quarter Fiscal 2025 Financial Results NEW YORK, NY, February 4, 2025 - Fox Corporation (Nasdaq: FOXA, FOX; FOX or the Compan...

04/02/2025

Introducing our fully digital, true diversity wideband wireless mic solution

DPA Microphones is moving into the wireless market with the release of its new N-Series Digital Wireless System at ISE 2025 (Stand 7P600). A fully digital, true...

04/02/2025

2025-02-04

CUPERTINO, CALIFORNIA Apple today introduced Apple Invites, a new app for iPhone that helps users create custom invitations to gather friends and family for any...

04/02/2025

ABS appoints Sameer Karimbhai as New General Counsel

ABS appoints Sameer Karimbhai as New General Counsel...