From Generative to Agentic AI, Wrapping the Year's AI Advancements

24/12/2024

Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and showcases new hardware, software, tools and accelerations for GeForce RTX PC and NVIDIA RTX workstation users.

The AI Decoded series over the past year has broken down all things AI - from simplifying the complexities of large language models (LLMs) to highlighting the power of RTX AI PCs and workstations.

Recapping the latest AI advancements, this roundup highlights how the technology has changed the way people write, game, learn and connect with each other online.

NVIDIA GeForce RTX GPUs offer the power to deliver these experiences on PC laptops, desktops and workstations. They feature specialized AI Tensor Cores that can deliver more than 1,300 trillion operations per second (TOPS) of processing power for cutting-edge performance in gaming, creating, everyday productivity and more. For workstations, NVIDIA RTX GPUs deliver over 1,400 TOPS, enabling next-level AI acceleration and efficiency.

Unlocking Productivity and Creativity With AI-Powered Chatbots

AI Decoded earlier this year explored what LLMs are, why they matter and how to use them.

For many, tools like ChatGPT were their first introduction to AI. LLM-powered chatbots have transformed computing from basic, rule-based interactions to dynamic conversations. They can suggest vacation ideas, write customer service emails, spin up original poetry and even write code for users.

Introduced in March, ChatRTX is a demo app that lets users personalize a GPT LLM with their own content, such as documents, notes and images.

With features like retrieval-augmented generation (RAG), NVIDIA TensorRT-LLM and RTX acceleration, ChatRTX enables users to quickly search and ask questions about their own data. And since the app runs locally on RTX PCs or workstations, results are both fast and private.
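To make the RAG idea concrete, here is a minimal sketch of the retrieve-then-prompt pattern. It is a toy under loud assumptions: it ranks notes by simple word overlap (bag-of-words cosine similarity) rather than the embedding models and TensorRT-LLM pipeline that ChatRTX relies on, and the `NOTES` list and all function names are hypothetical, invented only for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase a string and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, documents, k=1):
    """Return the k documents most similar to the query (toy retriever)."""
    q = Counter(tokenize(query))
    ranked = sorted(
        documents,
        key=lambda d: cosine(q, Counter(tokenize(d))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(query, documents, k=1):
    """Place the retrieved text ahead of the question, as a RAG prompt would."""
    context = "\n".join(retrieve(query, documents, k))
    return (
        "Use only this context to answer.\n\n"
        f"Context:\n{context}\n\nQuestion: {query}"
    )


# Hypothetical personal notes standing in for a user's own documents.
NOTES = [
    "The quarterly budget review is scheduled for Friday at 10 a.m.",
    "Remember to renew the domain name before March.",
    "The hiking trip packing list includes boots, water and a map.",
]
```

Calling `build_prompt("When is the budget review?", NOTES)` produces a prompt whose context is the budget note. In a real RAG application, that prompt would then be passed to the LLM, which answers from the retrieved text rather than from memory alone.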

NVIDIA offers the broadest selection of foundation models for enthusiasts and developers, including Gemma 2, Mistral and Llama 3. These models can run locally on NVIDIA GeForce and RTX GPUs for fast, secure performance without needing to rely on cloud services.

Download ChatRTX today.

Introducing RTX-Accelerated Partner Applications

AI is being incorporated into more and more apps and use cases, including games, content creation apps, software development and productivity tools.

This expansion is fueled by the wide selection of RTX-accelerated developer and community tools, software development kits, models and frameworks that have made it easier than ever to run models locally in popular applications.

AI Decoded in October spotlighted how Brave Browser's Leo AI, powered by NVIDIA RTX GPUs and the open-source Ollama platform, enables users to run local LLMs like Llama 3 directly on their RTX PCs or workstations.

This local setup offers fast, responsive AI performance while keeping user data private - without relying on the cloud. NVIDIA's optimizations for tools like Ollama offer accelerated performance for tasks like summarizing articles, answering questions and extracting insights, all directly within the Brave browser. Users can switch between local and cloud models, providing flexibility and control over their AI experience.

For simple instructions on how to add local LLM support via Ollama, read Brave's blog. Once configured to point to Ollama, Leo AI will use the locally hosted LLM for prompts and queries.
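For readers curious what talking to a locally hosted model looks like outside the browser, the sketch below sends a prompt to Ollama's local REST endpoint. It assumes Ollama is running on its default port (11434) and that a model tagged `llama3` has already been pulled. The helper names `build_request` and `ask_local_llm` are hypothetical, and this is not a description of how Leo AI connects internally.

```python
import json
import urllib.request

# Ollama's default local endpoint; adjust if your install uses another port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt, model="llama3"):
    """Build a non-streaming generate request for a locally hosted model."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask_local_llm(prompt, model="llama3"):
    """Send the prompt to the local server and return the generated text.

    Requires a running Ollama server; nothing leaves the machine.
    """
    with urllib.request.urlopen(build_request(prompt, model)) as resp:
        return json.loads(resp.read())["response"]
```

With the server running, `ask_local_llm("Summarize this article in two sentences.")` returns the model's reply as a string. Because the request goes to `localhost`, the prompt and the response stay on the PC.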

Agentic AI - Enabling Complex Problem-Solving

Agentic AI is the next frontier of AI, capable of using sophisticated reasoning and iterative planning to autonomously solve complex, multi-step problems.

AI Decoded explored how the AI community is experimenting with the technology to create smarter, more capable AI systems.

Partner applications like AnythingLLM showcase how AI is going beyond simple question-answering to improving productivity and creativity. Users can harness the application to deploy built-in agents that can tackle tasks like searching the web or scheduling meetings.

Example of a user invoking an AI agent in AnythingLLM to complete a web search query.

AnythingLLM lets users interact with documents through intuitive interfaces, automate complex tasks with AI agents and run advanced LLMs locally. Harnessing the power of RTX GPUs, it delivers faster, smarter and more responsive AI workflows - all within a single local desktop application. The application also works offline and is fast and private, capable of using local data and tools typically inaccessible with cloud-based solutions.

AnythingLLM's Community Hub lets anyone easily access system prompts that can help them steer LLM behavior, discover productivity-boosting slash commands and build specialized AI agent skills for unique workflows and custom tools.
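The idea of an agent that plans steps and calls skills can be sketched in a few lines. This is not AnythingLLM's API: the skill functions are stubs, and `scripted_planner` is a hypothetical stand-in for the LLM that would normally decide which skill to invoke next. Every name below is an assumption made for illustration.

```python
def web_search(query):
    """Stand-in skill: a real agent would call a search API here."""
    return f"(stub) top result for '{query}'"


def schedule_meeting(topic):
    """Stand-in skill: a real agent would call a calendar API here."""
    return f"(stub) meeting booked: {topic}"


# Registry mapping skill names to the functions that implement them.
SKILLS = {"web_search": web_search, "schedule_meeting": schedule_meeting}


def scripted_planner(goal, history):
    """Hypothetical stand-in for the LLM.

    Returns the next (skill, argument) step, or None when the goal is done.
    """
    plan = [("web_search", goal), ("schedule_meeting", f"Review: {goal}")]
    return plan[len(history)] if len(history) < len(plan) else None


def run_agent(goal, planner=scripted_planner, max_steps=5):
    """Iteratively ask the planner for a step, run the skill, record the result."""
    history = []
    for _ in range(max_steps):
        step = planner(goal, history)
        if step is None:
            break
        skill, argument = step
        history.append((skill, SKILLS[skill](argument)))
    return history
```

Running `run_agent("RTX AI PCs")` performs a search step followed by a scheduling step and returns both results. Swapping the scripted planner for a model call, and the stubs for real tools, is what turns this loop into a working agent. The `max_steps` cap is a simple guard against a planner that never finishes.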

By enabling users to run agentic AI workflows on their own systems with full privacy, AnythingLLM is fueling innovation and making it easier to experiment with the latest technologies.

AI Decoded Wrapped

Over 600 Windows apps and games today are already running AI locally on more than 100 million GeForce RTX AI PCs and workstations worldwide, delivering fast, reliable and low-latency performance. Learn more about NVIDIA GeForce RTX AI PCs and NVIDIA RTX AI workstations.

Tune in to the CES keynote delivered by NVIDIA founder and CEO Jensen Huang on Jan. 6 to discover how the latest in AI is supercharging gaming, content creation and development.

Generative AI is transforming gaming, videoconferencing and interactive experiences of all kinds. Make sense of what's new and what's next by subscribing to the AI Decoded newsletter.
LINK: https://blogs.nvidia.com/blog/ai-decoded-recap-ai-pc-rtx-ai/...