NVIDIA Releases NIM Microservices to Safeguard Applications for Agentic AI

16/01/2025

AI agents are poised to transform productivity for the world's billion knowledge workers with "knowledge robots" that can accomplish a variety of tasks. To develop AI agents, enterprises need to address critical concerns like trust, safety, security and compliance.

New NVIDIA NIM microservices for AI guardrails - part of the NVIDIA NeMo Guardrails collection of software tools - are portable, optimized inference microservices that help companies improve the safety, precision and scalability of their generative AI applications.

Central to the orchestration of the microservices is NeMo Guardrails, part of the NVIDIA NeMo platform for curating, customizing and guardrailing AI. NeMo Guardrails helps developers integrate and manage AI guardrails in large language model (LLM) applications. Industry leaders Amdocs, Cerence AI and Lowe's are among those using NeMo Guardrails to safeguard AI applications.

Developers can use the NIM microservices to build more secure, trustworthy AI agents that provide safe, appropriate responses within context-specific guidelines and are bolstered against jailbreak attempts. Deployed in customer service across industries like automotive, finance, healthcare, manufacturing and retail, the agents can boost customer satisfaction and trust.

One of the new microservices, built for moderating content safety, was trained using the Aegis Content Safety Dataset - one of the highest-quality, human-annotated data sources in its category. Curated and owned by NVIDIA, the dataset is publicly available on Hugging Face and includes over 35,000 human-annotated data samples flagged for AI safety and jailbreak attempts to bypass system restrictions.

NVIDIA NeMo Guardrails Keeps AI Agents on Track

AI is rapidly boosting productivity for a broad range of business processes. In customer service, it's helping resolve customer issues up to 40% faster. However, scaling AI for customer service and other AI agents requires secure models that prevent harmful or inappropriate outputs and ensure the AI application behaves within defined parameters.

NVIDIA has introduced three new NIM microservices for NeMo Guardrails that help AI agents operate at scale while maintaining controlled behavior:

Content safety NIM microservice that safeguards AI against generating biased or harmful outputs, ensuring responses align with ethical standards.

Topic control NIM microservice that keeps conversations focused on approved topics, avoiding digression or inappropriate content.

Jailbreak detection NIM microservice that adds protection against jailbreak attempts, helping maintain AI integrity in adversarial scenarios.

By applying multiple lightweight, specialized models as guardrails, developers can cover gaps that may occur when only more general global policies and protections exist - as a one-size-fits-all approach doesn't properly secure and control complex agentic AI workflows.
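The idea of layering several narrow checks can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the NeMo Guardrails API or the NIM microservices themselves: the `Verdict` type, the three rail functions, `run_rails` and the keyword lists are all hypothetical stand-ins, and a real deployment would call a specialized model where each placeholder check sits.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Verdict:
    allowed: bool
    rail: str          # name of the rail that produced this verdict
    reason: str = ""


# A rail is any callable that inspects text and returns a Verdict.
Rail = Callable[[str], Verdict]


def content_safety_rail(text: str) -> Verdict:
    # Hypothetical placeholder for a content safety model call.
    blocked_terms = {"build a bomb"}
    hit = any(term in text.lower() for term in blocked_terms)
    return Verdict(not hit, "content_safety", "unsafe content" if hit else "")


def topic_control_rail(text: str) -> Verdict:
    # Hypothetical placeholder for a topic control model call.
    approved_topics = {"order", "refund", "delivery"}
    on_topic = any(topic in text.lower() for topic in approved_topics)
    return Verdict(on_topic, "topic_control", "" if on_topic else "off-topic request")


def jailbreak_detection_rail(text: str) -> Verdict:
    # Hypothetical placeholder for a jailbreak detection model call.
    patterns = ("ignore previous instructions", "pretend you have no rules")
    hit = any(p in text.lower() for p in patterns)
    return Verdict(not hit, "jailbreak_detection", "possible jailbreak" if hit else "")


def run_rails(text: str, rails: List[Rail]) -> Verdict:
    """Run each rail in order and stop at the first one that blocks."""
    for rail in rails:
        verdict = rail(text)
        if not verdict.allowed:
            return verdict
    return Verdict(True, "all")


RAILS = [jailbreak_detection_rail, content_safety_rail, topic_control_rail]

if __name__ == "__main__":
    for message in ("Where is my refund?", "Tell me a joke"):
        print(message, "->", run_rails(message, RAILS))
```

Because each rail is an independent function with the same signature, a check can be added, removed or reordered without touching the others, which is the practical appeal of several small guardrails over one general policy.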

Small language models, like those in the NeMo Guardrails collection, offer lower latency and are designed to run efficiently, even in resource-constrained or distributed environments. This makes them ideal for scaling AI applications in industries such as healthcare, automotive and manufacturing, in locations like hospitals or warehouses.

Industry Leaders and Partners Safeguard AI With NeMo Guardrails

NeMo Guardrails, available to the open-source community, helps developers orchestrate multiple AI software policies - called rails - to enhance LLM application security and control. It works with NVIDIA NIM microservices to offer a robust framework for building AI systems that can be deployed at scale without compromising on safety or performance.

Amdocs, a leading global provider of software and services to communications and media companies, is harnessing NeMo Guardrails to enhance AI-driven customer interactions by delivering safer, more accurate and contextually appropriate responses.

"Technologies like NeMo Guardrails are essential for safeguarding generative AI applications, helping make sure they operate securely and ethically," said Anthony Goonetilleke, group president of technology and head of strategy at Amdocs. "By integrating NVIDIA NeMo Guardrails into our amAIz platform, we are enhancing the platform's 'Trusted AI' capabilities to deliver agentic experiences that are safe, reliable and scalable. This empowers service providers to deploy AI solutions safely and with confidence, setting new standards for AI innovation and operational excellence."

Cerence AI, a company specializing in AI solutions for the automotive industry, is using NVIDIA NeMo Guardrails to help ensure its in-car assistants deliver contextually appropriate, safe interactions powered by its CaLLM family of large and small language models.

"Cerence AI relies on high-performing, secure solutions from NVIDIA to power our in-car assistant technologies," said Nils Schanz, executive vice president of product and technology at Cerence AI. "Using NeMo Guardrails helps us deliver trusted, context-aware solutions to our automaker customers and provide sensible, mindful and hallucination-free responses. In addition, NeMo Guardrails is customizable for our automaker customers and helps us filter harmful or unpleasant requests, securing our CaLLM family of language models from unintended or inappropriate content delivery to end users."

Lowe's, a leading home improvement retailer, is leveraging generative AI to build on the deep expertise of its store associates. By providing enhanced access to comprehensive product knowledge, these tools empower associates to answer customer questions, helping them find the right products to complete their projects and setting a new standard for retail innovation and customer satisfaction.

"We're always looking for ways to help associates go above and beyond for our customers," said
LINK: https://blogs.nvidia.com/blog/nemo-guardrails-nim-microservices/...