Sony Pixel Power calrec Sony

How NVIDIA AI Foundry Lets Enterprises Forge Custom Generative AI Models

23/07/2024

Businesses seeking to harness the power of AI need customized models tailored to their specific industry needs.

NVIDIA AI Foundry is a service that enables enterprises to use data, accelerated computing and software tools to create and deploy custom models that can supercharge their generative AI initiatives.

Just as TSMC manufactures chips designed by other companies, NVIDIA AI Foundry provides the infrastructure and tools for other companies to develop and customize AI models - using DGX Cloud, foundation models, NVIDIA NeMo software, NVIDIA expertise, as well as ecosystem tools and support.

The key difference is the product: TSMC produces physical semiconductor chips, while NVIDIA AI Foundry helps create custom models. Both enable innovation and connect to a vast ecosystem of tools and partners.

Enterprises can use AI Foundry to customize NVIDIA and open community models, including the new Llama 3.1 collection, as well as NVIDIA Nemotron, CodeGemma by Google DeepMind, CodeLlama, Gemma by Google DeepMind, Mistral, Mixtral, Phi-3, StarCoder2 and others.

Industry Pioneers Drive AI Innovation Industry leaders Amdocs, Capital One, Getty Images, KT, Hyundai Motor Company, SAP, ServiceNow and Snowflake are among the first using NVIDIA AI Foundry. These pioneers are setting the stage for a new era of AI-driven innovation in enterprise software, technology, communications and media.

Organizations deploying AI can gain a competitive edge with custom models that incorporate industry and business knowledge, said Jeremy Barnes, vice president of AI Product at ServiceNow. ServiceNow is using NVIDIA AI Foundry to fine-tune and deploy models that can integrate easily within customers' existing workflows.

The Pillars of NVIDIA AI Foundry NVIDIA AI Foundry is supported by the key pillars of foundation models, enterprise software, accelerated computing, expert support and a broad partner ecosystem.

Its software includes AI foundation models from NVIDIA and the AI community as well as the complete NVIDIA NeMo software platform for fast-tracking model development.

The computing muscle of NVIDIA AI Foundry is NVIDIA DGX Cloud, a network of accelerated compute resources co-engineered with the world's leading public clouds - Amazon Web Services, Google Cloud and Oracle Cloud Infrastructure. With DGX Cloud, AI Foundry customers can develop and fine-tune custom generative AI applications with unprecedented ease and efficiency, and scale their AI initiatives as needed without significant upfront investments in hardware. This flexibility is crucial for businesses looking to stay agile in a rapidly changing market.

If an NVIDIA AI Foundry customer needs assistance, NVIDIA AI Enterprise experts are on hand to help. NVIDIA experts can walk customers through each of the steps required to build, fine-tune and deploy their models with proprietary data, ensuring the models tightly align with their business requirements.

NVIDIA AI Foundry customers have access to a global ecosystem of partners that can provide a full range of support. Accenture, Deloitte, Infosys, Tata Consultancy Services and Wipro are among the NVIDIA partners that offer AI Foundry consulting services that encompass design, implementation and management of AI-driven digital transformation projects. Accenture is first to offer its own AI Foundry-based offering for custom model development, the Accenture AI Refinery framework.

Additionally, service delivery partners such as Data Monsters, Quantiphi, Slalom and SoftServe help enterprises navigate the complexities of integrating AI into their existing IT landscapes, ensuring that AI applications are scalable, secure and aligned with business objectives.

Customers can develop NVIDIA AI Foundry models for production using AIOps and MLOps platforms from NVIDIA partners, including ActiveFence, AutoAlign, Cleanlab, DataDog, Dataiku, Dataloop, DataRobot, Deepchecks, Domino Data Lab, Fiddler AI, Giskard, New Relic, Scale, Tumeryk and Weights & Biases.

Customers can output their AI Foundry models as NVIDIA NIM inference microservices - which include the custom model, optimized engines and a standard API - to run on their preferred accelerated infrastructure.

Inferencing solutions like NVIDIA TensorRT-LLM deliver improved efficiency for Llama 3.1 models to minimize latency and maximize throughput. This enables enterprises to generate tokens faster while reducing total cost of running the models in production. Enterprise-grade support and security is provided by the NVIDIA AI Enterprise software suite.

NVIDIA NIM and TensorRT-LLM minimize inference latency and maximize throughput for Llama 3.1 models to generate tokens faster. The broad range of deployment options includes NVIDIA-Certified Systems from global server manufacturing partners including Cisco, Dell Technologies, Hewlett Packard Enterprise, Lenovo and Supermicro, as well as cloud instances from Amazon Web Services, Google Cloud and Oracle Cloud Infrastructure.

Additionally, Together AI, a leading AI acceleration cloud, today announced it will enable its ecosystem of over 100,000 developers and enterprises to use its NVIDIA GPU-accelerated inference stack to deploy Llama 3.1 endpoints and other open models on DGX Cloud.

Every enterprise running generative AI applications wants a faster user experience, with greater efficiency and lower cost, said Vipul Ved Prakash, founder and CEO of Together AI. Now, developers and enterprises using the Together Inference Engine can maximize performance, scalability and security on NVIDIA DGX Cloud.

NVIDIA NeMo Speeds and Simplifies Custom Model Development With NVIDIA NeMo integrated into AI Foundry, developers have at their fingertips the tools needed to curate data, customize foundation models and evaluate performance. NeMo technologies include:

NeMo Curator is a GPU-accelerated data
LINK: https://blogs.nvidia.com/blog/ai-foundry-enterprise-generative-ai/...
See more stories from nvidia

More from Nvidia

06/09/2024

How AI Is Personalizing Customer Service Experiences Across Industries

Customer service departments across industries are facing increased call volumes, high customer service agent turnover, talent shortages and shifting customer e...

05/09/2024

19 New Games to Drop for GeForce NOW in September

Fall will be here soon, so leaf it to GeForce NOW to bring the games, with 19 joining the cloud in September. Get started with the seven games available to str...

05/09/2024

Three Ways to Ride the Flywheel of Cybersecurity AI

The business transformations that generative AI brings come with risks that AI itself can help secure in a kind of flywheel of progress. Companies who were qui...

04/09/2024

Volvo Cars EX90 SUV Rolls Out, Built on NVIDIA Accelerated Computing and AI

Volvo Cars' new, fully electric EX90 is making its way from the automaker's assembly line in Charleston, South Carolina, to dealerships around the U.S. ...

04/09/2024

Do the Math: New RTX AI PC Hardware Delivers More AI, Faster

Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and showcases new hardware, softwa...

04/09/2024

Hammer Time: Machina Labs' Edward Mehr on Autonomous Blacksmith Bots and More

Edward Mehr works where AI meets the anvil. The company he cofounded, Machina L...

04/09/2024

Manufacturing Intelligence: Deltia AI Delivers Assembly Line Gains With NVIDIA Metropolis and Jetson

It all started at Berlin's Merantix venture studio in 2022, when Silviu Homo...

29/08/2024

From RAG to Richness: Startup Uplevels Retrieval-Augmented Generation for Enterprises

Well before OpenAI upended the technology industry with its release of ChatGPT i...

29/08/2024

Crystal-Clear Gaming: Visions of Mana' Sharpens on GeForce NOW

It's time to mana-fest the spirit of adventure with Square Enix's highly anticipated action role-playing game, Visions of Mana, launching today in the c...

28/08/2024

NVIDIA Blackwell Sets New Standard for Generative AI in MLPerf Inference Debut

As enterprises race to adopt generative AI and bring new services to market, the demands on data center infrastructure have never been greater. Training large l...

28/08/2024

More Than Fine: Multi-LoRA Support Now Available in NVIDIA RTX AI Toolkit

Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and showcases new hardware, softwa...

27/08/2024

From Prototype to Prompt: NVIDIA NIM Agent Blueprints Fast-Forward Next Wave of Enterprise Generative AI

The initial wave of generative AI was driven by its use in internet services tha...

27/08/2024

Better Molecules, Faster: NVIDIA NIM Agent Blueprint Redefines Hit Identification With Generative AI-Based Virtual Screening

Aiming at making the process faster and smarter, NVIDIA on Wednesday released th...

26/08/2024

NVIDIA Launches NIM Microservices for Generative AI in Japan, Taiwan

Nations around the world are pursuing sovereign AI to produce artificial intelligence using their own computing infrastructure, data, workforce and business net...

23/08/2024

NVIDIA to Present Innovations at Hot Chips That Boost Data Center Performance and Energy Efficiency

A deep technology conference for processor and system architects from industry a...

22/08/2024

Straight Out of Gamescom and Into Xbox PC Games, GeForce NOW Newly Supports Automatic Xbox Sign-In

Straight out of Gamescom, NVIDIA introduced GeForce NOW support for Xbox automat...

21/08/2024

How Snowflake Is Unlocking the Value of Data With Large Language Models

Snowflake is using AI to help enterprises transform data into insights and applications. In this episode of NVIDIA's AI Podcast, host Noah Kravitz and Baris...

21/08/2024

Lightweight Champ: NVIDIA Releases Small Language Model With State-of-the-Art Accuracy

Developers of generative AI typically face a tradeoff between model size and acc...

21/08/2024

SLMming Down Latency: How NVIDIA's First On-Device Small Language Model Makes Digital Humans More Lifelike

Editor's note: This post is part of the AI Decoded series, which demystifies...

20/08/2024

NVIDIA Showcases New AI Capabilities With ACE, RTX Games and More at Gamescom 2024

At Gamescom, the world's biggest gaming expo, NVIDIA has once again pushed t...

20/08/2024

High-Tech Highways: India Uses NVIDIA Accelerated Computing to Ease Tollbooth Traffic

India is home to the globe's second-largest road network, spanning nearly 4 ...

20/08/2024

Level Up: NVIDIA, MediaTek to Bring G-SYNC Display Technologies to More Gamers

Picture this: NVIDIA and MediaTek are working together to make the industry's best gaming display technologies more accessible to gamers globally. The comp...

20/08/2024

NVIDIA Announces First Digital Human Technologies On-Device Small Language Model, Improving Conversation for Game Characters

NVIDIA's first digital human technology small language model is being demons...

20/08/2024

At Gamescom 2024, GeForce NOW Brings Black Myth: Wukong' and FINAL FANTASY XVI Demo' to the Cloud

Each week, GeForce NOW elevates cloud gaming by bringing top PC games and new up...

19/08/2024

AI Chases the Storm: New NVIDIA Research Boosts Weather Prediction, Climate Simulation

As hurricanes, tornadoes and other extreme weather events occur with increased f...

15/08/2024

GeForce NOW and CurseForge Bring Mod Support to World of Warcraft: The War Within' in the Cloud

Time to be wowed: GeForce NOW members can now stream World of Warcraft on suppor...

14/08/2024

Decoding NVIDIA Edify - The Technology That Helps Developers Create Custom Models Trained on Their Data

Editor's note: This post is part of the AI Decoded series, which demystifies...

13/08/2024

Applications Now Open for $60,000 NVIDIA Graduate Fellowship Awards

Bringing together the world's brightest minds and the latest accelerated computing technology leads to powerful breakthroughs that help tackle some of the b...

09/08/2024

Golden Opportunities: California to Train Students, Educators in AI

The State of California today announced a first-of-its-kind AI education initiative with NVIDIA. The public-private collaboration supports the state's goal...

08/08/2024

GeForce NOW Celebrates 2,000 Games in the Cloud

Editor's note: This blog was updated on Aug. 9 to reflect changes to the availability of Warhammer 40,000: Speed Freeks.' This GFN Thursday marks 2,00...

08/08/2024

Figure Unveils Next-Gen Conversational Humanoid Robot With 3x AI Computing for Fully Autonomous Tasks

Silicon Valley's Figure has taken the wraps off of its next-generation Figur...

07/08/2024

Problem Solved: STEM Studies Supercharged With RTX and AI Technologies

Editor's note: This post is part of the AI Decoded series, which demystifies AI by making the technology more accessible, and showcases new hardware, softwa...

07/08/2024

Recursion CEO Chris Gibson on Accelerating the Biopharmaceutical Industry With AI

Techbio is a field combining data, technology and biology to enhance scientific ...

06/08/2024

Meet the Maker: High School Student Develops Robot Guide Dogs With NVIDIA Jetson

High school student Selin Alara Ornek is looking ahead - using machine learning and the NVIDIA Jetson platform for edge AI and robotics to create robot guide do...

06/08/2024

Editor's Paradise: NVIDIA RTX-Powered Video Software CyberLink PowerDirector Gains High-Efficiency Video Coding Upgrades

Editor's note: This post is part of our In the NVIDIA Studio series, which c...

01/08/2024

August Adventures Await: 18 New Games Coming to GeForce NOW

Members can choose their own adventure with GeForce NOW bringing 18 new games to the cloud in August - including Square Enix's fantasy role-playing game Vis...

31/07/2024

Oracle Cloud Infrastructure Expands NVIDIA GPU-Accelerated Instances for AI, Digital Twins and More

Enterprises are rapidly adopting generative AI, large language models (LLMs), ad...

31/07/2024

NVIDIA Researchers Harness Real-Time Gen AI to Build Immersive Desert World

NVIDIA researchers used NVIDIA Edify, a multimodal architecture for visual generative AI, to build a detailed 3D desert landscape within a few minutes in a live...

31/07/2024

NVIDIA and Zoox Pave the Way for Autonomous Ride-Hailing

In celebration of Zoox's 10th anniversary, NVIDIA founder and CEO Jensen Huang recently joined the robotaxi company's CEO, Aicha Evans, and its cofounde...

29/07/2024

For Your Edification: Shutterstock Releases Generative 3D, Getty Images Upgrades Service Powered by NVIDIA

Designers and artists have new and improved ways to boost their productivity wit...

29/07/2024

AI Gets Physical: New NVIDIA NIM Microservices Bring Generative AI to Digital Environments

Millions of people already use generative AI to assist in writing and learning. ...

29/07/2024

Hugging Face Offers Developers Inference-as-a-Service Powered by NVIDIA NIM

One of the world's largest AI communities - comprising 4 million developers on the Hugging Face platform - is gaining easy access to NVIDIA-accelerated infe...

29/07/2024

New NVIDIA Digital Human Technologies Enhance Customer Interactions Across Industries

Generative AI is unlocking new ways for enterprises to engage customers through ...

29/07/2024

NVIDIA Supercharges Digital Marketing With Greater Control Over Generative AI

The world's brands and agencies are using generative AI to create advertising and marketing content, but it doesn't always provide the desired outputs. ...

29/07/2024

Reality Reimagined: NVIDIA Introduces fVDB to Build Bigger Digital Models of the World

NVIDIA announced at SIGGRAPH fVDB, a new deep-learning framework for generating ...

29/07/2024

Recipe for Magic: WPP and NVIDIA Omniverse Help The Coca-Cola Company Scale Generative AI Content That Pops With Brand Authenticity

When The Coca-Cola Company produces thirst-quenching marketing, the creative ele...

29/07/2024

Everybody Will Have an AI Assistant,' NVIDIA CEO Tells SIGGRAPH Audience

The generative AI revolution - with deep roots in visual computing - is amplifying human creativity even as accelerated computing promises significant gains in ...

29/07/2024

Creators To Have Personalized AI Assistants, Meta CEO Mark Zuckerberg Tells NVIDIA CEO Jensen Huang

In a highly anticipated fireside chat at SIGGRAPH 2024, NVIDIA founder and CEO J...