Sony Pixel Power calrec Sony

How NVIDIA AI Foundry Lets Enterprises Forge Custom Generative AI Models

23/07/2024

Businesses seeking to harness the power of AI need customized models tailored to their specific industry needs.

NVIDIA AI Foundry is a service that enables enterprises to use data, accelerated computing and software tools to create and deploy custom models that can supercharge their generative AI initiatives.

Just as TSMC manufactures chips designed by other companies, NVIDIA AI Foundry provides the infrastructure and tools for other companies to develop and customize AI models - using DGX Cloud, foundation models, NVIDIA NeMo software, NVIDIA expertise, as well as ecosystem tools and support.

The key difference is the product: TSMC produces physical semiconductor chips, while NVIDIA AI Foundry helps create custom models. Both enable innovation and connect to a vast ecosystem of tools and partners.

Enterprises can use AI Foundry to customize NVIDIA and open community models, including the new Llama 3.1 collection, as well as NVIDIA Nemotron, CodeGemma by Google DeepMind, CodeLlama, Gemma by Google DeepMind, Mistral, Mixtral, Phi-3, StarCoder2 and others.

Industry Pioneers Drive AI Innovation Industry leaders Amdocs, Capital One, Getty Images, KT, Hyundai Motor Company, SAP, ServiceNow and Snowflake are among the first using NVIDIA AI Foundry. These pioneers are setting the stage for a new era of AI-driven innovation in enterprise software, technology, communications and media.

Organizations deploying AI can gain a competitive edge with custom models that incorporate industry and business knowledge, said Jeremy Barnes, vice president of AI Product at ServiceNow. ServiceNow is using NVIDIA AI Foundry to fine-tune and deploy models that can integrate easily within customers' existing workflows.

The Pillars of NVIDIA AI Foundry NVIDIA AI Foundry is supported by the key pillars of foundation models, enterprise software, accelerated computing, expert support and a broad partner ecosystem.

Its software includes AI foundation models from NVIDIA and the AI community as well as the complete NVIDIA NeMo software platform for fast-tracking model development.

The computing muscle of NVIDIA AI Foundry is NVIDIA DGX Cloud, a network of accelerated compute resources co-engineered with the world's leading public clouds - Amazon Web Services, Google Cloud and Oracle Cloud Infrastructure. With DGX Cloud, AI Foundry customers can develop and fine-tune custom generative AI applications with unprecedented ease and efficiency, and scale their AI initiatives as needed without significant upfront investments in hardware. This flexibility is crucial for businesses looking to stay agile in a rapidly changing market.

If an NVIDIA AI Foundry customer needs assistance, NVIDIA AI Enterprise experts are on hand to help. NVIDIA experts can walk customers through each of the steps required to build, fine-tune and deploy their models with proprietary data, ensuring the models tightly align with their business requirements.

NVIDIA AI Foundry customers have access to a global ecosystem of partners that can provide a full range of support. Accenture, Deloitte, Infosys, Tata Consultancy Services and Wipro are among the NVIDIA partners that offer AI Foundry consulting services that encompass design, implementation and management of AI-driven digital transformation projects. Accenture is first to offer its own AI Foundry-based offering for custom model development, the Accenture AI Refinery framework.

Additionally, service delivery partners such as Data Monsters, Quantiphi, Slalom and SoftServe help enterprises navigate the complexities of integrating AI into their existing IT landscapes, ensuring that AI applications are scalable, secure and aligned with business objectives.

Customers can develop NVIDIA AI Foundry models for production using AIOps and MLOps platforms from NVIDIA partners, including ActiveFence, AutoAlign, Cleanlab, DataDog, Dataiku, Dataloop, DataRobot, Deepchecks, Domino Data Lab, Fiddler AI, Giskard, New Relic, Scale, Tumeryk and Weights & Biases.

Customers can output their AI Foundry models as NVIDIA NIM inference microservices - which include the custom model, optimized engines and a standard API - to run on their preferred accelerated infrastructure.

Inferencing solutions like NVIDIA TensorRT-LLM deliver improved efficiency for Llama 3.1 models to minimize latency and maximize throughput. This enables enterprises to generate tokens faster while reducing total cost of running the models in production. Enterprise-grade support and security is provided by the NVIDIA AI Enterprise software suite.

NVIDIA NIM and TensorRT-LLM minimize inference latency and maximize throughput for Llama 3.1 models to generate tokens faster. The broad range of deployment options includes NVIDIA-Certified Systems from global server manufacturing partners including Cisco, Dell Technologies, Hewlett Packard Enterprise, Lenovo and Supermicro, as well as cloud instances from Amazon Web Services, Google Cloud and Oracle Cloud Infrastructure.

Additionally, Together AI, a leading AI acceleration cloud, today announced it will enable its ecosystem of over 100,000 developers and enterprises to use its NVIDIA GPU-accelerated inference stack to deploy Llama 3.1 endpoints and other open models on DGX Cloud.

Every enterprise running generative AI applications wants a faster user experience, with greater efficiency and lower cost, said Vipul Ved Prakash, founder and CEO of Together AI. Now, developers and enterprises using the Together Inference Engine can maximize performance, scalability and security on NVIDIA DGX Cloud.

NVIDIA NeMo Speeds and Simplifies Custom Model Development With NVIDIA NeMo integrated into AI Foundry, developers have at their fingertips the tools needed to curate data, customize foundation models and evaluate performance. NeMo technologies include:

NeMo Curator is a GPU-accelerated data
LINK: https://blogs.nvidia.com/blog/ai-foundry-enterprise-generative-ai/...
See more stories from nvidia

Most recent headlines

13/12/2025

YouTube TV to Launch Genre Packages

In a move that will help it offer more flexible and less costly programming options, YouTube TV has announced that it will be launching YouTube TV Plans with mo...

13/12/2025

Magna Systems Finishes UHD, IP-based OB Truck For Singapore Network

SINGAPORE Magna Systems has designed, built and completed what is believed to be the first full UHD and IP-based OB truck in Southeast Asia for a Singapore medi...

12/12/2025

SVG Summit 2025 Preview: Everything You Need to Know for Next Week's Big Show in NYC

SVG Summit 2025 Preview: Everything You Need to Know for Next Week's Big Sho...

12/12/2025

Hailey Gates and Alia Shawkat Welcome You to the Village of Atropia

Hailey Gates at the Atropia premiere (photo by George Pimentel / Shutterstock for Sundance Film Festival)...

12/12/2025

Spotify and ATP Tour Launch First Episode of New Video Series

Last month, Spotify announced a new collaboration with the ATP Tour, the global governing body of men's professional tennis, aimed at bringing the next gene...

12/12/2025

Arkansas TV Drops PBS Affiliation Amid Funding Cuts

CONWAY, Ark. In a notable example of how the elimination of Federal federal funding is forcing public stations to make massive cuts and changes in the way they...

12/12/2025

Wisycom and DPA Microphones Appoint Rene Moerch as Group...

Wisycom and DPA Microphones announce the appointment of Ren Moerch as Group Product Director, Wireless, a strategic leadership role that will guide the combine...

12/12/2025

SMPTE Releases Updated Engineering Report on Artificial I...

SMPTE , the home of media professionals, technologists, and engineers, in conjuncture with the European Broadcasting Union (EBU) and the Entertainment Technolog...

12/12/2025

Keepit and Ingram Micro form strategic relationship in Po...

Keepit, the vendor-independent, cloud-native data protection provider, today announced a strategic go-to-market relationship in Poland with Ingram Micro, a lead...

12/12/2025

Atomos Enhances FUJIFILM GFX ETERNA 55 with RAW Capabilit...

Atomos announced the immediate availability of a new firmware update for its Ninja TX GO and Ninja TX monitor-recorders, unlocking Open Gate 48P RAW recording w...

12/12/2025

Professional Wireless Systems Provides Comprehensive RF S...

Professional Wireless Systems (PWS) once again played a critical role in delivering flawless wireless coordination and support at the 2025 Latin Grammy Awards a...

12/12/2025

AIMS Announces Inaugural IPMX Product Testing and Certifi...

The Alliance for IP Media Solutions (AIMS), together with the Video Services Forum (VSF), the Advanced Media Workflow Association (AMWA) and the European Broadc...

12/12/2025

DHD Gears for Hamburg Open 2026 with Latest Audio Product...

DHD audio will demonstrate the latest additions to its range of digital audio production solutions on Booth 321 in Hall B6 at Hamburg Open 2026. The show will b...

12/12/2025

Chaos Brings macOS Support and AI Tools to V-Ray for Blen...

Chaos today announces the release of V-Ray for Blender, update 2, bringing its award-winning rendering technology to even more Blender users by adding support f...

12/12/2025

UltraLEDs Launches Precision LED Tape for Professional Fi...

Lighting specialist UltraLEDs has launched Precision LED Tape, a high-CRI lighting solution designed specifically for professional film, TV, and studio use. P...

12/12/2025

Zixi Appoints Roi Sasson as Vice President Engineering

Zixi, the Emmy Award-winning leader in live broadcast-quality video over IP, today announced that Roi Sasson has joined the company as Vice President, Engineer...

12/12/2025

BitFire and Appear Partner to Advance Cloud and Edge Work...

BitFire (bitfire.tv), the leader in software-defined live production and IP transmission, today announced a strategic partnership with Appear, a leader in high-...

12/12/2025

HPA Announces Tech 2026 Retreat Agenda

LOS ANGELES The Hollywood Professional Association (HPA) today said futurist Robert Tercek, creative technologist Jessie Hughes from Leonardo.AI and Emmy-winnin...

12/12/2025

BitFire, Appear Form Strategic Partnership Integrating IP-Based Solutions

HUDSON, Mass. BitFire and Appear have struck a strategic partnership aimed at offering broadcasters, sports leagues and streaming platforms a faster, more flexi...

12/12/2025

TV Tech, TVBEurope to Explore MXLs Impact on Media Production

The broadcast industry is evolving faster than ever. #IPWorkflows #remoteproduction, and next-gen audio systems are reshaping how teams design, deliver, and sca...

12/12/2025

Wrapbook Acquires TV and Film Production Scheduling Platform Cinapse

LOS ANGELES The payroll and production accounting platform Wrapbook has announced the acquisition of Cinapse, a modern scheduling platform for film and televisi...

12/12/2025

Ross Video Expands South Asian Operations

DEHLI Ross Video has announced that it is expanding and restructuring its commercial and technical teams in the South Asian Association for Regional Cooperation...

12/12/2025

Rise AV Launches Asia Pacific Council and Mentoring Program

LONDON Following the success of its UK launch in January 2025, Rise AV, the global not-for-profit initiative dedicated to supporting and advancing women in the ...

12/12/2025

Tubi To Introduce Matter Casting For Fire TV

SAN FRANCISCO Ad-supported streaming service Tubi next week will launch Matter Casting, a new casting standard that will enable seamless mobile-to-TV viewing di...

12/12/2025

HPA Announces Tech Retreat Highlights

LOS ANGELES The Hollywood Professional Association (HPA) today said futurist Robert Tercek, creative technologist Jessie Hughes from Leonardo.AI and Emmy-winnin...

12/12/2025

Cheers to AI: ADAM Robot Bartender Makes Drinks at Vegas Golden Knights Game

In Las Vegas's T-Mobile Arena, fans of the Golden Knights are getting more than just hockey - they're getting a taste of the future. ADAM, a robot devel...

12/12/2025

President of Ireland Catherine Connolly visit to RT Raidi na Gaeltachta in Casla, Connemara

Uachtar n na h ireann, Catherine Connolly visited RT Raidi na Gaeltachta's...

12/12/2025

TV Host and social media sensation Eric Roberts revealed as sixth contestant for Dancing with the Stars 2026

Ireland AM host Eric Roberts has been revealed as the sixth contestant taking to...

12/12/2025

December 11, 2025

Scripps Research team pioneers an efficient way to stereoselectively add fluorine to drug-like molecules A new method uses a novel catalyst and inexpensive fluo...

11/12/2025

AI for Sustainability: Lessons from Sarajevo

Thomson and the Center for News, Technology and Innovation (CNTI) convened a two-day workshop in Sarajevo bringing together more than 35 journalists, editors, p...

11/12/2025

ESPN's Aims for Spectacular With Heisman Trophy Show

ESPN's Aims for Spectacular With Heisman Trophy ShowEvent firsts include 1080p HDR production airing on both national broadcast and cableBy Dan Daley, Audio...

11/12/2025

SVG Students To Watch: Frankie Patton, University of Colorado

SVG Students To Watch: Frankie Patton, University of ColoradoThe 2025 grad is hitting the ground running as a PA on national broadcastsBy Brandon Costa, Directo...

11/12/2025

SVG Summit 2025 Technology Exhibits Preview, Part 3

SVG Summit 2025 Technology Exhibits Preview, Part 3By SVG Staff Thursday, December 11, 2025 - 7:24 am Print This Story | Subscribe Story Highlights The 2...

11/12/2025

SVG Sit-Down: What Makes Gen Z, X, and Y Fans Tick? Dave Gavant of WSC Sports Goes Inside the 2025 Fan Engagement Survey

SVG Sit-Down: What Makes Gen Z, X, and Y Fans Tick? Dave Gavant of WSC Sports Go...

11/12/2025

SVG Summit 2025 Preview: 5G, MXL, Spectrum Loss, and Outerspace on Tap for Tuesday Tech Talks'

SVG Summit 2025 Preview: 5G, MXL, Spectrum Loss, and Outerspace on Tap for Tues...

11/12/2025

2025 Sports Broadcasting Hall of Fame: David Levy, Turner Titan and Master of All Sports-Media Trades

2025 Sports Broadcasting Hall of Fame: David Levy, Turner Titan and Master of Al...

11/12/2025

SVG Launches Follow the Money' Podcast: Go Inside the Sports Media Biz with Sam McCleery and John Kosner

SVG Launches Follow the Money' Podcast: Go Inside the Sports Media Biz with...

11/12/2025

A Deep Dive Inside Game Creek Video's Bird and Magic Mobile Units, Home to Amazon's NBA on Prime Video'

A Deep Dive Inside Game Creek Video's Bird and Magic Mobile Units, Home to A...

11/12/2025

How Sound Effects for Monsters Funday Football' Emulated the Sonic Soul of Monsters, Inc.'

How Sound Effects for Monsters Funday Football' Emulated the Sonic Soul of ...

11/12/2025

SVG New Sponsor Spotlight: CSP Mobile Productions' Len Chase on Upgrading Truck Fleet to 1080p, HDR, and ST 2110

SVG New Sponsor Spotlight: CSP Mobile Productions' Len Chase on Upgrading Tr...

11/12/2025

Spotify and The Game Awards Debut Gaming-Inspired Spotify Singles From Labrinth, Evanescence x GUNSHIP, and Bilmuri

Having the right song soundtrack your moves can make all the difference when gam...

11/12/2025

Celebrate Taylor Swift's Record-Breaking Year and New Docuseries with Exclusive Playlist Cover Art Stickers

It's been a big year for Taylor Swift. Her highly anticipated album The Life...

11/12/2025

L3Harris Ramps Up Production of Next-Gen Missile Tracking Satellites at Expanded Florida Facility

New satellites for the SDA Tranche 1 Tracking program in production at L3Harris&...

11/12/2025

L3Harris Delivers First Meadowlands Production Unit to US Space Force

The Meadowlands system, a compact and mobile version of the CCS, uses ground-based radio frequency units to disrupt satellite communications....

11/12/2025

L3Harris Demonstrates Interoperable Network to Unify Department of War and U.S. Government Agencies

The L3Harris demonstration united tactical communications devices, counter-UAS c...

11/12/2025

2025: L3Harris Year in Review

Throughout 2025, L3Harris delivered innovative solutions to U.S. and allied warfighters across every domain. With an unrelenting commitment to excellence, our...

11/12/2025

Nielsen reveals exclusive new data and insights in annual Tops of Sports report

A Majority of the World's Population (51%) Identify As Soccer Fans The 2025 MLB postseason notched 58.2 billion viewing minutes, up +24% from the prior y...

11/12/2025

Zixi Names Roi Sasson Vice President, Engineering

WALTHAM, Mass. Video-over-IP software provider Zixi said Roi Sasson has joined the company as vice president, engineering....