NVIDIA NIM on AWS Supercharges AI Inference
04/12/2024
Expanding its collaboration with NVIDIA, Amazon Web Services (AWS) revealed today at its annual AWS re:Invent conference that it has extended NVIDIA NIM microservices across key AWS AI services to support faster AI inference and lower latency for generative AI applications.
NVIDIA NIM microservices are now available directly from the AWS Marketplace, as well as Amazon Bedrock Marketplace and Amazon SageMaker JumpStart, making it even easier for developers to deploy NVIDIA-optimized inference for commonly used models at scale.
NVIDIA NIM, part of the NVIDIA AI Enterprise software platform available in the AWS Marketplace, provides developers with a set of easy-to-use microservices designed for secure, reliable deployment of high-performance, enterprise-grade AI model inference across clouds, data centers and workstations.
These prebuilt containers are built on robust inference engines, such as NVIDIA Triton Inference Server, NVIDIA TensorRT, NVIDIA TensorRT-LLM and PyTorch, and support a broad spectrum of AI models - from open-source community ones to NVIDIA AI Foundation models and custom ones.
NIM microservices can be deployed across various AWS services, including Amazon Elastic Compute Cloud (EC2), Amazon Elastic Kubernetes Service (EKS) and Amazon SageMaker.
Developers can preview over 100 NIM microservices built from commonly used models and model families, including Meta's Llama 3, Mistral AI's Mistral and Mixtral, NVIDIA's Nemotron, Stability AI's SDXL and many more on the NVIDIA API catalog. The most commonly used ones are available for self-hosting to deploy on AWS services and are optimized to run on NVIDIA accelerated computing instances on AWS.
NIM microservices now available directly from AWS include:
NVIDIA Nemotron-4, available in Amazon Bedrock Marketplace, Amazon SageMaker Jumpstart and AWS Marketplace. This is a cutting-edge LLM designed to generate diverse synthetic data that closely mimics real-world data, enhancing the performance and robustness of custom LLMs across various domains.
Llama 3.1 8B-Instruct, available on AWS Marketplace. This 8-billion-parameter multilingual large language model is pretrained and instruction-tuned for language understanding, reasoning and text-generation use cases.
Llama 3.1 70B-Instruct, available on AWS Marketplace. This 70-billion-parameter pretrained, instruction-tuned model is optimized for multilingual dialogue.
Mixtral 8x7B Instruct v0.1, available on AWS Marketplace. This high-quality sparse mixture of experts model with open weights can follow instructions, complete requests and generate creative text formats.
NIM on AWS for Everyone Customers and partners across industries are tapping NIM on AWS to get to market faster, maintain security and control of their generative AI applications and data, and lower costs.
SoftServe, an IT consulting and digital services provider, has developed six generative AI solutions fully deployed on AWS and accelerated by NVIDIA NIM and AWS services. The solutions, available on AWS Marketplace, include SoftServe Gen AI Drug Discovery, SoftServe Gen AI Industrial Assistant, Digital Concierge, Multimodal RAG System, Content Creator and Speech Recognition Platform.
They're all based on NVIDIA AI Blueprints, comprehensive reference workflows that accelerate AI application development and deployment and feature NVIDIA acceleration libraries, software development kits and NIM microservices for AI agents, digital twins and more.
Start Now With NIM on AWS Developers can deploy NVIDIA NIM microservices on AWS according to their unique needs and requirements. By doing so, developers and enterprises can achieve high-performance AI with NVIDIA-optimized inference containers across various AWS services.
Visit the NVIDIA API catalog to try out over 100 different NIM-optimized models, and request either a developer license or 90-day NVIDIA AI Enterprise trial license to get started deploying the microservices on AWS services. Developers can also explore NIM microservices in the AWS Marketplace, Amazon Bedrock Marketplace or Amazon SageMaker JumpStart.
See notice regarding software product information.
LINK: | https://blogs.nvidia.com/blog/nim-microservices-aws-inference/... |
See more stories from nvidia |
Most recent headlines
20/01/2025
FAA Trial Shows L3Harris' SafeRoute+ Boosting Airspace Capacity
L3Harris Commercial Aviation Solutions (CAS) today announced the promising results of its first-year participation in a landmark FAA trial that demonstrates the...
20/01/2025
MIP LONDON
Heading to MIP LONDON this February? So are we! We'd love to meet you at London's biggest content week. Get in touch to discover how Blue Lucy's sol...
20/01/2025
Netflix approves Blackmagic URSA Cine 12K LF digital film camera
To qualify, cameras must meet capture requirements including dynamic range, codec, resolution and workflow compatibility By Matthew Corrigan Published: Janua...
20/01/2025
Best of Show at ISE 2025 deadline extended
Companies now have until January 24th to submit products for the annual awards By TVBEurope Staff Published: January 20, 2025 Companies now have until Jan...
20/01/2025
TVBEurope January/February 2025 issue now available
Focusing on streaming and OTT, the issue features an interview with Channel 4s director of technology and distribution, Grace Boswood; finds out how BTs infrast...
20/01/2025
Tiffen Intros Dual Purpose Fusion Filters
The Tiffen Company introduces new Fusion Filters, designed to combine the industry's most sought-after, original Tiffen diffusion effects with high quality ...
20/01/2025
kicker deploys the Bitmovin Player to power a winning spo...
Bitmovin, a leading provider of video streaming solutions, announces that kicker, a leading German digital sports publisher, has chosen the Bitmovin Player for ...
20/01/2025
Forderer ASC BVK Uses Astera to Recreate 70s Look on Sept...
Behind the Scenes on September 5 Summer 1972 marked just the second time the Olympics were televised live across the globe. Little did anyone anticipate the ...
20/01/2025
CFP National Championship 2025: ESPN's MegaCast Menu Highlighted by REMI Production of Field Pass With The Pat McAfee Show
CFP National Championship 2025: ESPN's MegaCast Menu Highlighted by REMI Pro...
20/01/2025
CFP National Championship 2025: Ohio State Football's Ethan Miller on Creative Storytelling From Spring Practice Until Monday's Title Game
CFP National Championship 2025: Ohio State Football's Ethan Miller on Creati...
20/01/2025
CFP National Championship 2025: Van Wagner To Hype Up Buckeyes, Fighting Irish Fans From New IP-Based Control Room
CFP National Championship 2025: Van Wagner To Hype Up Buckeyes, Fighting Irish F...
20/01/2025
CFP National Championship 2025: ESPN To Deliver Main Game Telecast in 4K for the First Time
CFP National Championship 2025: ESPN To Deliver Main Game Telecast in 4K for the...
20/01/2025
CFP National Championship 2025: It's All Hands on Deck for ESPN's Massive Production in Atlanta
CFP National Championship 2025: It's All Hands on Deck for ESPN's Massiv...
20/01/2025
NAMM Show 2025, Part 1: The Expo Has Become Pro Audio's Main Showcase
NAMM Show 2025, Part 1: The Expo Has Become Pro Audio's Main Showcase Moving beyond its music roots, the show begins to address broadcast's complex need...
20/01/2025
Sky to air definitive portrait of F1 champion Damon Hill
Sky to air definitive portrait of F1 champion Damon HillSky Exclusive documentary Hill will be available on Sky and streaming service NOW in 2025Monday 20 Janu...
20/01/2025
Rohde & Schwarz presents new wideband modulated load pull solution based on the R&S RTP oscilloscope
Rohde & Schwarz presents new wideband modulated load pull solution based on the ...
20/01/2025
Take a Wild Ride from the Mandap into Total Madness: Dhoom Dhaam' Launches on Netflix on February 14
Back to All News Take a Wild Ride from the Mandap into Total Madness: Dhoom Dh...
20/01/2025
Arqiva to deploy 1 million smart meters for United Utilities
Arqivas network already supports over 2m smart meters for some of the UKs largest water companies January 20, 2025 Winchester, UK Arqiva, a leading provide...
20/01/2025
Bobbi Arlo hoping to represent Ireland at Eurovision 2025 First listen and interview on today's Ray D'Arcy Show on RT Radio 1
Bobbi Arlo has today been announced as the first act bidding to represent Irelan...
20/01/2025
Mickey Joe Harte is the first celebrity to exit Dancing with the Stars
Mickey Joe Harte was the first celebrity to be eliminated from series eight of Dancing with the Stars tonight. Mickey Joe and pro partner Daniela Roze had the ...
20/01/2025
Nominations Announcement RT Radio 1 Folk Awards
Nominations Announcement - RT Radio 1 Folk Awards Celebrating the very best in folk music in Ireland from the past year Awards event LIVE in Vicar Street and...
18/01/2025
Sundance Institute Announces 2025 Screenwriters Lab and Screenwriters Intensive Fellows
10 Projects to Be Developed at Annual January Screenwriters Lab; 10 Projects t...
18/01/2025
Sinclair Launches New Sales and Content Divisions
BALTIMORE In a move to redefine how it works with advertisers and audiences, Sinclair Broadcast Group has announced the rebranding and launch of new sales and c...
18/01/2025
Gray Media Stations To Air Atlanta Braves Spring Training Games
ATLANTA Gray Media has announced that it will air ten Spring Training Major League Baseball games with the Atlanta Braves on two dozen Gray Media stations, begi...
18/01/2025
Lawo To Showcase IP-Based AV Solutions at ISE 2025
Lawo has announced it will be showcasing a variety of IP-based solutions at ISE 2025 in Barcelona (Feb. 4-7) at booth 5H700....
18/01/2025
NAB Publishes Long-Awaited Future of TV Initiative Report
WASHINGTON The National Association of Broadcasters today released the long-awaited Future of Television Initiative report, written to give the Federal Communic...
18/01/2025
DigitalGlues creativespace Storage Solution Delivers for...
DigitalGlue has announced that its creative.space shared storage solution was selected by the Ryan Seacrest Foundation (RSF) to house, organize and provide remo...
18/01/2025
Starin and Absen Announce Strategic Partnership to Expand...
Starin, a leading distributor specializing in the AV and Broadcast/M&E markets, is proud to announce a new partnership with Absen Inc., the US subsidiary of Abs...
18/01/2025
CVP debuts Production Solutions at BETT UK 2025
CVP, one of Europe's leading resellers and providers of professional video and broadcast solutions, will showcase a selection of its extensive range of prod...
18/01/2025
DPA Launches New MicroLock Compact Microphone Connector
DPA Microphones unveils MicroLock , its new compact microphone connector, which builds on the strengths of the renowned MicroDot connector currently deployed wi...
18/01/2025
Friend MTS appoints Dave Gilmore to build its business in...
Friend MTS, a leading global provider of content protection services, today announced that it is expanding its business intelligence services with a new team he...
18/01/2025
MAXHUB to Launch Industry-Leading 92 Inch Microsoft Teams...
Transforming Visual Collaboration with 5K Clarity and Versatile Design MAXHUB, a global leader in integrated communication displays and unified communications ...
18/01/2025
Witbe Joins the Digital TV Group to Share Video Monitorin...
Witbe (Euronext Growth FR0013143872 ALWIT), a global leader in test automation and monitoring technology for video service providers, today announced that i...
18/01/2025
Digital Alert Systems Hires Daniel Dillon as Product and...
Digital Alert Systems, the global leader in emergency communications solutions for media providers, today announced the appointment of Daniel Dillon as a produc...
18/01/2025
South Koreas Daegu Concert House Upgrades Communications...
Riedel Communications today announced that the prestigious Daegu Concert House in South Korea has upgraded its communications systems with a shift to Riedel'...
18/01/2025
Telos Alliance champions audio innovation for growing liv...
Telos Alliance (stand #4L700), trusted global leader in broadcast audio for more than three decades, announces it will showcase a range of industry-leading sol...
18/01/2025
Grass Valley Bolsters Sales Leadership with Key Hires to...
Grass Valley, the media and entertainment industry's leading technology provider, today announced the appointment of three industry veterans to key position...
18/01/2025
Grass Valley to Showcase Cutting-Edge Solutions for Conce...
Grass Valley, the media and entertainment industry's leading technology provider, is bringing its media production expertise to ISE 2025 to showcase how bro...
18/01/2025
Black Box at ISE 2025 Advanced IP KVM Solutions for High-...
At ISE 2025, Black Box will showcase its advanced IP KVM solutions, providing systems integrators, control room designers, live event producers, and Pro AV and...
18/01/2025
Lunar New Year Must-Watch: Period Rom-Com Drama Series Perfect Match' Premieres January 25
Back to All News Lunar New Year Must-Watch: Period Rom-Com Drama Series Perfec...
17/01/2025
EVO Enhances Existing Storage Systems
Discover how EVO shared storage can add value to your existing cloud and NAS systems, including Avid Nexis, Dell Isilon, and more. Your team's storage syst...
17/01/2025
Clear-Com Offers Secure Communication Network for the World's Largest Techno Festival
eds3_5_jq(document).ready(function($) { $(#eds_sliderM519).chameleonSlider_2_1({...
17/01/2025
Studio Technologies Latest ST 2110 to Dante Bridge and Related Innovations Presented at ISE 2025
Studio Technologies Latest ST 2110 to Dante Bridge and Related Innovations Prese...
17/01/2025
Flying through clouds with no plugins and AI images
Flying through clouds with no plugins and AI images Graham Quince January 17, 2025 0 Comments I've long been wondering about the best way to creat...
17/01/2025
Spectrum The Psychological Reasoning Behind Adobe's Design System!
Spectrum The Psychological Reasoning Behind Adobe's Design System! Colin Smith January 17, 2025 0 Comments This is a look at the new interface i...
17/01/2025
Marlow Film Studios inquiry to open next week
The revised proposals were supported by Hollywood director James Cameron and included a £20 million investment in local infrastructure By Matthew Corrigan Pu...
17/01/2025
The London-based channel delivering sustainable streaming in every way
TVBEurope meets the team behind RE:TV, a cross-platform channel, to discover how they keep both their production and streaming carbon footprints down by using r...
17/01/2025
Government funding boost to turbocharge' UK creative industries
In addition to the support, the government has also launched a new UK Soft Power Council to build trust and drive economic growth By Matthew Corrigan Publish...
17/01/2025
Government not considering' general taxation to replace BBC licence fee
But there is no question in my mind that the licence fee is not only insufficient, its raising insufficient money to support the BBC, but it also is deeply regr...
17/01/2025
IAB Survey: Ad Execs Bullish on Ad Growth and AI Usage in 2025
NEW YORK Despite record-breaking ad revenues in 2024 fueled by massive political spending and the Olympics, a new Internet Advertising Bureau (IAB) survey indic...