Take the Wheel: NVIDIA NeMo SteerLM Lets Companies Customize a Model's Responses During Inference
11/10/2023
NVIDIA NeMo SteerLM lets companies define knobs to dial in a model's responses as it's running in production, a process called inference. Unlike current methods for customizing an LLM, it lets a single training run create one model that can serve dozens or even hundreds of use cases, saving time and money.
NVIDIA researchers created SteerLM to teach AI models what users care about, like road signs to follow in their particular use cases or markets. These user-defined attributes can gauge nearly anything - for example, the degree of helpfulness or humor in the model's responses.
One Model, Many Uses The result is a new level of flexibility.
With SteerLM, users define all the attributes they want and embed them in a single model. Then they can choose the combination they need for a given use case while the model is running.
For example, a custom model can now be tuned during inference to the unique needs of, say, an accounting, sales or engineering department or a vertical market.
The method also enables a continuous improvement cycle. Responses from a custom model can serve as data for a future training run that dials the model into new levels of usefulness.
Saving Time and Money To date, fitting a generative AI model to the needs of a specific application has been the equivalent of rebuilding an engine's transmission. Developers had to painstakingly label datasets, write lots of new code, adjust the hyperparameters under the hood of the neural network and retrain the model several times.
SteerLM replaces those complex, time-consuming processes with three simple steps:
Using a basic set of prompts, responses and desired attributes, customize an AI model that predicts how those attributes will perform.
Automatically generating a dataset using this model.
Training the model with the dataset using standard supervised fine-tuning techniques.
Many Enterprise Use Cases Developers can adapt SteerLM to nearly any enterprise use case that requires generating text.
With SteerLM, a company might produce a single chatbot it can tailor in real time to customers' changing attitudes, demographics or circumstances in the many vertical markets or geographies it serves.
SteerLM also enables a single LLM to act as a flexible writing co-pilot for an entire corporation.
For example, lawyers can modify their model during inference to adopt a formal style for their legal communications. Or marketing staff can dial in a more conversational style for their audience.
Game On With SteerLM To show the potential of SteerLM, NVIDIA demonstrated it on one of its classic applications - gaming (see the video below).
Today, some games pack dozens of non-playable characters - characters that the player can't control - which mechanically repeat prerecorded text, regardless of the user or situation.
SteerLM makes these characters come alive, responding with more personality and emotion to players' prompts. It's a tool game developers can use to unlock unique new experiences for every player.
The Genesis of SteerLM The concept behind the new method arrived unexpectedly.
I woke up early one morning with this idea, so I jumped up and wrote it down, recalled Yi Dong, an applied research scientist at NVIDIA who initiated the work on SteerLM.
While building a prototype, he realized a popular model-conditioning technique could also be part of the method. Once all the pieces came together and his experiment worked, the team helped articulate the method in four simple steps.
It's the latest advance in model customization, a hot area in AI research.
It's a challenging field, a kind of holy grail for making AI more closely reflect a human perspective - and I love a new challenge, said the researcher, who earned a Ph.D. in computational neuroscience at Johns Hopkins University, then worked on machine learning algorithms in finance before joining NVIDIA.
Get Hands on the Wheel SteerLM is available as open-source software for developers to try out today. They can also get details on how to experiment with a Llama-2-13b model customized using the SteerLM method.
For users who want full enterprise security and support, SteerLM will be integrated into NVIDIA NeMo, a rich framework for building, customizing and deploying large generative AI models.
The SteerLM method works on all models supported on NeMo, including popular community-built pretrained LLMs such as Llama-2 and BLOOM.
Read a technical blog to learn more about SteerLM.
See notice regarding software product information.
LINK: | https://blogs.nvidia.com/blog/2023/10/11/customize-ai-models-steerlm/... |
See more stories from nvidia |
Most recent headlines
30/01/2025
Pliant Technologies Names Leading Technologies as New Ita...
Pliant Technologies, a leading provider of professional wireless intercom solutions, announces an important, new distribution agreement with Leading Technologie...
30/01/2025
Ateliere Expands Live Production Workflows with Rapid Int...
Ateliere Creative Technologies, a leading GenAI media software solutions company, has made available a new Application Programming Interface (API) for Ateliere ...
30/01/2025
NAKIVO Reveals 25 percent Revenue Growth and 10 percent C...
NAKIVO Inc., a leading provider of data protection solutions for physical, virtual, cloud, and SaaS environments, announced today strong Q4 2024 results, highli...
30/01/2025
TAG and Tencent Cloud Partner to Deliver Enhanced Cloud S...
TAG has announced a partnership with Tencent Cloud, the cloud business of the leading global technology company Tencent. Through this partnership, TAG will all...
30/01/2025
Chaos Releases V-Ray 7 for Maya and Houdini
Today, Chaos launches V-Ray 7 for Maya and V-Ray 7 for Houdini with new features that accelerate everything from fast 3D environment creation to production shad...
30/01/2025
EMG Gravity Media Australia and Supercars Media Set to De...
Supercars Media and EMG / Gravity Media, a world leading global provider of complex live creative production and media services, today outlined the broadcast an...
30/01/2025
Brainstorm contributes to FLECON-6G - The future of Media...
Brainstorm, leading manufacturer of real-time 3D graphics and virtual studio solutions, is contributing to the media and broadcasting world's undergoing rev...
29/01/2025
2025 Sundance Film Festival Short Film Program Award Winners Announced
PARK CITY, UTAH, January 28, 2025 - Tonight the nonprofit Sundance Institute awarded the prizes for the 2025 Sundance Film Festival Short Film Program at the Sh...
29/01/2025
SBS Learn shares the Year of the Snake with classrooms this Lunar New Year
SBS Learn shares the Year of the Snake with classrooms this Lunar New Year 29 January, 2025 Media releases SBS Learn is celebrating Lunar New Year, providi...
29/01/2025
Disney Remains No. 1, Pure-Play Streamers Benefit from a Dynamic Month of TV in Nielsen's December Media Distributor Gauge
Pure-play streaming companies account for over a quarter of total TV in December...
29/01/2025
DirecTV To Carry Texas Rangers Sports Network
ARLINGTON, Texas, and EL SEGUNDO, Calif. DirecTV today announced a multiyear distribution agreement with the MLB 2023 World Series champion Texas Rangers to bro...
29/01/2025
Brightcove Launches AI Content Suite
BOSTON Following a successful customer pilot program in 2024, Brightcove has officially launched its AI Content Suite. The suite features a range of artificial ...
29/01/2025
CES on Content Creation, Rebuilding Tomorrow
CES on Content Creation, Rebuilding Tomorrow Andy Marken January 29, 2025 0 Comments I can't lie about your chances. But you have my sympathies. ...
29/01/2025
Singaporean Holiday Movie Hi Noel Shot on URSA Mini Pro 12K
Singaporean Holiday Movie Hi Noel Shot on URSA Mini Pro 12K Brie Clayton January 29, 2025 0 Comments Small budget film delivers Hollywood quality Chri...
29/01/2025
Chaos Releases V-Ray 7 for Maya and Houdini
Chaos Releases V-Ray 7 for Maya and Houdini Brie Clayton January 29, 2025 0 Comments Gaussian Splat Support Accelerates World Building for 3D Artists;...
29/01/2025
Two Summers, Two Continents: How One Teen Found His Sound at Berklee
Two Summers, Two Continents: How One Teen Found His Sound at Berklee In back-to-back programs in Boston and Valencia, Spain, guitarist Miles Sam sharpened his...
29/01/2025
Haivision Ships New UI for Haivision Pro Transmitter
MONTREAL Haivision has begun shipping the latest version of software for its low-latency live video Haivision Pro transmitter and will show it at NAB Show, Apri...
29/01/2025
ARRI launches ALEXA 35 Base camera with flexible upgrade options
A suite of five licences can be purchased, upgrading the camera to meet specific requirements as needed By Matthew Corrigan Published: January 29, 2025 A ...
29/01/2025
Government considers extending UK TV Licence to streaming-only households
A range of options is reportedly under consideration ahead of the current Royal Charter periods ending in 2027 By Matthew Corrigan Published: January 29, 202...
29/01/2025
Orange Launches Nuanua Satellite Project in Wallis and Futuna
This ambitious initiative, meaning rainbow in Wallisian, aims to replace the existing satellite infrastructure of the Wallis and Futuna Islands Luxembourg, Par...
29/01/2025
Signiant Joins the MovieLabs Industry Forum
LEXINGTON, Mass. Signiant has announced that it has joined the MovieLabs Industry Forum, a collaborative initiative established by MovieLabs to tackle industry ...
29/01/2025
FCC Halts Efforts To Regulate Bulk Billing' Broadband Deals
WASHINGTON Federal Communications Commission Chair Brendan Carr has ended the agency's consideration of a proposal from last year that sought to regulate so...
29/01/2025
Survey: Global Investors See Big Opportunities in Sports Tech
ZURICH, Switzerland While buying and selling sports franchises often gets the most media attention, a new survey from Altman Solon finds that global sports exec...
29/01/2025
Dhar Mann, a Forbes Top Creator in 2024, Will Speak at NAB Show
WASHINGTON Dhar Mann, recognized by Forbes as a Top Creator in 2024, will be a featured speaker on the Main Stage at NAB Show, set for April 5-9 at the Las Vega...
29/01/2025
Study: YouTube Tops Streamers in Viewing Share; Netflix Has Biggest Reach
A new study sheds new light on the current state of the streaming wars with data showing that YouTube has the highest share of viewing time (21%), followed by N...
29/01/2025
ARRI To Feature New Alexa 35 Base at NAB Show
MUNICH ARRI today introduced a new entry model of the Alexa 35 and flexible licenses and will showcase the camera system at the 2025 NAB Show, April 5-9, at the...
29/01/2025
Super Bowl Sunday Is Becoming a Multiplatform Experience
DENVER Digital marketing agency Adtaxi has unveiled a new survey showing the Super Bowl has become a multiplatform experience, with almost 70% of Americans util...
29/01/2025
Kalamazoo College Deploys Extensive Blackmagic Production Resources
KALAMAZOO, Mich. Michigan's Kalamazoo College has deployed a variety of Blackmagic Design gear, including cameras, switchers and recorders for campuswide pr...
29/01/2025
IODYNE AND DIGIBOX ANNOUNCE PRO MINI DISTRIBUTION PARTNER...
iodyne, developer of the world's fastest Thunderbolt storage today announced a UK distribution partnership with DigiBox for the first ever Smart Drive, the ...
29/01/2025
RT Choice Music Prize Live Event Line-up Announcement
RT Choice Music Prize In association with IMRO and IRMA Celebrating 20 Years of the RT Choice Music Prize Live Event Line-up Announcement Vicar Street, Du...
29/01/2025
Leveling Up User Experiences With Agentic AI, From Bots to Autonomous Agents
AI agents with advanced perception and cognition capabilities are making digital experiences more dynamic and personalized across retail, finance, entertainment...
29/01/2025
ESPN Celebrates Black History Always on February 5 with Inaugural All-Black Staffed NBA Broadcast
ESPN Celebrates Black History Always on February 5 with Inaugural All-Black Staf...
29/01/2025
Producing a Live Gymnastics Meet: Behind the Broadcast with Auburn's War Eagle Productions
Producing a Live Gymnastics Meet: Behind the Broadcast with Auburn's War Eag...
29/01/2025
Live From Super Bowl LIX: Sony To Support Game, Studio, Halftime Show Production With 100+ Cameras
Live From Super Bowl LIX: Sony To Support Game, Studio, Halftime Show Production...
29/01/2025
SVG College Summit 2025: University of Wyoming's Dennis Trapani To Serve as Event Chair
SVG College Summit 2025: University of Wyoming's Dennis Trapani To Serve as ...
29/01/2025
SVG College Sports Media Awards 2025: Entry Window Opens on February 11; New Outstanding Cinematic Recap' Category Added
SVG College Sports Media Awards 2025: Entry Window Opens on February 11; New Ou...
29/01/2025
Official trailer revealed for third thrilling instalment of Sky Original drama Gangs of London, coming March
Official trailer revealed for third thrilling instalment of Sky Original drama G...
29/01/2025
Ashley Walters begins production on his debut feature film Animol
Ashley Walters begins production on his debut feature film AnimolWednesday 29 January 2025 Image Credit: Anthony Dickenson Image available to download HERE C...
29/01/2025
Introducing the Season Download Button: Get Caught Up on the Go With Just One Tap
Back to All News Introducing the Season Download Button: Get Caught Up on the G...
29/01/2025
Get Ready to Travel Back to Hawkins: Jazwares Goes Deeper Into the Upside Down With Netflix's 'Stranger Things' to Launch All-New Toy Line
Back to All News Get Ready to Travel Back to Hawkins: Jazwares Goes Deeper Into the Upside Down With Netflix's Stranger Things to Launch All-New Toy Line ...
29/01/2025
Carmen Sandiego' and Netflix Stories: Sex Education' Mobile Games Now Available on Netflix
Back to All News Carmen Sandiego' and Netflix Stories: Sex Education'...
29/01/2025
Haivision Unveils a Streamlined User Interface for the Haivision Pro Video Transmitter, Simplifying Ease-of-Use and Interoperability
Haivision Unveils a Streamlined User Interface for the Haivision Pro Video Trans...
29/01/2025
FilmLight introduces REMOTE
FilmLight REMOTE delivers high-quality, low-latency remote grading FilmLight has announced the introduction of FilmLight REMOTE, a singular solution for remote...
29/01/2025
January 29, 2025
Researchers illuminate new structures of a critical amyloid protein Insights could advance new drugs to treat the progressive, fatal disease known as transthyre...
29/01/2025
Ross Video Expands Corporate Market Presence in EMEA with the Appointment of Nancy Diaz Curiel as Regional Sales Director
Ottawa, ON - Wednesday, January 29, 2025 - Ross Video is strengthening its prese...
29/01/2025
Eyedea's AI-powered visual recognition software protected and monetized by Thales Sentinel Platform
Facebook Twitter LinkedIn Thales Sentinel protects Eyedea technologies bas...
29/01/2025
Thales partners with the State of Georgia Department of Driver Services to enhance citizen experience
Facebook Twitter LinkedIn Thales Enrollment Kiosks for credential issuance...
29/01/2025
Introducing the Versatile Phantom T2110
Vision Research announces the new Phantom T2110 camera with a custom 1-megapixel (Mpx), back side illuminated (BSI) sensor. Ideal for many scientific applicatio...
28/01/2025
Bruno Mars Makes Spotify History as the First Artist To Hit 150 Million Monthly Listeners
Bruno Mars has officially etched his name in the Spotify history books. On Janua...
28/01/2025
On Our $10 Billion Milestone and a Decade of Getting the World to Value Music
In 2014, the music industry reached a low point when global recorded music revenues hit $13 billion. Spotify's annual contribution at the time was around $1...