Now Hear This: World's Most Flexible Sound Machine Debuts
25/11/2024
While some AI models can compose a song or modify a voice, none have the dexterity of the new offering.
Called Fugatto (short for Foundational Generative Audio Transformer Opus 1), it generates or transforms any mix of music, voices and sounds described with prompts using any combination of text and audio files.
For example, it can create a music snippet based on a text prompt, remove or add instruments from an existing song, change the accent or emotion in a voice - even let people produce sounds never heard before.
This thing is wild, said Ido Zmishlany, a multi-platinum producer and songwriter - and cofounder of One Take Audio, a member of the NVIDIA Inception program for cutting-edge startups. Sound is my inspiration. It's what moves me to create music. The idea that I can create entirely new sounds on the fly in the studio is incredible.
A Sound Grasp of Audio We wanted to create a model that understands and generates sound like humans do, said Rafael Valle, a manager of applied audio research at NVIDIA and one of the dozen-plus people behind Fugatto, as well as an orchestral conductor and composer.
Supporting numerous audio generation and transformation tasks, Fugatto is the first foundational generative AI model that showcases emergent properties - capabilities that arise from the interaction of its various trained abilities - and the ability to combine free-form instructions.
Fugatto is our first step toward a future where unsupervised multitask learning in audio synthesis and transformation emerges from data and model scale, Valle said.
A Sample Playlist of Use Cases For example, music producers could use Fugatto to quickly prototype or edit an idea for a song, trying out different styles, voices and instruments. They could also add effects and enhance the overall audio quality of an existing track.
The history of music is also a history of technology. The electric guitar gave the world rock and roll. When the sampler showed up, hip-hop was born, said Zmishlany. With AI, we're writing the next chapter of music. We have a new instrument, a new tool for making music - and that's super exciting.
An ad agency could apply Fugatto to quickly target an existing campaign for multiple regions or situations, applying different accents and emotions to voiceovers.
Language learning tools could be personalized to use any voice a speaker chooses. Imagine an online course spoken in the voice of any family member or friend.
Video game developers could use the model to modify prerecorded assets in their title to fit the changing action as users play the game. Or, they could create new assets on the fly from text instructions and optional audio inputs.
Making a Joyful Noise One of the model's capabilities we're especially proud of is what we call the avocado chair, said Valle, referring to a novel visual created by a generative AI model for imaging.
For instance, Fugatto can make a trumpet bark or a saxophone meow. Whatever users can describe, the model can create.
With fine-tuning and small amounts of singing data, researchers found it could handle tasks it was not pretrained on, like generating a high-quality singing voice from a text prompt.
Users Get Artistic Controls Several capabilities add to Fugatto's novelty.
During inference, the model uses a technique called ComposableART to combine instructions that were only seen separately during training. For example, a combination of prompts could ask for text spoken with a sad feeling in a French accent.
The model's ability to interpolate between instructions gives users fine-grained control over text instructions, in this case the heaviness of the accent or the degree of sorrow.
I wanted to let users combine attributes in a subjective or artistic way, selecting how much emphasis they put on each one, said Rohan Badlani, an AI researcher who designed these aspects of the model.
In my tests, the results were often surprising and made me feel a little bit like an artist, even though I'm a computer scientist, said Badlani, who holds a master's degree in computer science with a focus on AI from Stanford.
The model also generates sounds that change over time, a feature he calls temporal interpolation. It can, for instance, create the sounds of a rainstorm moving through an area with crescendos of thunder that slowly fade into the distance. It also gives users fine-grained control over how the soundscape evolves.
Plus, unlike most models, which can only recreate the training data they've been exposed to, Fugatto allows users to create soundscapes it's never seen before, such as a thunderstorm easing into a dawn with the sound of birds singing.
A Look Under the Hood Fugatto is a foundational generative transformer model that builds on the team's prior work in areas such as speech modeling, audio vocoding and audio understanding.
The full version uses 2.5 billion parameters and was trained on a bank of NVIDIA DGX systems packing 32 NVIDIA H100 Tensor Core GPUs.
Fugatto was made by a diverse group of people from around the world, including India, Brazil, China, Jordan and South Korea. Their collaboration made Fugatto's multi-accent and multilingual capabilities stronger.
One of the hardest parts of the effort was generating a blended dataset that contains millions of audio samples used for training. The team employed a multifaceted strategy to generate data and instructions that considerably expanded the range of tasks the model could perform, while achieving more accurate performance and enabling new tasks without requiring additional data.
They also scrutinized existing datasets to reveal new relationships among the dat
North America Stories
25/11/2024
US Navy Awards L3Harris Nearly $1 Billion IDIQ Contract
Multifunction Information Distribution System Joint Tactical Radio System Terminals provide assured communications, situational awareness, command and control a...
25/11/2024
L3Harris Completes Critical Design Review for Space Development Agency Satellite Radios
Rendering of L3Harris Tranche 2 Tracking (T2TRK) imagery for the Space Developme...
25/11/2024
FOX, Disney Capitalize On Multiplatform Viewing in Nielsen's October Media Distributor Gauge
FOX hits company-high water mark in the Media Distributor Gauge with 8.4% of TV;...
25/11/2024
Advertising Research Foundation Proposes Updating TV Categories
NEW YORK The Advertising Research Foundation (ARF) has proposed a new framework that would reclassify how U.S. households connect to TV. It would replace the pa...
25/11/2024
Fubo To Launch 18 NBCU FAST Channels
NEW YORK FuboTV and NBCUniversal have announced a deal that will see the launch of 18 NBCU FAST channels on the virtual multichannel video programming distribut...
25/11/2024
SBE Adds New Certification: Certified Production Technologist
The Society of Broadcast Engineers said it has added a new credential to its program of certification, Certified Production Technologist....
25/11/2024
New Chyron Virtual Placement 7.7 Offers Football-Specific Features
MELVILLE, N.Y. Chyron has updated its Virtual Placement product with the release of version 7.7 offering new tools and enhancements for football telecasts....
25/11/2024
Alien: Romulus Graded with DaVinci Resolve Studio
Alien: Romulus Graded with DaVinci Resolve Studio Brie Clayton November 25, 2024 0 Comments Colorist Mitch Paulson uses modern tools to recreate and b...
25/11/2024
Satellite Television & Radio Australia purchases Appear TV XC Platform from Magna Systems for VAST
Satellite Television & Radio Australia purchases Appear TV XC Platform from Magn...
25/11/2024
SVG Summit 2024: FanDuel Sports Network, MLB Local Media, MSG Networks, and NBC Sports Bay Area Talk Innovation in Regional Sports
SVG Summit 2024: FanDuel Sports Network, MLB Local Media, MSG Networks, and NBC ...
25/11/2024
Making a Splash: SailGP's Chief Content Officer Melissa Lawton Discusses Content and Tech Innovation
Making a Splash: SailGP's Chief Content Officer Melissa Lawton Discusses Con...
25/11/2024
Remembering Neil Flagg: The NBC Sports Mainstay Leaves an Industry Legacy in Sons Ross and Kevin
Remembering Neil Flagg: The NBC Sports Mainstay Leaves an Industry Legacy in Son...
25/11/2024
Warner Bros. Discovery's Dylan Boucherle Shares the Latest on Virtual Production and TNT Sports' Investment in an LED Volume
Warner Bros. Discovery's Dylan Boucherle Shares the Latest on Virtual Produc...
25/11/2024
Netflix Unveils Behind the Scenes Documentary of 'The Helicopter Heist'
Back to All News Netflix Unveils Behind the Scenes Documentary of The Helicopter Heist Entertainment 25 November 2024 GlobalSweden Link copied to clipboard...
25/11/2024
Netflix unveils the trailer and key art for One Hundred Years of Solitude
Back to All News Netflix unveils the trailer and key art for One Hundred Years of SolitudePlay Video Play Video Entertainment 25 November 2024 GlobalColomb...
25/11/2024
Why Workforce Development Is Key to Reaping AI Benefits
AI is changing industries and economies worldwide. Workforce development is central to ensuring the changes benefit all of us, as Louis Stewart, head of strate...
25/11/2024
Now Hear This: World's Most Flexible Sound Machine Debuts
A team of generative AI researchers created a Swiss Army knife for sound, one that allows users to control the audio output simply using text. While some AI mo...
23/11/2024
Heartwarming Out of My Mind Highlights the Importance of Disability Advocacy
PARK CITY, UTAH - JANUARY 19: (L-R) Judith Light, Rosemarie Dewitt, Luke Kirby, Michael Chernus, Phoebe-Rae Taylor, Sharon M. Draper, Amber Sealey, and Courtney...
23/11/2024
TCLtv+ Adds 23 CBS Fast Channels
LOS ANGELES/NEW YORK TCL's streaming service TCLtv+ has struck a content deal with Paramount Streaming that will add 23 CBS FAST channels to its lineup....
23/11/2024
Viamedia Signs Ad Rep Deals with 7 More Service Providers
LEXINGTON, Ky. The independent advertising rep firm Viamedia has further expanded its sales network with news that it has agreements to manage advertising sale...
23/11/2024
The Trade Desk Jumps Into Streaming TV With Ventura OS
VENTURA, Calif. Programmatic ad giant The Trade Desk is pushing into the streaming technology business with a new operating system called Ventura....
23/11/2024
Writers Guild, 3 PBS Stations Reach Tentative Agreement for New Contract
BOSTON, LOS ANGELES AND NEW YORK The Writers Guild of America has announced that it has reached a tentative agreement with management at PBS member stations WGB...
23/11/2024
Supreme Court to Consider Legality of FCC's Universal Service Fund
WASHINGTON The U.S. Supreme Court has agreed to hear an appeal in a case that alleges the Federal Communications Commission does not have the authority to decid...
22/11/2024
Michelle Satter to Be Honored at 2025 Sundance Film Festival Gala Celebrating Sundance Institute Presented by Google TV
Sean Wang, Julian Brave NoiseCat, and Emily Kassie to Receive Annual Vanguard Aw...
22/11/2024
Fox, Hulu Renew Content Deal
Fox Entertainment and Hulu have renewed a multi-year content distribution agreement that will keep in-season streaming rights for Fox's programming slate on...
22/11/2024
Academy Award-Winning Film Studio Caviar Signs Director Duo MAMA
Academy Award-Winning Film Studio Caviar Signs Director Duo MAMA Brie Clayton November 22, 2024 0 Comments Academy Award-winning independent film stud...
22/11/2024
A Creative Alliance for Black Friday: Independent Software Makers Unite for Photographers
A Creative Alliance for Black Friday: Independent Software Makers Unite for Phot...
22/11/2024
Partial Bold Text using After Effects expressions UPDATED
Partial Bold Text using After Effects expressions UPDATED Graham Quince November 22, 2024 0 Comments Now with an improved expression for Per Word Se...
22/11/2024
John Lawson Steps Down as AWARN Executive Director
WASHINGTON John Lawson, longtime broadcast alerting advocate and founder of the AWARN Alliance, said he is stepping down as its executive director to work full-...
22/11/2024
Warner Bros. Discovery Introduces Shop With Max and Moments
NEW YORK Warner Bros. Discovery Advertising Sales has incorporated Kerv's AI-enhanced technology into its ad-tech platform and launched two new ad offerings...
22/11/2024
EDO, Vizio Ink New Multiyear Smart-TV Data Licensing Pact
NEW YORK Vizio's Inscape, a smart-TV data provider, and EDO said they have extended their longstanding data partnership....
22/11/2024
New Pixotope Reveal Enables AR, Virtual Production Without Green Screens
OSLO, Norway Live augmented reality and virtual production specialist Pixotope Technologies has launched Pixotope Reveal, an AI-powered background segmentation ...
22/11/2024
Nominations Open for 2025 NAB Technology Awards
The National Association of Broadcasters has opened nominations for the 2025 NAB Technology Awards, recognizing excellence in broadcast engineering, digital lea...
22/11/2024
Thanksgiving TV Sports Ad-Spend Binge To Hit $624 Million
In between helpings of turkey and other Thanksgiving Day fare, viewers will see companies dishing up hefty portions of ads on sports programming, with the natio...
22/11/2024
8 Channels Added to MyFree DirecTV Streaming Lineup
Following the recent launch of the MyFree DirecTV free-ad supported package of 70-plus streaming channels, DirecTV has launched eight new channels catering to s...
22/11/2024
Viant, Disney Advertising Expand CTV Ad Collaboration
IRVINE, Calif. Viant Technology said it has expanded its agreement with Disney Advertising that's focused on making premium connected TV, video and display ...
22/11/2024
Xumo To Make Ad Inventory Available Programmatically With PubMatic
PHILADELPHIA and REDWOOD CITY, Calif. Xumo, the streaming platform joint venture of Comcast and Charter Communications, has reached an agreement to make its pre...
22/11/2024
Mediaocean To Acquire Innovid, Will Merge It With Flashtalking
NEW YORK Privately-held ad tech giant Mediaocean has inked a definitive agreement to acquire Innovid, an independent software platform for advertising creation,...
22/11/2024
Viz University introduces new Viz Artist certifications aimed at upskilling designers of all levels
Viz University introduces new Viz Artist certifications aimed at upskilling desi...
22/11/2024
NBC Sports President Rick Cordella on How Comcast's NBCU Cable-Net Spinoff Will Impact Sports Ops
NBC Sports President Rick Cordella on How Comcast's NBCU Cable-Net Spinoff W...
22/11/2024
NWSL Championship 2024: CBS Sports Caps Off First Year of In-House Broadcasts With Saturday's Final in Kansas City
NWSL Championship 2024: CBS Sports Caps Off First Year of In-House Broadcasts Wi...
22/11/2024
Premier League To Establish In-House Media-Operations Business for 2026-27 Season
Premier League To Establish In-House Media-Operations Business for 2026-27 Seaso...
22/11/2024
SailGP Season 5 Set to Be Most Expansive Yet'
SailGP Season 5 set to be most expansive yet' By George Bevir Friday, November 22, 2024 - 10:34 Print This Story SailGP: The New Zealand Sail Grand P...
22/11/2024
SailGP Season 5: New Broadcasters, New Requirements
SailGP Season 5: New broadcasters, new requirements By George Bevir Friday, November 22, 2024 - 10:34 Print This Story The Germany SailGP team in action a...
22/11/2024
SailGP Season 5: AI Cameras to Get Viewers Closer to the Action
SailGP Season 5: AI cameras to get viewers closer to the action By George Bevir Friday, November 22, 2024 - 10:33 Print This Story Getting viewers closer ...
22/11/2024
SailGP Season 5: Getting Umpires Onscreen and More Studio-Based Content
SailGP Season 5: Getting umpires onscreen and more studio-based content By George Bevir Friday, November 22, 2024 - 10:33 Print This Story Australia SailG...
22/11/2024
SailGP Season 5: Enhanced LiveLine Graphics Bring Augmented Reality to Chase Boats
SailGP Season 5: Enhanced LiveLine graphics bring augmented reality to chase boa...
22/11/2024
Full House: Inside Production of the European Curling Championships
Full house: Inside production of the European Curling Championships By Kevin Hilton Thursday, November 21, 2024 - 12:40 Print This Story Curling star Anna...
22/11/2024
NBC Sports President Rick Cordella on How Comcast's NBCU Cable Net Spinoff Will Impact Sports Ops
NBC Sports President Rick Cordella on How Comcast's NBCU Cable Net Spinoff W...
22/11/2024
Rugged Rugby: Conquer or Die' Premieres December 10: A Battle for Supremacy Begins
Back to All News Rugged Rugby: Conquer or Die' Premieres December 10: A Ba...