Sony Pixel Power calrec Sony

Now Hear This: World's Most Flexible Sound Machine Debuts

25/11/2024

A team of generative AI researchers created a Swiss Army knife for sound, one that allows users to control the audio output simply using text.

While some AI models can compose a song or modify a voice, none have the dexterity of the new offering.

Called Fugatto (short for Foundational Generative Audio Transformer Opus 1), it generates or transforms any mix of music, voices and sounds described with prompts using any combination of text and audio files.

For example, it can create a music snippet based on a text prompt, remove or add instruments from an existing song, change the accent or emotion in a voice - even let people produce sounds never heard before.

This thing is wild, said Ido Zmishlany, a multi-platinum producer and songwriter - and cofounder of One Take Audio, a member of the NVIDIA Inception program for cutting-edge startups. Sound is my inspiration. It's what moves me to create music. The idea that I can create entirely new sounds on the fly in the studio is incredible.

A Sound Grasp of Audio We wanted to create a model that understands and generates sound like humans do, said Rafael Valle, a manager of applied audio research at NVIDIA and one of the dozen-plus people behind Fugatto, as well as an orchestral conductor and composer.

Supporting numerous audio generation and transformation tasks, Fugatto is the first foundational generative AI model that showcases emergent properties - capabilities that arise from the interaction of its various trained abilities - and the ability to combine free-form instructions.

Fugatto is our first step toward a future where unsupervised multitask learning in audio synthesis and transformation emerges from data and model scale, Valle said.

A Sample Playlist of Use Cases For example, music producers could use Fugatto to quickly prototype or edit an idea for a song, trying out different styles, voices and instruments. They could also add effects and enhance the overall audio quality of an existing track.

The history of music is also a history of technology. The electric guitar gave the world rock and roll. When the sampler showed up, hip-hop was born, said Zmishlany. With AI, we're writing the next chapter of music. We have a new instrument, a new tool for making music - and that's super exciting.

An ad agency could apply Fugatto to quickly target an existing campaign for multiple regions or situations, applying different accents and emotions to voiceovers.

Language learning tools could be personalized to use any voice a speaker chooses. Imagine an online course spoken in the voice of any family member or friend.

Video game developers could use the model to modify prerecorded assets in their title to fit the changing action as users play the game. Or, they could create new assets on the fly from text instructions and optional audio inputs.

Making a Joyful Noise One of the model's capabilities we're especially proud of is what we call the avocado chair, said Valle, referring to a novel visual created by a generative AI model for imaging.

For instance, Fugatto can make a trumpet bark or a saxophone meow. Whatever users can describe, the model can create.

With fine-tuning and small amounts of singing data, researchers found it could handle tasks it was not pretrained on, like generating a high-quality singing voice from a text prompt.

Users Get Artistic Controls Several capabilities add to Fugatto's novelty.

During inference, the model uses a technique called ComposableART to combine instructions that were only seen separately during training. For example, a combination of prompts could ask for text spoken with a sad feeling in a French accent.

The model's ability to interpolate between instructions gives users fine-grained control over text instructions, in this case the heaviness of the accent or the degree of sorrow.

I wanted to let users combine attributes in a subjective or artistic way, selecting how much emphasis they put on each one, said Rohan Badlani, an AI researcher who designed these aspects of the model.

In my tests, the results were often surprising and made me feel a little bit like an artist, even though I'm a computer scientist, said Badlani, who holds a master's degree in computer science with a focus on AI from Stanford.

The model also generates sounds that change over time, a feature he calls temporal interpolation. It can, for instance, create the sounds of a rainstorm moving through an area with crescendos of thunder that slowly fade into the distance. It also gives users fine-grained control over how the soundscape evolves.

Plus, unlike most models, which can only recreate the training data they've been exposed to, Fugatto allows users to create soundscapes it's never seen before, such as a thunderstorm easing into a dawn with the sound of birds singing.

A Look Under the Hood Fugatto is a foundational generative transformer model that builds on the team's prior work in areas such as speech modeling, audio vocoding and audio understanding.

The full version uses 2.5 billion parameters and was trained on a bank of NVIDIA DGX systems packing 32 NVIDIA H100 Tensor Core GPUs.

Fugatto was made by a diverse group of people from around the world, including India, Brazil, China, Jordan and South Korea. Their collaboration made Fugatto's multi-accent and multilingual capabilities stronger.

One of the hardest parts of the effort was generating a blended dataset that contains millions of audio samples used for training. The team employed a multifaceted strategy to generate data and instructions that considerably expanded the range of tasks the model could perform, while achieving more accurate performance and enabling new tasks without requiring additional data.

They also scrutinized existing datasets to reveal new relationships among the dat
LINK: https://blogs.nvidia.com/blog/fugatto-gen-ai-sound-model/...
See more stories from nvidia

North America Stories

25/11/2024

US Navy Awards L3Harris Nearly $1 Billion IDIQ Contract

Multifunction Information Distribution System Joint Tactical Radio System Terminals provide assured communications, situational awareness, command and control a...

25/11/2024

L3Harris Completes Critical Design Review for Space Development Agency Satellite Radios

Rendering of L3Harris Tranche 2 Tracking (T2TRK) imagery for the Space Developme...

25/11/2024

FOX, Disney Capitalize On Multiplatform Viewing in Nielsen's October Media Distributor Gauge

FOX hits company-high water mark in the Media Distributor Gauge with 8.4% of TV;...

25/11/2024

Advertising Research Foundation Proposes Updating TV Categories

NEW YORK The Advertising Research Foundation (ARF) has proposed a new framework that would reclassify how U.S. households connect to TV. It would replace the pa...

25/11/2024

Fubo To Launch 18 NBCU FAST Channels

NEW YORK FuboTV and NBCUniversal have announced a deal that will see the launch of 18 NBCU FAST channels on the virtual multichannel video programming distribut...

25/11/2024

SBE Adds New Certification: Certified Production Technologist

The Society of Broadcast Engineers said it has added a new credential to its program of certification, Certified Production Technologist....

25/11/2024

New Chyron Virtual Placement 7.7 Offers Football-Specific Features

MELVILLE, N.Y. Chyron has updated its Virtual Placement product with the release of version 7.7 offering new tools and enhancements for football telecasts....

25/11/2024

Alien: Romulus Graded with DaVinci Resolve Studio

Alien: Romulus Graded with DaVinci Resolve Studio Brie Clayton November 25, 2024 0 Comments Colorist Mitch Paulson uses modern tools to recreate and b...

25/11/2024

Satellite Television & Radio Australia purchases Appear TV XC Platform from Magna Systems for VAST

Satellite Television & Radio Australia purchases Appear TV XC Platform from Magn...

25/11/2024

SVG Summit 2024: FanDuel Sports Network, MLB Local Media, MSG Networks, and NBC Sports Bay Area Talk Innovation in Regional Sports

SVG Summit 2024: FanDuel Sports Network, MLB Local Media, MSG Networks, and NBC ...

25/11/2024

Making a Splash: SailGP's Chief Content Officer Melissa Lawton Discusses Content and Tech Innovation

Making a Splash: SailGP's Chief Content Officer Melissa Lawton Discusses Con...

25/11/2024

Remembering Neil Flagg: The NBC Sports Mainstay Leaves an Industry Legacy in Sons Ross and Kevin

Remembering Neil Flagg: The NBC Sports Mainstay Leaves an Industry Legacy in Son...

25/11/2024

Warner Bros. Discovery's Dylan Boucherle Shares the Latest on Virtual Production and TNT Sports' Investment in an LED Volume

Warner Bros. Discovery's Dylan Boucherle Shares the Latest on Virtual Produc...

25/11/2024

Netflix Unveils Behind the Scenes Documentary of 'The Helicopter Heist'

Back to All News Netflix Unveils Behind the Scenes Documentary of The Helicopter Heist Entertainment 25 November 2024 GlobalSweden Link copied to clipboard...

25/11/2024

Netflix unveils the trailer and key art for One Hundred Years of Solitude

Back to All News Netflix unveils the trailer and key art for One Hundred Years of SolitudePlay Video Play Video Entertainment 25 November 2024 GlobalColomb...

25/11/2024

Why Workforce Development Is Key to Reaping AI Benefits

AI is changing industries and economies worldwide. Workforce development is central to ensuring the changes benefit all of us, as Louis Stewart, head of strate...

25/11/2024

Now Hear This: World's Most Flexible Sound Machine Debuts

A team of generative AI researchers created a Swiss Army knife for sound, one that allows users to control the audio output simply using text. While some AI mo...

23/11/2024

Heartwarming Out of My Mind Highlights the Importance of Disability Advocacy

PARK CITY, UTAH - JANUARY 19: (L-R) Judith Light, Rosemarie Dewitt, Luke Kirby, Michael Chernus, Phoebe-Rae Taylor, Sharon M. Draper, Amber Sealey, and Courtney...

23/11/2024

TCLtv+ Adds 23 CBS Fast Channels

LOS ANGELES/NEW YORK TCL's streaming service TCLtv+ has struck a content deal with Paramount Streaming that will add 23 CBS FAST channels to its lineup....

23/11/2024

Viamedia Signs Ad Rep Deals with 7 More Service Providers

LEXINGTON, Ky. The independent advertising rep firm Viamedia has further expanded its sales network with news that it has agreements to manage advertising sale...

23/11/2024

The Trade Desk Jumps Into Streaming TV With Ventura OS

VENTURA, Calif. Programmatic ad giant The Trade Desk is pushing into the streaming technology business with a new operating system called Ventura....

23/11/2024

Writers Guild, 3 PBS Stations Reach Tentative Agreement for New Contract

BOSTON, LOS ANGELES AND NEW YORK The Writers Guild of America has announced that it has reached a tentative agreement with management at PBS member stations WGB...

23/11/2024

Supreme Court to Consider Legality of FCC's Universal Service Fund

WASHINGTON The U.S. Supreme Court has agreed to hear an appeal in a case that alleges the Federal Communications Commission does not have the authority to decid...

22/11/2024

Michelle Satter to Be Honored at 2025 Sundance Film Festival Gala Celebrating Sundance Institute Presented by Google TV

Sean Wang, Julian Brave NoiseCat, and Emily Kassie to Receive Annual Vanguard Aw...

22/11/2024

Fox, Hulu Renew Content Deal

Fox Entertainment and Hulu have renewed a multi-year content distribution agreement that will keep in-season streaming rights for Fox's programming slate on...

22/11/2024

Academy Award-Winning Film Studio Caviar Signs Director Duo MAMA

Academy Award-Winning Film Studio Caviar Signs Director Duo MAMA Brie Clayton November 22, 2024 0 Comments Academy Award-winning independent film stud...

22/11/2024

A Creative Alliance for Black Friday: Independent Software Makers Unite for Photographers

A Creative Alliance for Black Friday: Independent Software Makers Unite for Phot...

22/11/2024

Partial Bold Text using After Effects expressions UPDATED

Partial Bold Text using After Effects expressions UPDATED Graham Quince November 22, 2024 0 Comments Now with an improved expression for Per Word Se...

22/11/2024

John Lawson Steps Down as AWARN Executive Director

WASHINGTON John Lawson, longtime broadcast alerting advocate and founder of the AWARN Alliance, said he is stepping down as its executive director to work full-...

22/11/2024

Warner Bros. Discovery Introduces Shop With Max and Moments

NEW YORK Warner Bros. Discovery Advertising Sales has incorporated Kerv's AI-enhanced technology into its ad-tech platform and launched two new ad offerings...

22/11/2024

EDO, Vizio Ink New Multiyear Smart-TV Data Licensing Pact

NEW YORK Vizio's Inscape, a smart-TV data provider, and EDO said they have extended their longstanding data partnership....

22/11/2024

New Pixotope Reveal Enables AR, Virtual Production Without Green Screens

OSLO, Norway Live augmented reality and virtual production specialist Pixotope Technologies has launched Pixotope Reveal, an AI-powered background segmentation ...

22/11/2024

Nominations Open for 2025 NAB Technology Awards

The National Association of Broadcasters has opened nominations for the 2025 NAB Technology Awards, recognizing excellence in broadcast engineering, digital lea...

22/11/2024

Thanksgiving TV Sports Ad-Spend Binge To Hit $624 Million

In between helpings of turkey and other Thanksgiving Day fare, viewers will see companies dishing up hefty portions of ads on sports programming, with the natio...

22/11/2024

8 Channels Added to MyFree DirecTV Streaming Lineup

Following the recent launch of the MyFree DirecTV free-ad supported package of 70-plus streaming channels, DirecTV has launched eight new channels catering to s...

22/11/2024

Viant, Disney Advertising Expand CTV Ad Collaboration

IRVINE, Calif. Viant Technology said it has expanded its agreement with Disney Advertising that's focused on making premium connected TV, video and display ...

22/11/2024

Xumo To Make Ad Inventory Available Programmatically With PubMatic

PHILADELPHIA and REDWOOD CITY, Calif. Xumo, the streaming platform joint venture of Comcast and Charter Communications, has reached an agreement to make its pre...

22/11/2024

Mediaocean To Acquire Innovid, Will Merge It With Flashtalking

NEW YORK Privately-held ad tech giant Mediaocean has inked a definitive agreement to acquire Innovid, an independent software platform for advertising creation,...

22/11/2024

Viz University introduces new Viz Artist certifications aimed at upskilling designers of all levels

Viz University introduces new Viz Artist certifications aimed at upskilling desi...

22/11/2024

NBC Sports President Rick Cordella on How Comcast's NBCU Cable-Net Spinoff Will Impact Sports Ops

NBC Sports President Rick Cordella on How Comcast's NBCU Cable-Net Spinoff W...

22/11/2024

NWSL Championship 2024: CBS Sports Caps Off First Year of In-House Broadcasts With Saturday's Final in Kansas City

NWSL Championship 2024: CBS Sports Caps Off First Year of In-House Broadcasts Wi...

22/11/2024

Premier League To Establish In-House Media-Operations Business for 2026-27 Season

Premier League To Establish In-House Media-Operations Business for 2026-27 Seaso...

22/11/2024

SailGP Season 5 Set to Be Most Expansive Yet'

SailGP Season 5 set to be most expansive yet' By George Bevir Friday, November 22, 2024 - 10:34 Print This Story SailGP: The New Zealand Sail Grand P...

22/11/2024

SailGP Season 5: New Broadcasters, New Requirements

SailGP Season 5: New broadcasters, new requirements By George Bevir Friday, November 22, 2024 - 10:34 Print This Story The Germany SailGP team in action a...

22/11/2024

SailGP Season 5: AI Cameras to Get Viewers Closer to the Action

SailGP Season 5: AI cameras to get viewers closer to the action By George Bevir Friday, November 22, 2024 - 10:33 Print This Story Getting viewers closer ...

22/11/2024

SailGP Season 5: Getting Umpires Onscreen and More Studio-Based Content

SailGP Season 5: Getting umpires onscreen and more studio-based content By George Bevir Friday, November 22, 2024 - 10:33 Print This Story Australia SailG...

22/11/2024

SailGP Season 5: Enhanced LiveLine Graphics Bring Augmented Reality to Chase Boats

SailGP Season 5: Enhanced LiveLine graphics bring augmented reality to chase boa...

22/11/2024

Full House: Inside Production of the European Curling Championships

Full house: Inside production of the European Curling Championships By Kevin Hilton Thursday, November 21, 2024 - 12:40 Print This Story Curling star Anna...

22/11/2024

NBC Sports President Rick Cordella on How Comcast's NBCU Cable Net Spinoff Will Impact Sports Ops

NBC Sports President Rick Cordella on How Comcast's NBCU Cable Net Spinoff W...

22/11/2024

Rugged Rugby: Conquer or Die' Premieres December 10: A Battle for Supremacy Begins

Back to All News Rugged Rugby: Conquer or Die' Premieres December 10: A Ba...