Pushing Forward the Frontiers of Natural Language Processing

17/09/2021

"Idea generation, not hardware or software, needs to be the bottleneck to the advancement of AI," Bryan Catanzaro, vice president of applied deep learning research at NVIDIA, said this week at the AI Hardware Summit.

"We want the inventors, the researchers and the engineers that are coming up with future AI to be limited only by their own thoughts," Catanzaro told the audience.

Catanzaro leads a team of researchers working to apply the power of deep learning to everything from video games to chip design. At the annual event held in Silicon Valley, he described the work that NVIDIA is doing to enable advancements in AI, with a focus on large language modeling.

CUDA Is for the Dreamers

Training and deploying large neural networks is a tough computational problem, so hardware that's both incredibly fast and highly efficient is a necessity, according to Catanzaro.

But, he explained, the software that accompanies that hardware might be even more important to unlocking further advancements in AI.

"The core of the work that we do involves optimizing hardware and software together, all the way from chips, to systems, to software, frameworks, libraries, compilers, algorithms and applications," he said. "We optimize all of these things to give transformational capabilities to scientists, researchers and engineers around the world."

This end-to-end approach yields chart-topping performance in industry-standard benchmarks, such as MLPerf. It also ensures that developers aren't constrained by the platform as they aim to advance AI.

"CUDA is for the dreamers, CUDA is for the people who are thinking new thoughts," said Catanzaro. "How do they think those thoughts and test them efficiently? They need something general and flexible, and that's why we build what we build."

Large Language Models Are Changing the World

One of the most exciting areas of AI is language modeling, which is enabling groundbreaking applications in natural language understanding and conversational AI.

The complexity of large language models is growing at an incredible rate, with parameter counts doubling every two months.

A well-known example of a large and powerful language model is GPT-3, developed by OpenAI. Packing 175 billion parameters, it required 314 zettaflops (10^21 floating point operations) to train.
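The scale of that figure can be sanity-checked with a back-of-the-envelope estimate. The sketch below uses a common rule of thumb of about 6 floating point operations per parameter per training token, and assumes roughly 300 billion training tokens for GPT-3. Neither number comes from the talk, so the result is only an approximation.

```python
ZETTA = 10**21  # one zettaflop here means 10^21 floating point operations

def training_flops(parameters, tokens, flops_per_param_per_token=6):
    """Rough training cost: ~6 operations per parameter per token (rule of thumb)."""
    return flops_per_param_per_token * parameters * tokens

gpt3_parameters = 175e9   # from the article
assumed_tokens = 300e9    # assumption, not stated in the article

estimate = training_flops(gpt3_parameters, assumed_tokens) / ZETTA
print(f"Estimated training compute: {estimate:.0f} zettaflops")
```

Under these assumptions the estimate lands at roughly 315 zettaflops, close to the 314 quoted above.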

"It's a staggering amount of compute," Catanzaro said. "And that means language modeling is now becoming constrained by economics."

Estimates suggest that GPT-3 would cost about $12 million to train. Catanzaro observed that, despite NVIDIA's tireless work to advance the performance and efficiency of its hardware and software, the rapid growth in model complexity means the cost to train these models is set to grow.

And, according to Catanzaro, this trend suggests that it may not be long before a single model requires more than a billion dollars' worth of compute time to train.

"What would it look like to build a model that took a billion dollars to train a single model? Well, it would need to reinvent an entire company, and you'd need to be able to use it in a lot of different contexts," Catanzaro explained.

Catanzaro expects that these models will unlock an incredible amount of value, inspiring continued innovation. During his talk, Catanzaro showed an example of the surprising capabilities of large language models to solve new tasks without being explicitly trained to do so.

After inputting just a few examples into a large language model - four sentences, two written in English and two their corresponding Spanish translations - he entered a new English sentence, which the model then translated into Spanish properly.

The model was able to do this despite never being trained to do translation. Instead, it was trained - using, as Catanzaro described, an enormous amount of data from the internet - to predict the next word that should follow a given sequence of text.

To perform that very generic task, the model needed to come up with higher-level representations of concepts, such as the existence of languages in general, English and Spanish vocabularies and grammar, and the concept of a translation task, in order to understand the query and properly respond.
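The few-shot setup described above can be sketched as plain prompt construction: the example pairs and the new sentence are concatenated into one text, and the model is simply asked to continue it. The sentences, labels and the `build_few_shot_prompt` helper below are hypothetical illustrations, not the prompt shown in the talk.

```python
def build_few_shot_prompt(examples, query, source="English", target="Spanish"):
    """Lay out example pairs followed by an unanswered query for the model to complete."""
    lines = []
    for src_text, tgt_text in examples:
        lines.append(f"{source}: {src_text}")
        lines.append(f"{target}: {tgt_text}")
    lines.append(f"{source}: {query}")
    lines.append(f"{target}:")  # left open: the model predicts what comes next
    return "\n".join(lines)

# Hypothetical example pairs (two English sentences and their translations).
examples = [
    ("Good morning.", "Buenos días."),
    ("Where is the library?", "¿Dónde está la biblioteca?"),
]
prompt = build_few_shot_prompt(examples, "I like to read books.")
print(prompt)
```

Because the model is only predicting the next words, the translation task is implied entirely by the pattern in the prompt.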

"These language models are first steps towards generalized artificial intelligence with few shot learning, and that is enormously valuable and very exciting," explained Catanzaro.

A Full-Stack Approach to Language Modeling

Catanzaro then went on to describe NVIDIA Megatron, a framework created by NVIDIA using PyTorch for efficiently training the world's largest, transformer-based language models.

A key feature of NVIDIA Megatron, which Catanzaro notes has already been used by various companies and organizations to train large transformer-based models, is model parallelism.

Megatron supports both inter-layer (pipeline) parallelism, which allows different layers of a model to be processed on different devices, and intra-layer (tensor) parallelism, which allows a single layer to be processed by multiple devices.
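The difference between the two forms of model parallelism can be illustrated with a toy example. This is a minimal sketch in plain Python that simulates "devices" with ordinary function calls and treats each layer as a single matrix multiply; it is not Megatron's API, and all function names are hypothetical.

```python
def matmul(x, w):
    """Multiply an (n x k) matrix by a (k x m) matrix, both given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*w)] for row in x]

def split_columns(w, parts):
    """Split a weight matrix into `parts` column shards of equal width."""
    width = len(w[0]) // parts
    return [[row[i * width:(i + 1) * width] for row in w] for i in range(parts)]

def tensor_parallel_layer(x, w, devices):
    """Intra-layer: each simulated device multiplies by its own column shard."""
    partials = [matmul(x, shard) for shard in split_columns(w, devices)]
    # Concatenate the per-device outputs, row by row, to rebuild the full result.
    return [sum((p[r] for p in partials), []) for r in range(len(x))]

def pipeline_parallel_model(x, layers, devices):
    """Inter-layer: each simulated device (stage) owns a contiguous group of layers."""
    per_stage = len(layers) // devices
    for stage in range(devices):
        for w in layers[stage * per_stage:(stage + 1) * per_stage]:
            x = matmul(x, w)  # activations are handed from one stage to the next
    return x

x = [[1, 2], [3, 4]]
wide_layer = [[1, 0, 2, 1], [0, 1, 1, 3]]
square_layers = [[[1, 1], [0, 1]], [[2, 0], [0, 2]], [[1, 0], [1, 1]], [[0, 1], [1, 0]]]

tensor_out = tensor_parallel_layer(x, wide_layer, devices=2)
pipeline_out = pipeline_parallel_model(x, square_layers, devices=2)
```

In both cases the result matches what a single device would compute; only the placement of the work changes.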

Catanzaro further described some of the optimizations that NVIDIA applies to maximize the efficiency of pipeline parallelism and minimize so-called pipeline bubbles, during which a GPU is not performing useful work.

A batch is split into microbatches, the execution of which is pipelined. This boosts the utilization of the GPU resources in a system during training. With further optimizations, pipeline bubbles can be reduced even more.
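To see why splitting a batch into microbatches helps, consider an idealized pipeline in which every microbatch takes one time slot per stage. The sketch below counts how many of a device's time slots are idle under that assumption; the function name and the uniform-cost model are simplifications for illustration, not a description of Megatron's scheduler.

```python
def pipeline_idle_fraction(stages, microbatches):
    """Fraction of each device's time slots spent idle in a simple pipelined pass.

    Assumes one time slot per microbatch per stage, with no communication cost.
    """
    total_slots = microbatches + stages - 1  # slots until the last stage finishes
    busy_slots = microbatches                # slots in which a given stage does work
    return (total_slots - busy_slots) / total_slots

for m in (1, 8, 32):
    print(f"4 stages, {m:2d} microbatches: idle fraction {pipeline_idle_fraction(4, m):.2f}")
```

With a single microbatch on four stages, each device sits idle three quarters of the time; the idle share shrinks as the number of microbatches grows.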

Catanzaro described an optimization, recently published, that entails "round-robining each (pipeline) stage among multiple GPUs so that we can further reduce the amount of pipeline bubble overhead in this schedule."
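One way to picture the round-robin idea is to cut the model into more chunks than there are GPUs and deal them out in turn, so each GPU holds several smaller, non-adjacent pieces. The sketch below shows such an assignment together with a commonly cited idealized estimate of bubble overhead, (p - 1) / (v * m) for p devices, m microbatches and v chunks per device. The function names are hypothetical, and the formula ignores the extra communication this schedule introduces.

```python
def assign_chunks_round_robin(num_chunks, devices):
    """Deal model chunks out to devices in turn: chunk i goes to device i % devices."""
    assignment = {d: [] for d in range(devices)}
    for chunk in range(num_chunks):
        assignment[chunk % devices].append(chunk)
    return assignment

def bubble_overhead(devices, microbatches, chunks_per_device=1):
    """Idealized ratio of idle (bubble) time to useful compute time."""
    return (devices - 1) / (chunks_per_device * microbatches)

print(assign_chunks_round_robin(num_chunks=8, devices=4))
print(bubble_overhead(devices=4, microbatches=8, chunks_per_device=1))
print(bubble_overhead(devices=4, microbatches=8, chunks_per_device=2))
```

Under this estimate, giving each of four GPUs two chunks instead of one halves the bubble overhead for the same number of microbatches.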

Although this optimization puts additional stress on the communication fabric within the system, Catanzaro showed that, by l
LINK: https://blogs.nvidia.com/blog/2021/09/16/nlp-frontiers-ai-hardware-sum...