How GPUs Can Democratize Deep Reinforcement Learning for Robotics Development
11/12/2020
Deep reinforcement learning, a technique used to train AI models for robotics and complex strategy problems, works off the same principle.
In reinforcement learning, a software agent interacts with a real or virtual environment, relying on feedback from rewards to learn the best way to achieve its goal. Like the brain of a puppy in training, a reinforcement learning model uses information it's observed about the environment and its rewards, and determines which action the agent should take next.
To date, most researchers have relied on a combination of CPUs and GPUs to run reinforcement learning models. This means different parts of the computer tackle different steps of the process - including simulating the environment, calculating rewards, choosing what action to take next, actually taking action, and then learning from the experience.
But switching back and forth between CPU cores and powerful GPUs is by nature inefficient, requiring data to be transferred from one part of the system's memory to another at multiple points during the reinforcement learning training process. It's like a student who has to carry a tall stack of books and notes from classroom to classroom, plus the library, before grasping a new concept.
With Isaac Gym, NVIDIA developers have made it possible to instead run the entire reinforcement learning pipeline on GPUs - enabling significant speedups and reducing the hardware resources needed to develop these models.
Here's what this breakthrough means for the deep reinforcement learning process, and how much acceleration it can bring developers.
Reinforcement Learning on GPUs: Simulation to Action When training a reinforcement learning model for a robotics task - like a humanoid robot that walks up and down stairs - it's much faster, safer and easier to use a simulated environment than the physical world. In a simulation, developers can create a sea of virtual robots that can quickly rack up thousands of hours of experience at a task.
If tested solely in the real world, a robot in training could fall down, bump into or mishandle objects - causing potential damage to its own machinery, the object it's interacting with or its surroundings. Testing in simulation provides the reinforcement learning model a space to practice and work out the kinks, giving it a head start when shifting to the real world.
In a typical system today, the NVIDIA PhysX simulation engine runs this experience-gathering phase of the reinforcement learning process on NVIDIA GPUs. But for other steps of the training application, developers have traditionally still used CPUs.
Traditional deep reinforcement learning uses a combination of CPU and GPU computing resources, requiring significant data transfers back and forth. A key part of reinforcement learning training is conducting what's known as the forward pass: First, the system simulates the environment, records a set of observations about the state of the world and calculates a reward for how well the agent did.
The recorded observations become the input to a deep learning policy network, which chooses an action for the agent to take. Both the observations and the rewards are stored for use later in the training cycle.
Finally, the action is sent back to the simulator so that the rest of the environment can be updated in response.
After several rounds of these forward passes, the reinforcement learning model takes a look back, evaluating whether the actions it chose were effective or not. This information is used to update the policy network, and the cycle begins again with the improved model.
GPU Acceleration with Isaac Gym To eliminate the overhead of transferring data back and forth from CPU to GPU during this reinforcement learning training cycle, NVIDIA researchers have developed an approach to run every step of the process on GPUs. This is Isaac Gym, an end-to-end training environment, which includes the PhysX simulation engine and a PyTorch tensor-based API.
Isaac Gym makes it possible for a developer to run tens of thousands of environments simultaneously on a single GPU. That means experiments that previously required a data center with thousands of CPU cores can in some cases be trained on a single workstation.
NVIDIA Isaac Gym runs entire reinforcement learning pipelines on GPUs, enabling significant speedups. Decreasing the amount of hardware required makes reinforcement learning more accessible to individual researchers who don't have access to large data center resources. It can also make the process a lot faster.
A simple reinforcement learning model tasked with getting a humanoid robot to walk can be trained in just a few minutes with Isaac Gym. But the impact of end-to-end GPU acceleration is most useful for more challenging tasks, like teaching a complex robot hand to manipulate a cube into a specific position.
This problem requires significant dexterity by the robot, and a simulation environment that involves domain randomization, a mechanism that allows the learned policy to more easily transfer to a real-world robot.
Research by OpenAI tackled this task with a cluster of more than 6,000 CPU cores plus multiple NVIDIA Tensor Core GPUs - and required about 30 hours of training for the reinforcement learning model to succeed at the task 20 times in a row using a feed-forward network model.
Using just one NVIDIA A100 GPU with Isaac Gym, NVIDIA developers were able to achieve the same level of success in around 10 hours - a single GPU outperforming
LINK: | https://blogs.nvidia.com/blog/2020/12/10/deep-reinforcement-learning-g... |
See more stories from nvidia |
Most recent headlines
09/12/2024
Dalet Named an IDC Innovator in Media and Entertainment
Dalet, a leading technology and service provider for media-rich organizations, today announced that it has been named an IDC Innovator in the IDC Innovators: ...
23/11/2024
Heartwarming Out of My Mind Highlights the Importance of Disability Advocacy
PARK CITY, UTAH - JANUARY 19: (L-R) Judith Light, Rosemarie Dewitt, Luke Kirby, Michael Chernus, Phoebe-Rae Taylor, Sharon M. Draper, Amber Sealey, and Courtney...
23/11/2024
TCLtv+ Adds 23 CBS Fast Channels
LOS ANGELES/NEW YORK TCL's streaming service TCLtv+ has struck a content deal with Paramount Streaming that will add 23 CBS FAST channels to its lineup....
23/11/2024
Viamedia Signs Ad Rep Deals with 7 More Service Providers
LEXINGTON, Ky. The independent advertising rep firm Viamedia has further expanded its sales network with news that it has agreements to manage advertising sale...
23/11/2024
The Trade Desk Jumps Into Streaming TV With Ventura OS
VENTURA, Calif. Programmatic ad giant The Trade Desk is pushing into the streaming technology business with a new operating system called Ventura....
23/11/2024
Writers Guild, 3 PBS Stations Reach Tentative Agreement for New Contract
BOSTON, LOS ANGELES AND NEW YORK The Writers Guild of America has announced that it has reached a tentative agreement with management at PBS member stations WGB...
23/11/2024
Supreme Court to Consider Legality of FCC's Universal Service Fund
WASHINGTON, D.C. The U.S. Supreme Court has agreed to hear an appeal in a case that alleges the FCC does not have the authority to decide how funds from the Uni...
22/11/2024
Michelle Satter to Be Honored at 2025 Sundance Film Festival Gala Celebrating Sundance Institute Presented by Google TV
Sean Wang, Julian Brave NoiseCat, and Emily Kassie to Receive Annual Vanguard Aw...
22/11/2024
Spotify Inks a New Partnership With Bloomsbury To Offer A Greater Assortment of Audiobooks
Our library continues to grow. In 2022, we announced the addition of audiobooks ...
22/11/2024
SBS wins Australian Podcast Publisher of the Year for third year running
SBS wins Australian Podcast Publisher of the Year for third year running 22 November, 2024 Media releases An outstanding slate of multilingual, multicultur...
22/11/2024
Film Lighting: a Cinematic Guide w/ Free Lighting Plots
Home Applications Film Lighting: a Cinematic Guide w/ Free Lighting Plots Your Guide to Film Lighting In this guide, we'll explore the history of fil...
22/11/2024
Fox, Hulu Renew Content Deal
Fox Entertainment and Hulu have renewed a multi-year content distribution agreement that will keep in-season streaming rights for Fox's programming slate on...
22/11/2024
Academy Award-Winning Film Studio Caviar Signs Director Duo MAMA
Academy Award-Winning Film Studio Caviar Signs Director Duo MAMA Brie Clayton November 22, 2024 0 Comments Academy Award-winning independent film stud...
22/11/2024
A Creative Alliance for Black Friday: Independent Software Makers Unite for Photographers
A Creative Alliance for Black Friday: Independent Software Makers Unite for Phot...
22/11/2024
Partial Bold Text using After Effects expressions UPDATED
Partial Bold Text using After Effects expressions UPDATED Graham Quince November 22, 2024 0 Comments Now with an improved expression for Per Word Se...
22/11/2024
John Lawson Steps Down as AWARN Executive Director
WASHINGTON John Lawson, longtime broadcast alerting advocate and founder of the AWARN Alliance, said he is stepping down as its executive director to work full-...
22/11/2024
Red, white and Blue Lucy UK media tech provider reaches across the Pond
Entering the American market follows a period of significant growth for the company By Matthew Corrigan Published: November 22, 2024 Entering the American...
22/11/2024
Warner Bros. Discovery Introduces Shop With Max and Moments
NEW YORK Warner Bros. Discovery Advertising Sales has incorporated Kerv's AI-enhanced technology into its ad-tech platform and launched two new ad offerings...
22/11/2024
EDO, Vizio Ink New Multiyear Smart-TV Data Licensing Pact
NEW YORK Vizio's Inscape, a smart-TV data provider, and EDO said they have extended their longstanding data partnership....
22/11/2024
New Pixotope Reveal Enables AR, Virtual Production Without Green Screens
OSLO, Norway Live augmented reality and virtual production specialist Pixotope Technologies has launched Pixotope Reveal, an AI-powered background segmentation ...
22/11/2024
Nominations Open for 2025 NAB Technology Awards
The National Association of Broadcasters has opened nominations for the 2025 NAB Technology Awards, recognizing excellence in broadcast engineering, digital lea...
22/11/2024
Thanksgiving TV Sports Ad-Spend Binge To Hit $624 Million
In between helpings of turkey and other Thanksgiving Day fare, viewers will see companies dishing up hefty portions of ads on sports programming, with the natio...
22/11/2024
8 Channels Added to MyFree DirecTV Streaming Lineup
Following the recent launch of the MyFree DirecTV free-ad supported package of 70-plus streaming channels, DirecTV has launched eight new channels catering to s...
22/11/2024
Viant, Disney Advertising Expand CTV Ad Collaboration
IRVINE, Calif. Viant Technology said it has expanded its agreement with Disney Advertising that's focused on making premium connected TV, video and display ...
22/11/2024
Xumo To Make Ad Inventory Available Programmatically With PubMatic
PHILADELPHIA and REDWOOD CITY, Calif. Xumo, the streaming platform joint venture of Comcast and Charter Communications, has reached an agreement to make its pre...
22/11/2024
Mediaocean To Acquire Innovid, Will Merge It With Flashtalking
NEW YORK Privately-held ad tech giant Mediaocean has inked a definitive agreement to acquire Innovid, an independent software platform for advertising creation,...
22/11/2024
Viz University introduces new Viz Artist certifications aimed at upskilling designers of all levels
Viz University introduces new Viz Artist certifications aimed at upskilling desi...
22/11/2024
NBC Sports President Rick Cordella on How Comcast's NBCU Cable-Net Spinoff Will Impact Sports Ops
NBC Sports President Rick Cordella on How Comcast's NBCU Cable-Net Spinoff W...
22/11/2024
NWSL Championship 2024: CBS Sports Caps Off First Year of In-House Broadcasts With Saturday's Final in Kansas City
NWSL Championship 2024: CBS Sports Caps Off First Year of In-House Broadcasts Wi...
22/11/2024
Premier League To Establish In-House Media-Operations Business for 2026-27 Season
Premier League To Establish In-House Media-Operations Business for 2026-27 Seaso...
22/11/2024
SailGP Season 5 Set to Be Most Expansive Yet'
SailGP Season 5 set to be most expansive yet' By George Bevir Friday, November 22, 2024 - 10:34 Print This Story SailGP: The New Zealand Sail Grand P...
22/11/2024
SailGP Season 5: New Broadcasters, New Requirements
SailGP Season 5: New broadcasters, new requirements By George Bevir Friday, November 22, 2024 - 10:34 Print This Story The Germany SailGP team in action a...
22/11/2024
SailGP Season 5: AI Cameras to Get Viewers Closer to the Action
SailGP Season 5: AI cameras to get viewers closer to the action By George Bevir Friday, November 22, 2024 - 10:33 Print This Story Getting viewers closer ...
22/11/2024
SailGP Season 5: Getting Umpires Onscreen and More Studio-Based Content
SailGP Season 5: Getting umpires onscreen and more studio-based content By George Bevir Friday, November 22, 2024 - 10:33 Print This Story Australia SailG...
22/11/2024
SailGP Season 5: Enhanced LiveLine Graphics Bring Augmented Reality to Chase Boats
SailGP Season 5: Enhanced LiveLine graphics bring augmented reality to chase boa...
22/11/2024
Full House: Inside Production of the European Curling Championships
Full house: Inside production of the European Curling Championships By Kevin Hilton Thursday, November 21, 2024 - 12:40 Print This Story Curling star Anna...
22/11/2024
NBC Sports President Rick Cordella on How Comcast's NBCU Cable Net Spinoff Will Impact Sports Ops
NBC Sports President Rick Cordella on How Comcast's NBCU Cable Net Spinoff W...
22/11/2024
Sky and Peacock's Original hit drama series The Day of the Jackal scores a second season renewal
Sky and Peacock's Original hit drama series The Day of the Jackal scores a s...
22/11/2024
Rugged Rugby: Conquer or Die' Premieres December 10: A Battle for Supremacy Begins
Back to All News Rugged Rugby: Conquer or Die' Premieres December 10: A Ba...
22/11/2024
Standards Pavilion elevates the role of standards in advancing climate action at COP29
Friday 22 November, Baku, Azerbaijan: As COP29 wraps up in Azerbaijan, the Stand...
22/11/2024
Let's Make Toy Show Day Official
Let's Make Toy Show Day Official The Late Late Toy Show | Friday December 6th | 9:35PM Watch Below The Late Late Toy Show is fast approaching and kids al...
22/11/2024
COOPANS, the Alliance Managing Europe's Largest Air Traffic Volume, Upgrades its Air Traffic Control (ATC) System with Thales
Facebook Twitter LinkedIn COOPANS is a leading international cooperation b...
22/11/2024
Press release
Facebook Twitter LinkedIn Thales confirms that the Parquet National Financier (PNF) in France and the Serious Fraud Office (SFO) in the United Kingdom hav...
22/11/2024
RT General Election 2024: Critical Election Period
The broadcasting regulator, Coimisi n na Me n has removed the traditional broadcast Moratorium for television and radio. In the past, this applied from 14:00 ...
21/11/2024
Neneh Cherry Takes Us on a Musical Journey Inspired by Her Memoir, A Thousand Threads'
Neneh Cherry, a musical trailblazer for more than three decades, is full of stor...
21/11/2024
6 Spotify Audiobook Features That Level Up Your Listening Experience
Since launching our audiobooks offering, we've continuously upped our game on designing a user experience that provides seamless and engaging listening. You...
21/11/2024
SEP 2024 / PAG launches new MPL150 Battery at IBC24
1ST SEPTEMBER 2024 PAG Ltd. UK, the creator of innovative, high-end portable power systems for the film and television industry, has announced the introduction...
21/11/2024
SEP 2024 / PAG Introduces new Cinergy Battery
1ST SEPTEMBER 2024 PAG Ltd. UK, the creator of innovative, high-end, portable power systems for the film and television industry, has announced the introductio...
21/11/2024
Celebrating The Best Of The Best
The HPA Awards exist to honor and recognize the accomplishments of the talented HPA community and to support their efforts with increased awareness and celebrat...
21/11/2024
L3Harris Co-Founder, Chair and CEO Chris Kubasik: Arsenal of Democracy 2.0 Will Require New Ways of Doing Business
He writes in POLITICO: When it comes to protecting our country, innovation and s...