
DeepSeek-R1 is an open model with state-of-the-art reasoning capabilities. Instead of offering direct responses, AI models like DeepSeek-R1 perform reasoning through the chain-of-thought method to generate the best answer.
Performing this sequence of inference passes - using reason to arrive at the best answer - is known as test-time scaling. DeepSeek-R1 is a perfect example of this scaling law, demonstrating why accelerated computing is critical for the demands of agentic AI inference.
As models are allowed to iteratively think through the problem, they create more output tokens and longer generation cycles, so model quality continues to scale. Significant test-time compute is critical to enable both real-time inference and higher-quality responses from reasoning models like DeepSeek-R1, requiring larger inference deployments.
R1 delivers leading accuracy for tasks demanding logical inference, reasoning, math, coding and language understanding while also delivering high inference efficiency.
To help developers securely experiment with these capabilities and build their own specialized agents, the 671-billion-parameter DeepSeek-R1 model is now available as an NVIDIA NIM microservice preview on build.nvidia.com. The DeepSeek-R1 NIM microservice can deliver up to 3,872 tokens per second on a single NVIDIA HGX H200 system.
Developers can test and experiment with the application programming interface (API), which is expected to be available soon as a downloadable NIM microservice, part of the NVIDIA AI Enterprise software platform.
The DeepSeek-R1 NIM microservice simplifies deployments with support for industry-standard APIs. Enterprises can maximize security and data privacy by running the NIM microservice on their preferred accelerated computing infrastructure. Using NVIDIA AI Foundry with NVIDIA NeMo software, enterprises will also be able to create customized DeepSeek-R1 NIM microservices for specialized AI agents.
DeepSeek-R1 - a Perfect Example of Test-Time Scaling DeepSeek-R1 is a large mixture-of-experts (MoE) model. It incorporates an impressive 671 billion parameters - 10x more than many other popular open-source LLMs - supporting a large input context length of 128,000 tokens. The model also uses an extreme number of experts per layer. Each layer of R1 has 256 experts, with each token routed to eight separate experts in parallel for evaluation.
Delivering real-time answers for R1 requires many GPUs with high compute performance, connected with high-bandwidth and low-latency communication to route prompt tokens to all the experts for inference. Combined with the software optimizations available in the NVIDIA NIM microservice, a single server with eight H200 GPUs connected using NVLink and NVLink Switch can run the full, 671-billion-parameter DeepSeek-R1 model at up to 3,872 tokens per second. This throughput is made possible by using the NVIDIA Hopper architecture's FP8 Transformer Engine at every layer - and the 900 GB/s of NVLink bandwidth for MoE expert communication.
Getting every floating point operation per second (FLOPS) of performance out of a GPU is critical for real-time inference. The next-generation NVIDIA Blackwell architecture will give test-time scaling on reasoning models like DeepSeek-R1 a giant boost with fifth-generation Tensor Cores that can deliver up to 20 petaflops of peak FP4 compute performance and a 72-GPU NVLink domain specifically optimized for inference.
Get Started Now With the DeepSeek-R1 NIM Microservice Developers can experience the DeepSeek-R1 NIM microservice, now available on build.nvidia.com. Watch how it works:
With NVIDIA NIM, enterprises can deploy DeepSeek-R1 with ease and ensure they get the high efficiency needed for agentic AI systems.
See notice regarding software product information.
Most recent headlines
12/03/2025
CHICAGO Jeff Lilly has been named WGN-TV director of technology effective March 17, 2025, according to Ric Harris, WGN-TV vice president and general manager....
12/03/2025
MOUNTAIN VIEW, Calif. A new study from LG Ad Solutions indicates that consumers want more features that would allow them to shop for products on the connected T...
12/03/2025
BOTHELL, Wash. The Alliance for IP Media Solutions (AIMS), Advanced Media Workflow Association (AMWA) and the Video Services Forum (VSF) will once again present...
12/03/2025
PHILADELPHIA Comcast announced that it has upgraded Xfinity Internet speeds for more than 20 million customers for no additional cost....
12/03/2025
Create with Maxon: Cinema 4D Fundamentals Workshop - March 12-14
Brie Clayton March 11, 2025
0 Comments
Makin' Waffles with Elly Wade
During Marc...
11/03/2025
By Lucy Spicer
One of the most exciting things about the Sundance Film Festival...
11/03/2025
Salsa is making a comeback, captivating new listeners with its infectious energy...
11/03/2025
For many people, music can serve as a reflection of their roots and upbringing. ...
11/03/2025
The solution delivers reliable, scalable and secure connectivity for critical pu...
11/03/2025
SAN JOSE, Calif. Harmonic has announced that Weigel Broadcasting has deployed Harmonics VOS Media Software, which offers playout-to-delivery capabilities, inclu...
11/03/2025
NEW BERN, N.C. Wheatstone will introduce a Linux audio driver for its WheatNet IP audio network during the 2025 NAB Show, April 5-9, at the Las Vegas Convention...
11/03/2025
NEW YORK A new study finds that as TV viewership for women's sports surged by 131% in 2024, the programming also saw a 56% year-over-year increase in ad im...
11/03/2025
Powerful switcher for news studio
FOR-A, a cutting-edge video broadcast technology company backed by more than 50 years experience, has installed its HVS-1200 ...
11/03/2025
Intinor, Sweden's leading developer of high-quality video over the internet, is unveiling significant advancements to its Direkt series at NAB 2025. With a ...
11/03/2025
Glensound, a leader in high-quality audio systems, is bringing yet more innovation to NAB Show 2025 (Booth N2270, Las Vegas Convention Center, 6-9 April). Well-...
11/03/2025
Whittier, Calif.-based Anaconda Street Productions (ASP) is a leading film and television production company known for creating captivating content that resonat...
11/03/2025
DNAV, a full-service systems integrator, consultant, and manufacturer's representative of leading broadcast, AV, lighting, and display equipment, announces ...
11/03/2025
MNC Software Inc., a global leader in network solutions, is pleased to announce the appointment of Darren Frearson as its new Chief Executive Officer, effective...
11/03/2025
Polar Graphics, a UK leading distributor for the broadcast, post and pro-AV industries, has signed a licensing agreement with XenData to include its XenData Arc...
11/03/2025
Chyron PAINT 9.9 Delivers Sharper Visuals and Smoother Workflows for Sports Tele...
11/03/2025
Alice in Wonderlight Filmed with URSA Mini Pro 12K OLPF
Brie Clayton March 11, 2025
0 Comments
Play captured in 8K60p as part of national initiative t...
11/03/2025
Twisting in the Wind - An Apple Motion Tutorial
Simon Ubsdell March 11, 2025
0 Comments
Another very simple text-based project that uses an unusual co...
11/03/2025
The secret to looking inside a Project in Premiere Pro
Colin Smith March 11, 2025
0 Comments
This tutorial demonstrates the incredible capabilities of...
11/03/2025
New Book Explores How the Quiet Storm Shaped Modern R&B In The Quiet Storm, Berklee Online alumnus Amani Roberts explores how the radio format defined America...
11/03/2025
Slapshot aims to make VFX available for a range of users including independent editors, colourists, content creators and established visual effects companies
B...
11/03/2025
Aiming to attract international film production, the 16 million complex is scheduled to open next year
By Matthew Corrigan
Published: March 11, 2025
Aimi...
11/03/2025
Kamil Pietrzyk, support and projects manager at CueScript, tells TVBEurope how an interest in electronics and IT prompted a career in the broadcast industry
By...
11/03/2025
Underwater DoP Ian Seabrook on Last Breath Credit: Jon Borg / 2024 FOCUS FEATURES LLC
Canadian/British Director of Photography, Ian Seabrook is one of the ...
11/03/2025
Originally opened in the early 1960s as Moonglow Records, The Sound Factory name was coined by Producer David Hassinger, who purchased the studio in 1969. A lit...
11/03/2025
BCNEXXT, a trailblazer in virtualized, cloud-native systems for Linear, VoD, and OTT publishing, proudly marks a decade of redefining broadcast playout. This mi...
11/03/2025
Experience Commerce, an integrated marketing agency within the Cheil Network, has been appointed as the official digital partner for Parle Candy Culture, reinfo...
11/03/2025
PORTSMOUTH, N.H. A new survey highlights how far major streaming platforms have come in terms of offering sports, with findings that show the number of people ...
11/03/2025
TYSONS, Va. Tegna Inc. has announced today that John Trevi o has been named president and general manager at WKYC, the NBC affiliate serving Cleveland, Ohio, ef...
11/03/2025
BOSTON Brightcove has announced that Canela Media is using Brightcove's technologies to power its streaming operations....
11/03/2025
CESSON-SEVIGNE, France Chunghwa Telecom, the leading telecommunications operator in Taiwan, has selected Broadpeak to provide solutions for its streaming servic...
11/03/2025
Leader sets US debut of LPX500 Waveform Monitor for NAB 2025
Brie Clayton March 10, 2025
0 Comments
Test & measurement innovator, Leader Instruments C...
11/03/2025
The Berklee Institute of Jazz and Gender Justice Presents the Grand Gathering The Signature Series concert will feature performances by all nine of the instit...
11/03/2025
Abu Dhabi, UAE and Carlsbad, California 12 March 2025 Space42, (ADX: SPACE42...
11/03/2025
March 11th, 2025 Tribeca Enterprises, SIC, and the Lisbon City Council Announce...
11/03/2025
Drones, Flycam To Highlight 60+-Camera Coverage of THE PLAYERS This Weekend 60+ cameras, NEP's PGA TOUR fleet, 25 talent will be deployed at TPC Sawgrass B...
11/03/2025
The Art of 9:16: How Corporate Content Producers Have Embraced Vertical Video Leaders from the corporate space share their advice and success stories By SVG St...
11/03/2025
SVG Rewind: NFL's Tim Tubito on Elevating Gameday Activations of Super Bowl ...
11/03/2025
New Sponsor Spotlight: Ventuz's David Paniego on the Increasing Synergy Betw...
11/03/2025
Best Snow Day Ever: NESN, Bruins Roll Out NHL's Latest Animated Data-Visuali...
11/03/2025
To view this content, please enable our use of cookies. To do so, click Privacy ...
11/03/2025
Back to All News
Netflix Reveals Official Trailer for The Ladys CompanionPlay Video
Play Video
Entertainment
11 March 2025
GlobalSpain
Link copied to clip...
11/03/2025
Back to All News
Geeta Gandbhir's The Perfect Neighbor to Release on Netfli...
11/03/2025
The ROI of AI: New research on how AI is transforming B2B sales Published on Mar 11, 2025 Categories: Research, Data and insights
LinkedIn Corporate Commun...
11/03/2025
SAN JOSE, Calif. - March 11, 2025 - Harmonic (NASDAQ: HLIT) today announced that...
11/03/2025
Powerful new switcher for news studio...