Sony Pixel Power calrec Sony

Math Test? No Problems: NVIDIA Team Scores Kaggle Win With Reasoning Model

15/04/2025

The final days of the AI Mathematical Olympiad's latest competition were a transcontinental relay for team NVIDIA.

Every evening, two team members on opposite ends of the U.S. would submit an AI reasoning model to Kaggle - the online Olympics of data science and machine learning. They'd wait a tense five hours before learning how well the model tackled a sample set of 50 complex math problems.

After seeing the results, the U.S. team would pass the baton to teammates waking up in Armenia, Finland, Germany and Northern Ireland, who would spend their day testing, modifying and optimizing different model versions.

Every night I'd be so disappointed in our score, but then I'd wake up and see the messages that came in overnight from teammates in Europe, said Igor Gitman, senior applied scientist. My hopes would go up and we'd try again.

While the team was disheartened by their lack of improvement on the public dataset during the competition's final days, the real test of an AI model is how well it can generalize to unseen data. That's where their reasoning model leapt to the top of the leaderboard - correctly answering 34 out of 50 Olympiad questions within a five-hour time limit using a cluster of four NVIDIA L4 GPUs.

We got the magic in the end, said Northern Ireland-based team member Darragh Hanley, a Kaggle grandmaster and senior large language model (LLM) technologist.

Building a Winning Equation The NVIDIA team competed under the name NemoSkills - a nod to their use of the NeMo-Skills collection of pipelines for accelerated LLM training, evaluation and inference. The seven members each contributed different areas of expertise, spanning LLM training, model distillation and inference optimization.

For the Kaggle challenge, over 2,200 participating teams submitted AI models tasked with solving 50 math questions - complex problems at the National Olympiad level, spanning algebra, geometry, combinatorics and number theory - within five hours.

https://blogs.nvidia.com/wp-content/uploads/2025/04/Sample-Reasoning-AI.mp4

The team's winning model uses a combination of natural language reasoning and Python code execution.

To complete this inference challenge on the small cluster of NVIDIA L4 GPUs available via Kaggle, the NemoSkills team had to get creative.

Their winning model used Qwen2.5-14B-Base, a foundation model with chain-of-thought reasoning capabilities which the team fine-tuned on millions of synthetically generated solutions to math problems.

These synthetic solutions were primarily generated by two larger reasoning models - DeepSeek-R1 and QwQ-32B - and used to teach the team's foundation model via a form of knowledge distillation. The end result was a smaller, faster, long-thinking model capable of tackling complex problems using a combination of natural language reasoning and Python code execution.

To further boost performance, the team's solution reasons through multiple long-thinking responses in parallel before determining a final answer. To optimize this process and meet the competition's time limit, the team also used an innovative early-stopping technique.

A reasoning model might, for example, be set to answer a math problem 12 different times before picking the most common response. Using the asynchronous processing capabilities of NeMo-Skills and NVIDIA TensorRT-LLM, the team was able to monitor and exit inference early if the model had already converged at the correct answer four or more times.

TensorRT-LLM also enabled the team to harness FP8 quantization, a compression method that resulted in a 1.5x speedup over using the more commonly used FP16 format. ReDrafter, a speculative decoding technique developed by Apple, was used for a further 1.8x speedup.

The final model performed even better on the competition's unseen final dataset than it did on the public dataset - a sign that the team successfully built a generalizable model and avoided overfitting their LLM to the sample data.

Even without the Kaggle competition, we'd still be working to improve AI reasoning models for math, said Gitman. But Kaggle gives us the opportunity to benchmark and discover how well our models generalize to a third-party dataset.

Sharing the Wealth The team will soon release a technical report detailing the techniques used in their winning solution - and plans to share their dataset and a series of models on Hugging Face. The advancements and optimizations they made over the course of the competition have been integrated into NeMo-Skills pipelines available on GitHub.

Key data, technology, and insights from this pipeline were also used to train the just-released NVIDIA Llama Nemotron Ultra model.

Throughout this collaboration, we used tools across the NVIDIA software stack, said Christof Henkel, a member of the Kaggle Grandmasters of NVIDIA, known as KGMON. By working closely with our LLM research and development teams, we're able to take what we learn from the competition on a day-to-day basis and push those optimizations into NVIDIA's open-source libraries.

After the competition win, Henkel regained the title of Kaggle World Champion - ranking No. 1 among the platform's over 23 million users. Another teammate, Finland-based Ivan Sorokin, earned the Kaggle Grandmaster title, held by just over 350 people around the world.

For their first-place win, the group also won a $262,144 prize that they're directing to the NVIDIA Foundation to support charitable organizations.

Meet the full team - Igor Gitman, Darragh Hanley, Christof Henkel, Ivan Moshkov, Benedikt Schifferer, Ivan Sorokin and Shubham Toshniwal - in the video below:

Sample math questions in the featured visual above are from the 2025 American Invitational Mathematics Examination. Find the full set of questions and solutions on the Art
LINK: https://blogs.nvidia.com/blog/reasoning-ai-math-olympiad/...
See more stories from nvidia

North America Stories

16/04/2025

Premion Expands Omnichannel and Ad Tech Capabilities

NEW YORK Premion, a CTV/OTT advertising solution for regional and local advertisers, has launched expanded capabilities and new tools for advertisers to execute...

16/04/2025

High End TV Selects Lawo mc56 MkIII for Flagship Production Truck

High End TV, a major provider of mobile broadcast and recording services, has installed a Lawo mc 56 MkIII console in its flagship mobile production truck, Symp...

16/04/2025

SMPTE Issues Call For Technical Papers

WHITE PLAINS, N.Y. The Society of Motion Picture and Television Engineers has issued a call for technical papers for its 2025 Media Technology Summit, Oct. 13-1...

16/04/2025

MyFree DirecTV Adds Eight NBCU Channels

DirecTV's free streaming service MyFree DirecTV has just added another eight channels from NBCUniversal....

16/04/2025

New ARRI Camera Companion App Offers Personalized Camera Control

MUNICH ARRI has officially launched of its Camera Companion App for iPhone, iPad, and Mac with Apple Silicon, enabling users to configure their own personalized...

16/04/2025

COW Jobs: Offering mentorship - looking for a video editor for game dev YouTube contentCOW Jobs:

COW Jobs: Offering mentorship - looking for a video editor for game dev YouTube ...

16/04/2025

Charlie Puths Homecoming Was Also a Victory Lap

Charlie Puths Homecoming Was Also a Victory Lap The pop star, songwriter, and producer talked about his career and shared stories behind his biggest hits, inc...

16/04/2025

April 15, 2025

The very first structural images of a tuberculosis-fighting virus New insights from Scripps Research could advance phage therapies for the world's deadliest...

15/04/2025

The Gauge: March Madness Lifts Cable, Streaming Competition Grows as Seasonal Trends Take Effect

Seven different platforms deliver March's most-watched streaming titles. St...

15/04/2025

FCC Seeks Public Comments on NABs NextGen TV Proposals

WASHINGTON The Federal Communication Commission's Media Bureau has issued a Notice seeking comments on a major filing by the National Association of Broadca...

15/04/2025

Comcast Launches Five-Year Price Guarantee for Xfinity Internet

PHILADELPHIA As cable operators face increased competition from fixed wireless plans, Comcast is introducing the option to choose a five-year price guarantee wh...

15/04/2025

Kaleidescape Joins 8K Association

MOUNTAIN VIEW, Calif. Kaleidescape has announced that it has joined the 8K Association (8KA)....

15/04/2025

Sinclair Urges FCC to Abolish Station Ownership Rules, Sunset ATSC 1.0

WASHINGTON In a wide-ranging filing with the Federal Communications Commission, Sinclair applauded the agency's push towards deregulation while arguing that...

15/04/2025

Math Test? No Problems: NVIDIA Team Scores Kaggle Win With Reasoning Model

The final days of the AI Mathematical Olympiad's latest competition were a transcontinental relay for team NVIDIA. Every evening, two team members on oppos...

15/04/2025

Everywhere, All at Once: NVIDIA Drives the Next Phase of AI Growth

Every company and country wants to grow and create economic opportunity - but they need virtually limitless intelligence to do so. Working with its ecosystem pa...

15/04/2025

Thousands of NVIDIA Grace Blackwell GPUs Now Live at CoreWeave, Propelling Development for AI Pioneers

CoreWeave today became one of the first cloud providers to bring NVIDIA GB200 NV...

15/04/2025

From Homer to Mickey, How ESPN Is Blazing the Live Animated Broadcast Trail

From Homer to Mickey, How ESPN Is Blazing the Live Animated Broadcast Trail ESPNs Amy Nelson, Sparky Sparrgrove, and Spike Szykowny offer an inside look at the ...

15/04/2025

SVG College Sports Media Awards 2025: Final Deadline to Enter is Wednesday

SVG College Sports Media Awards 2025: Final Deadline to Enter is Wednesday All entries must be received by end-of-day April 16 By Brandon Costa, Director of Di...

15/04/2025

Football Summit 2025: BBC Sport and Sunset+Vine Share Plans for UEFA Women's Euro 2025

Football Summit 2025: BBC Sport and Sunset Vine share plans for UEFA Women's...

15/04/2025

SVG New Sponsor Spotlight: ScorePlay Cofounder and CEO Vic Tixier on Media Management Solutions For Sports People By Sports People

SVG New Sponsor Spotlight: ScorePlay Cofounder and CEO Vic Tixier on Media Manag...

15/04/2025

Inside IOWN: Envisioning a High-Speed, High-Capacity Future for Live Productions

Inside IOWN: Envisioning a high-speed, high-capacity future for live productions By Joe OHalloran Tuesday, April 15, 2025 - 09:38 Print This Story As dele...

15/04/2025

Neither Fair Nor Objective, Mediapro Reacts Strongly to LaLiga Decision to Award Production Contract to HBS

Neither fair nor objective , Mediapro reacts strongly to LaLiga decision to awar...

15/04/2025

HBS and NVP Take LaLiga Production and Distribution for First and Second Division Matches on Five-Year Contract

HBS and NVP take LaLiga production and distribution for first and second divisio...

15/04/2025

LA28 Details Venue Plans for 2028 Summer Games

LA28 Details Venue Plans for 2028 Summer Games By Ken Kerschbaumer, Editorial Director Tuesday, April 15, 2025 - 1:37 pm Print This Story | Subscribe St...

14/04/2025

Give Me the Backstory: Get to Know Andrew Ahn, the Filmmaker Behind The Wedding Banquet

By Bailey Pennick One of the most exciting things about the Sundance Film Festi...

14/04/2025

Celebrating 190 Years of Excellence: L3Harris Calzoni's Legacy in Italy's Naval Industry

For 190 years, innovation and technology from our talented workforce have distin...

14/04/2025

ReachTV Taps Nielsen for Ad Measurement Of Live Sports + Original Content in Airports

For the first time, advertisers and agencies can directly compare campaign perfo...

14/04/2025

FCC Seeks Public Comments on NAB NextGen TV Proposals

WASHINGTON The Federal Communication Commission's Media Bureau has issued a Notice seeking comments on a major filing by the National Association of Broadca...

14/04/2025

magic multi media unveils new customizable features and advanced AI for EDIUS 11 at NAB Show 2025

magic multi media unveils new customizable features and advanced AI for EDIUS 11...

14/04/2025

Rotating around an object or layer in After Effects

Rotating around an object or layer in After Effects Graham Quince April 14, 2025 0 Comments One of the most frequently asked questions on forums is: ...

14/04/2025

Does Clearing the Cache Really Work in Premiere Pro?

Does Clearing the Cache Really Work in Premiere Pro? Colin Smith April 14, 2025 0 Comments This tutorial clearly outlines the types of problems that c...

14/04/2025

Ross Video and Arcturus announce creative partnership to revolutionize virtual production

Ottawa, April 14, 2025 - Ross Video is pleased to announce a new creative partne...

14/04/2025

NAB 2025 in Review: Broadcast Audio Faces a Future of Tariff-Induced Higher Costs

NAB 2025 in Review: Broadcast Audio Faces a Future of Tariff-Induced Higher Cost...

14/04/2025

Changing the World: Behind the Collaborative Production of the 2025 Special Olympics World Winter Games

Changing the world: Behind the collaborative production of the 2025 Special Olym...

14/04/2025

From Paris to Milano Cortina: The International Paralympic Committee on the Movement's French Triumph and Looking Ahead to Italian Success

From Paris to Milano Cortina: The International Paralympic Committee on the move...

14/04/2025

SVG College Summit 2025: Jimmy Platt, ESPN's College Football Playoff, Women's Basketball Final Four Director, to Keynote

SVG College Summit 2025: Jimmy Platt, ESPN's College Football Playoff, Women...

14/04/2025

NAB 2025 in Review: SVG Audio Roundtable Reflects On a Changing Industry

NAB 2025 in Review: SVG Audio Roundtable Reflects On a Changing Industry Global sports, speech intelligibility, digital microphones are on the table By Dan Dal...

14/04/2025

NAB 2025 in Review: The Pairing of AI and Audio Is Possible

NAB 2025 in Review: The Pairing of AI and Audio Is Possible But it won't necessarily happen tomorrow By Dan Daley, Audio Editor Monday, April 14, 2025 - ...

14/04/2025

WNBA Draft 2025: ESPN's Onsite Effort Fits Into NYC Grid for National Telecast From The Shed at Hudson Yards

WNBA Draft 2025: ESPN's Onsite Effort Fits Into NYC Grid for National Teleca...

14/04/2025

Netflix ISP Speed Index for March 2025

Back to All News Netflix ISP Speed Index for March 2025 Product 14 April 2025 Global Link copied to clipboard Forty-five percent of Internet Service Provi...

14/04/2025

NVIDIA to Manufacture American-Made AI Supercomputers in US for First Time

NVIDIA is working with its manufacturing partners to design and build factories that, for the first time, will produce NVIDIA AI supercomputers entirely in the ...

13/04/2025

High Stakes, Higher Thrills: Netflix's Jewel Thief - The Heist Begins' Trailer Sets Heart Racing

Back to All News High Stakes, Higher Thrills: Netflix's Jewel Thief - The ...

12/04/2025

CNBC Ranks New York Yankees as Most Valuable MLB Team

With the Major League Baseball season still in its first month, CNBC has issued its first ever list of the most valuable MLB franchises, with the New York Yanke...

12/04/2025

Echostar Urges FCC to Reduce Blackouts with Major Rule Changes

WASHINGTON The Federal Communication Commissions request for comments on outdated rules that should be deleted has prompted Echostar, the owner of Dish, to file...

12/04/2025

OWC Launches OWC SoftRAID 8.5 for macOS and Windows - for Greater Reliability, Functionality, and Performance

OWC Launches OWC SoftRAID 8.5 for macOS and Windows - for Greater Reliability, F...

12/04/2025

Career Jam 2025: Insights, Images, and Inspiration

Career Jam 2025: Insights, Images, and Inspiration Charlie Puth, Tiny Desk, top A&Rs, and more came together for a day of professional development and energet...

11/04/2025

Unlock IP Easy as SDI: Modernizing Your Series 2 Router with the PassThrough Card

Unlock IP Easy as SDI: Modernizing Your Series 2 Router with the PassThrough Car...

11/04/2025

One to One: John & Yoko is an Immersive '70s Viewing Experience

(L-R) Kevin Macdonald and Sam Rice-Edwards attend the 2025 Sundance Film Festival premiere of One to One: John & Yoko at The Ray Theatre on January 23, 2025, ...

11/04/2025

L3Harris Propulsion Solutions to Keep U.S. Adversaries Guessing

[This article has been updated, original publish date: April 8, 2024] Since the dawn of the Space Age, objects in space have traveled along predictable, gravit...

11/04/2025

Standards: The Next Generation

The development and adoption of the SMPTE ST 2110 IP standards suite has been so prominent in both broadcast and pro AV that, in truth, the coverage of some oth...