
Just as there are widely understood empirical laws of nature - for example, what goes up must come down, or every action has an equal and opposite reaction - the field of AI was long defined by a single idea: that more compute, more training data and more parameters makes a better AI model.
However, AI has since grown to need three distinct laws that describe how applying compute resources in different ways impacts model performance. Together, these AI scaling laws - pretraining scaling, post-training scaling and test-time scaling, also called long thinking - reflect how the field has evolved with techniques to use additional compute in a wide variety of increasingly complex AI use cases.
The recent rise of test-time scaling - applying more compute at inference time to improve accuracy - has enabled AI reasoning models, a new class of large language models (LLMs) that perform multiple inference passes to work through complex problems, while describing the steps required to solve a task. Test-time scaling requires intensive amounts of computational resources to support AI reasoning, which will drive further demand for accelerated computing.
What Is Pretraining Scaling? Pretraining scaling is the original law of AI development. It demonstrated that by increasing training dataset size, model parameter count and computational resources, developers could expect predictable improvements in model intelligence and accuracy.
Each of these three elements - data, model size, compute - is interrelated. Per the pretraining scaling law, outlined in this research paper, when larger models are fed with more data, the overall performance of the models improves. To make this feasible, developers must scale up their compute - creating the need for powerful accelerated computing resources to run those larger training workloads.
This principle of pretraining scaling led to large models that achieved groundbreaking capabilities. It also spurred major innovations in model architecture, including the rise of billion- and trillion-parameter transformer models, mixture of experts models and new distributed training techniques - all demanding significant compute.
And the relevance of the pretraining scaling law continues - as humans continue to produce growing amounts of multimodal data, this trove of text, images, audio, video and sensor information will be used to train powerful future AI models.
Pretraining scaling is the foundational principle of AI development, linking the size of models, datasets and compute to AI gains. Mixture of experts, depicted above, is a popular model architecture for AI training. What Is Post-Training Scaling? Pretraining a large foundation model isn't for everyone - it takes significant investment, skilled experts and datasets. But once an organization pretrains and releases a model, they lower the barrier to AI adoption by enabling others to use their pretrained model as a foundation to adapt for their own applications.
This post-training process drives additional cumulative demand for accelerated computing across enterprises and the broader developer community. Popular open-source models can have hundreds or thousands of derivative models, trained across numerous domains.
Developing this ecosystem of derivative models for a variety of use cases could take around 30x more compute than pretraining the original foundation model.
Developing this ecosystem of derivative models for a variety of use cases could take around 30x more compute than pretraining the original foundation model.
Post-training techniques can further improve a model's specificity and relevance for an organization's desired use case. While pretraining is like sending an AI model to school to learn foundational skills, post-training enhances the model with skills applicable to its intended job. An LLM, for example, could be post-trained to tackle a task like sentiment analysis or translation - or understand the jargon of a specific domain, like healthcare or law.
The post-training scaling law posits that a pretrained model's performance can further improve - in computational efficiency, accuracy or domain specificity - using techniques including fine-tuning, pruning, quantization, distillation, reinforcement learning and synthetic data augmentation.
Fine-tuning uses additional training data to tailor an AI model for specific domains and applications. This can be done using an organization's internal datasets, or with pairs of sample model input and outputs.
Distillation requires a pair of AI models: a large, complex teacher model and a lightweight student model. In the most common distillation technique, called offline distillation, the student model learns to mimic the outputs of a pretrained teacher model.
Reinforcement learning, or RL, is a machine learning technique that uses a reward model to train an agent to make decisions that align with a specific use case. The agent aims to make decisions that maximize cumulative rewards over time as it interacts with an environment - for example, a chatbot LLM that is positively reinforced by thumbs up reactions from users. This technique is known as reinforcement learning from human feedback (RLHF). Another, newer technique, reinforcement learning from AI feedback (RLAIF), instead uses feedback from AI models to guide the learning process, streamlining post-training efforts.
Best-of-n sampling generates multiple outputs from a language model and selects the one with the highest reward score based on a reward model. It's often used to improve an AI's outputs without modifying model parameters, offering an alternative to fine-tuning with reinforcement learning.
Search methods explore a range of potential decision paths before selecting a final output. This post-training technique can iteratively improve the model's responses
Most recent headlines
01/04/2025
USHER's London takeover is in full swing. After kicking off his sold-out run of shows at the O2 Arena to rave reviews, the R&B icon joined forces with Spoti...
01/04/2025
Innovative program empowers partners with growth, efficiency and collaboration
Herndon, Va., April 1, 2025 ST Engineering iDirect, a global leader in satelli...
01/04/2025
MELBOURNE, Fla., April 1, 2025 - L3Harris Technologies (NYSE: LHX) will release its first quarter 2025 financial results before the market opens on Thursday, Ap...
01/04/2025
Calrec Craft Interview: Aston Fearon, Sound Supervisor In this craft interview, Aston Fearon speaks to us about how his career in sound started, projects he'...
01/04/2025
MONT-SAINT-GUIBERT, Belgium Telestream has integrated intoPIX's JPEG XS technology into Telestream's PRISM waveform monitors, which Telestream says will...
01/04/2025
BURLINGTON, Mass. Avid has signed a strategic collaboration agreement with Amazon Web Services (AWS), to deliver a cloud-based production framework that helps f...
01/04/2025
LONDON and NEW YORK The United Football League (UFL) has signed a new global partnership with sports broadcaster DAZN to broadcast every game of the UFL's 2...
01/04/2025
In a groundbreaking bid to streamline and democratize the production process, Netflix has laid out how it is developing a new Media Production Suite, that t...
01/04/2025
PHILADELPHIA Comcast Business has announced that it has completed its acquisition of Nitel, a U.S. managed services provider headquartered in Chicago, from inte...
01/04/2025
NEW YORK A team of research industry veterans, led by Tod Johnson have launched a new consumer insights and analytics platform, Tenetic, that offers both local ...
01/04/2025
V-Nova, a leading provider of compression solutions, today announced its inaugural participation in a patent pool, joining the Access Advance HEVC Patent Pool. ...
01/04/2025
Cinnafilm, a global leader in video optimization solutions, today announced that it will launch Tachyon LIVE, its groundbreaking live IP standards and format co...
01/04/2025
HighField AI, an advanced AI-powered solution designed to automate repetitive tasks within the media production workflow, today announced that it will demonstra...
01/04/2025
Globecast has expanded its use of Net Insight's Nimbra technology by deploying Nimbra Edge, significantly streamlining its media transport operations. This ...
01/04/2025
EdgePeak enables software architects and developers to design and build their own content delivery network (CDN) while reducing streaming costs, fighting video...
01/04/2025
Cinnafilm to preview the innovation at the 2025 NAB Show
Cinnafilm, a global leader in video optimization, has collaborated with NVIDIA to unveil a groundbreak...
01/04/2025
Leading video software provider Synamedia, will showcase its innovation-driven approach to solving the biggest challenges facing customers today and in the futu...
01/04/2025
AJA Debuts IP and 12G-SDI Innovations Ahead of NAB 2025
Brie Clayton April 1, 2025
0 Comments
New tools optimize media and entertainment and proAV wo...
01/04/2025
Bit Part Introduces bitbox mini, the Smallest and Lightest Solution for Ultra-Lo...
01/04/2025
IABM Unveils Bold Transformation at NAB Show, Prioritizing Member Value
Brie Clayton April 1, 2025
0 Comments
IABM is delivering a strategic transform...
01/04/2025
OOONA Introduces Multilingual QC Tool for Subtitling Workflows
Brie Clayton April 1, 2025
0 Comments
See OOONA on booth W4209 at the NAB Show, Las Veg...
01/04/2025
Adopting open standards, the solution aims to provide workflow standardisation, allowing for automation and other innovations across a diverse range of markets
...
01/04/2025
Submissions will be accepted up until 23:59 PST on 2nd April
By Jenny Priestley
Published: March 24, 2025 Updated: April 1, 2025
Submissions will be acc...
01/04/2025
The AI issue takes a look at how AI is reshaping broadcasting, including areas such as sports commentary and archiving and storage, plus we discover how Norways...
01/04/2025
Joining the company with more than two decades of experience forging and scaling alliances in the industry, Wastcoats role will support TVUs strategic developme...
01/04/2025
At the beginning of the year, Rich Welsh, senior vice president with Deluxe, was appointed the new president of Society of Motion Picture and Television Enginee...
01/04/2025
STAMFORD, Conn. and NEW YORK Charter's Spectrum pay TV operations are continuing its previously announced strategy of adding more streaming services to its ...
01/04/2025
HUNT VALLEY, Md. Sinclair, Inc. and its subsidiary, ONE Media Technologies, have announced that members of their leadership team will be participating in multip...
01/04/2025
01 04 2025 - Media release Bus Stop Films' first feature Boss Cat to begin production in June
Boss Cat cast (L-R): Olivia Hargroder, Penny Downie and Juli...
01/04/2025
PremiumBeat - Flexible, Unlimited Music For Creators
Brie Clayton March 31, 2025
0 Comments
Back in November of 2024, PremiumBeat made a bold move tha...
01/04/2025
MLB 2025: TNT Sports Chooses Remote Production for MLB Tuesday,' Upgrades C...
01/04/2025
SVG All-Stars: Francisco Contreras, Executive Director, Field Operations, FOX Sp...
01/04/2025
MILTON drones get a boost with Rohde & Schwarz SIGINT integration Rohde & Schwarz and MILTON have partnered to integrate advanced signals intelligence technol...
01/04/2025
Rohde & Schwarz presents comprehensive R&S ELEKTRA portfolio for reproducible, s...
01/04/2025
Create Complex Compositions with Unlimited Layers with FOR-A MixBoard Powered by ClassX...
01/04/2025
Article courtesy of Digital Production Germany
Read the article
Digital Production Germany magazine editor, Bela Beier, recently talked to Nara's Steve Br...
01/04/2025
Article courtesy of Digital Media World
Read the article
Light Iron uses Nara to handle file navigation, content streaming and information sharing workflow ef...
01/04/2025
Article courtesy of British Cinematographer
Read the article
DoP Don Burgess, VFX supervisor Kevin Baillie and colourist Maxine Gervais pulled their talents t...
01/04/2025
Polesi ski made a name for himself early in his career. Renowned for his attention to detail and ability to mix his creative and technical skills, Polesi ski st...
01/04/2025
visionOS 2.4 is available today, bringing the first set of powerful Apple Intelligence features that help users communicate, write, and express themselves on Ap...
01/04/2025
Facebook
Twitter
LinkedIn
Defence Science and Technology Agency (DSTA) and...
31/03/2025
Ready, set, Party Time!' SBS News empowers young voters with a new politica...
31/03/2025
31 January, 2024
Company News
Tokyo, January 31, 2024 - Hitachi, Ltd. (TSE:6501) today announced the following executive
changes to improve corporate value....
31/03/2025
MELBOURNE, Fla., March 31, 2025 - L3Harris Technologies (NYSE: LHX) has complete...
31/03/2025
Vice Admiral Jan Willem Hartman, commander of the Dutch Materiel and IT Command, and Chris Aebli, President, Tactical Communications, L3Harris Technologies, sig...
31/03/2025
The L3Harris team visited HMS GLASGOW, the first T26 Global Combat Ship, current...
31/03/2025
SEATTLE As the WNBA prepares to kick off the 2025 season, the Seattle Storm WNBA team has announced a multi-year deal with Sinclair's KOMO and KUNS station...
31/03/2025
Digital Nirvana, a provider of leading-edge AI-powered media solutions, today announced a global Alliance Partnership with Avid to bring advanced AI metadata c...
31/03/2025
BeckTV, a premier systems integrator for the broadcast media industry, today announced that Kate Gazdic has joined the company as a senior procurement specialis...
31/03/2025
MainConcept, a leading provider of video and audio codecs, has announced a series of key codec advancements that enable customers to realize significant time an...