AI Volution reports on the latest artificial intelligence Volution news and insights. Industry trends AI analysis, updates on AI technologies AI Volution reports on the latest artificial intelligence Volution news and insights. Industry trends AI analysis, updates on AI technologies
  • Home
  • Ai Business
    Ai BusinessShow More
    A Never-Ending Race of AI Innovations, Features, and What’s Coming Next
    A Never-Ending Race of AI Innovations, Features, and What’s Coming Next
    9 Min Read
    AI’s business truth — essentials for today’s enterprise executives
    AI’s business truth — essentials for today’s enterprise executives
    8 Min Read
    Alibaba launches updated Qwen chatbot amid falling model prices
    Alibaba launches updated Qwen chatbot amid falling model prices
    5 Min Read
    Onton gets $7.5M to push AI shopping beyond furniture
    Onton gets $7.5M to push AI shopping beyond furniture
    5 Min Read
  • Ai News
    Ai NewsShow More
    Trump, GOP Target State AI Rules in Renewed Push
    Donald Trump, GOP Target State AI Rules in Renewed Push
    5 Min Read
    Tiny Samsung model outshines giant LLMs in reasoning tests
    Tiny Samsung model outshines giant LLMs in reasoning tests
    6 Min Read
    Microsoft’s Copilot AI chatbot will exit WhatsApp on January 15
    Microsoft’s Copilot AI chatbot will exit WhatsApp on January 15
    2 Min Read
    xAI plans small solar farm next to Musk’s Colossus data center
    xAI plans small solar farm next to Musk’s Colossus data center
    3 Min Read
  • Ai Startups
    Ai StartupsShow More
    JustiGuide brings AI support to U.S. Immigration Gavigation
    JustiGuide brings AI support to U.S. Immigration Navigation
    4 Min Read
  • Ai Trends
    Ai TrendsShow More
    Character AI Shifts Toward Interactive Story Experiences for Children
    Character AI Shifts Toward Interactive Story Experiences for Children
    3 Min Read
  • ChatGpt
    ChatGptShow More
    ChatGPT’s Voice Mode Is No Longer A Separate Interface
    ChatGPT’s Voice Mode Is No Longer A Separate Interface
    2 Min Read
  • DeepSeek
    DeepSeekShow More
    DeepSeek and Alibaba Push for more explicit rules in AI Governance
    DeepSeek and Alibaba Push for more explicit rules in AI Governance
    5 Min Read
  • Google Ai
    Google AiShow More
    Google Rolls Out Its Latest Artificial Intelligence Model
    Google Rolls Out Gemini 3, Latest Artificial Intelligence Model
    4 Min Read
  • OpenAi
    OpenAiShow More
    What OpenAI and Google Predict About AI Reshaping GTM Approaches
    What OpenAI and Google Predict About AI Reshaping GTM Approaches
    4 Min Read
    OpenAI Says Adam Evaded Safeguards Before Suicide
    OpenAI Says Adam Evaded Safeguards Before Suicide
    4 Min Read
  • More
    • About Us
    • Contact Us
    • Our Mission
    • Privacy Policy
    • Terms of Service
Reading: Tiny Samsung model outshines giant LLMs in reasoning tests
Share
AI Volution reports on the latest artificial intelligence Volution news and insights. Industry trends AI analysis, updates on AI technologies
Latest AI News, AI Tools and Ai AnalysisLatest AI News, AI Tools and Ai Analysis
Font ResizerAa
  • DeepSeek
  • ChatGpt
  • Ai Trends
  • OpenAi
Search
  • Home
    • Home 1
    • Home 2
    • Home 3
    • Home 4
    • Home 5
  • Demos
  • Categories
    • DeepSeek
    • ChatGpt
    • Ai Trends
    • OpenAi
  • Bookmarks
  • More Foxiz
    • Sitemap
Have an existing account? Sign In
Follow US
Home » Blog » Tiny Samsung model outshines giant LLMs in reasoning tests
Ai News

Tiny Samsung model outshines giant LLMs in reasoning tests

Tiny Samsung model outshines giant LLMs in reasoning tests
Kanwal Rubab
Last updated: December 1, 2025 7:22 pm
Kanwal Rubab
Published: December 1, 2025
Share

A new paper from a Samsung AI researcher describes how a small network can outperform gigantic Large Language Models (LLMs) at complex reasoning.

In the contest to build the world’s most powerful computers, big has always been a reliable stand-in for fast. It’s no secret that tech behemoths have spent billions developing models of ever-more staggering proportions. Still, Alexia Jolicoeur-Martineau from Samsung SAIL Montréal believes an entirely new, more efficient approach can be developed with the Tiny Recursive Model (TRM).

With only 7 million parameters, which is less than 0.01% the size of current state-of-the-art LLMs, TRM achieves new state-of-the-art results on notoriously challenging benchmarks, including the ARC-AGI intelligence test. Samsung’s effort is a direct assault on the accepted wisdom that only size matters when advancing AI model capabilities, presenting a more sustainable, parameter-efficient alternative.

Overcoming the limits of scale

Tiny Samsung model outshines giant LLMs in reasoning tests

Though LLMs have demonstrated impressive capabilities in generating human-like text, they can be fragile when performing complex, multi-step reasoning. Since they produce answers token-by-token, a single mistake in the chain affects all downstream outputs, causing an incorrect final output.

Techniques like Chain-of-Thought, where a model “thinks out loud” to unravel a problem, have been attempted in response. But these methods are usually computationally costly, so it is often necessary to manage large amounts of reasoning data, the quality of which can be far from perfect, leading to incorrect logic. Accordingly, even with these extensions, the LLM still struggles to solve puzzles that require perfect logical reasoning.

Samsung’s research is based on a new AI model called the Hierarchical Reasoning Model (HRM). We tested two new networks introduced by HRM, both of which were performing a recursive function at different speeds to improve the answer. It seemed quite powerful but was also complicated; it hinged on speculative biological arguments and complex fixed-point theorems that were not necessarily relevant.

Instead of HRM’s two networks, TRM employs a single, small network that iteratively refines both its internal “reasoning” and its predicted “answer”.

The model is fed with the question, an initial answer guess and a kernel latent reasoning feature. It first iterates over a few steps to condition its latent reasoning on all three inputs. Then, based on this improved logic, it revises its prediction for the final answer. This whole procedure can be iterated up to 16 times, thus allowing the model to progressively self-correct mistakes in a parameter-efficient way.

A surprising finding of the study was that even a tiny network with only two layers generalised across data sets much better than any four-layer version. This model shrinkage seems to prevent overfitting, which is often the case when you train deep models on small datasets.

TRM also (and its good guys being even worse than the baddies makes for a rewarding shift) ditches the abstruse maths that underpinned its precursor. The original HRM model could only be fully justified at the regime acceptance by assuming that its functions tended to a fixed point.

TRM circumvents this altogether by back-propagating directly through its entire recursion. Even this simple modification alone led to a massive leap in  performance, rising from 56.5% to 87.4% accuracy on the Sudoku-Extreme benchmark during an ablation study.

Samsung’s model crushes AI benchmarks using less computing power

The results speak for themselves. Our result on the challenging Sudoku-Extreme dataset, which has only 1,000 training examples, is 87.4% Test accuracy, which is a remarkable improvement over HRM’s 55%. On Maze-Hard, a long path-finding task over 30×30 mazes, TRM scores 85.3% against HRM’s 74.5%.

Most importantly, TRM takes significant leaps on the Abstraction and Reasoning Corpus (ARC-AGI), a benchmark designed to test accurate fluid intelligence in AI. Although there are only 7M parameters, TRM is already powerful, achieving accuracies of 44.6%/7.8% on ARC-AGI-1/ARC-AGI-2, respectively. It performs better than HRM, which employs a 27M-parameter model, and surpasses many of the world’s largest LLMs. By contrast, Gemini 2.5 Pro scores only 4.9% on ARC-AGI-2.

The training of TRM has also been accelerated. An adaptive mechanism, known as ACT, that learns when the model has “considered” an answer for long enough and can pass it on to another data sample, has been simplified, so that a second, expensive forward pass through the network is no longer required at each training step. This decision was taken at the expense of a minimal difference in final generalisation.

This research from Samsung is a perfect piece to argue against the trend of making AI models ever larger. It demonstrates that, with architectures capable of iterative reasoning and self-correction, we can solve very hard problems using a minuscule fraction of computational resources.

Share This Article
Facebook Email Print
FacebookLike
XFollow
PinterestPin
YoutubeSubscribe

LATEST NEWS

banner
FOXIZ MAGAZINE
The Most Flexible WordPress Theme, Design Anything & No Coding Knowledge Required.
Buy Now →
JustiGuide brings AI support to U.S. Immigration Gavigation

JustiGuide brings AI support to U.S. Immigration Navigation

Kanwal Rubab
Kanwal Rubab
November 28, 2025
Onton gets $7.5M to push AI shopping beyond furniture
Google Rolls Out Gemini 3, Latest Artificial Intelligence Model
DeepSeek and Alibaba Push for more explicit rules in AI Governance
xAI plans small solar farm next to Musk’s Colossus data center

You Might Also Like

Trump, GOP Target State AI Rules in Renewed Push
Ai News

Donald Trump, GOP Target State AI Rules in Renewed Push

5 Min Read
Microsoft’s Copilot AI chatbot will exit WhatsApp on January 15
Ai News

Microsoft’s Copilot AI chatbot will exit WhatsApp on January 15

2 Min Read
AI Volution reports on the latest artificial intelligence Volution news and insights. Industry trends AI analysis, updates on AI technologies
  • Review
  • Best Product
  • Contact
  • Reading List
  • Customize Interests

We influence 20 million users and is the number one business and technology news network on the planet.

[mc4wp_form]

Contact US

  • Contact
  • Blog
  • Complaint
  • Advertise

Quick Link

  • Gadget
  • PC hardware
  • Review
  • Software

© Foxiz News Network. Ruby Design Company. All Rights Reserved.

Follow US on Socials

Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?