{"id":3835,"date":"2025-10-11T21:52:23","date_gmt":"2025-10-11T21:52:23","guid":{"rendered":"https:\/\/violethoward.com\/new\/samsung-ai-researchers-new-open-reasoning-model-trm-outperforms-models-10000x-larger-on-specific-problems\/"},"modified":"2025-10-11T21:52:23","modified_gmt":"2025-10-11T21:52:23","slug":"samsung-ai-researchers-new-open-reasoning-model-trm-outperforms-models-10000x-larger-on-specific-problems","status":"publish","type":"post","link":"https:\/\/violethoward.com\/new\/samsung-ai-researchers-new-open-reasoning-model-trm-outperforms-models-10000x-larger-on-specific-problems\/","title":{"rendered":"Samsung AI researcher's new, open reasoning model TRM outperforms models 10,000X larger \u2014 on specific problems"},"content":{"rendered":"



The trend of AI researchers developing new, small open source generative models that outperform far larger, proprietary peers continued this week with yet another staggering advancement.<\/p>\n

Alexia Jolicoeur-Martineau<\/b>, Senior AI Researcher at the Samsung Advanced Institute of Technology (SAIT)<\/b> in Montreal, Canada, has introduced the Tiny Recursion Model (TRM)<\/b> \u2014 a neural network so small it contains just 7 million parameters (internal model settings), yet it competes with or surpasses cutting-edge language models 10,000 times larger in parameter count, including OpenAI's o3-mini and Google's Gemini 2.5 Pro<\/b>, on some of the toughest reasoning benchmarks in AI research.<\/p>\n

The goal is to show that highly capable new AI models can be created affordably, without the massive investments in graphics processing units (GPUs) and power needed to train the larger, multi-trillion-parameter flagship models powering many LLM chatbots today. The results were described in a research paper published on the open-access website arxiv.org, titled "Less is More: Recursive Reasoning with Tiny Networks<\/i>."<\/p>\n

"The idea that one must rely on massive foundational models trained for millions of dollars by some big corporation in order to solve hard tasks is a trap," wrote Jolicoeur-Martineau on the social network X. "Currently, there is too much focus on exploiting LLMs rather than devising and expanding new lines of direction."<\/p>\n


Jolicoeur-Martineau added: "With recursive reasoning, it turns out that 'less is more'. A tiny model pretrained from scratch, recursing on itself and updating its answers over time, can achieve a lot without breaking the bank."<\/p>\n

TRM's code is available now on GitHub under an enterprise-friendly, commercially viable MIT License \u2014 meaning anyone, from researchers to companies, can take it, modify it, and deploy it for their own purposes, including commercial applications.<\/p>\n

One Big Caveat<\/b><\/h3>\n

However, readers should be aware that TRM was designed specifically to perform well on structured, visual, grid-based problems like Sudoku, mazes, and puzzles from the ARC-AGI (Abstraction and Reasoning Corpus) benchmark, which offers tasks that should be easy for humans but difficult for AI models, such as sorting colors on a grid based on a prior, but not identical, solution.<\/p>\n

From Hierarchy to Simplicity<\/b><\/h3>\n

The TRM architecture represents a radical simplification. <\/p>\n

It builds upon the Hierarchical Reasoning Model (HRM)<\/b>, a technique introduced earlier this year that showed small networks could tackle logical puzzles like Sudoku and mazes.<\/p>\n

HRM relied on two cooperating networks\u2014one operating at high frequency, the other at low\u2014supported by biologically inspired arguments and mathematical justifications involving fixed-point theorems. Jolicoeur-Martineau found this unnecessarily complicated.<\/p>\n

TRM strips these elements away. Instead of two networks, it uses a single two-layer model<\/b> that recursively refines its own predictions. <\/p>\n

The model starts from an embedded question x<\/b>, an initial answer y<\/b>, and a latent reasoning state z<\/b>. Through a series of reasoning steps, it updates the internal latent representation z<\/b> and refines the answer y<\/b> until it converges on a stable output. Each iteration corrects potential errors from the previous step, yielding a self-improving reasoning process without extra hierarchy or mathematical overhead.<\/p>\n
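To make the refinement loop concrete, here is a minimal sketch of a single pass in PyTorch-style Python. The class name TinyRecursiveStep, the two-layer network, the residual-style updates, and the number of latent updates are illustrative assumptions drawn from the description above, not the paper's exact implementation.<\/p>\n

```python
# Minimal sketch of one TRM-style refinement pass (illustrative, not the official code).
import torch
import torch.nn as nn

class TinyRecursiveStep(nn.Module):
    """One recursive refinement pass over (question x, answer y, latent z)."""

    def __init__(self, dim: int, n_latent_updates: int = 6):
        super().__init__()
        # A single small (here: two-layer) network is reused for every update.
        self.net = nn.Sequential(
            nn.Linear(3 * dim, dim),
            nn.GELU(),
            nn.Linear(dim, dim),
        )
        self.n_latent_updates = n_latent_updates  # assumed count, for illustration

    def forward(self, x: torch.Tensor, y: torch.Tensor, z: torch.Tensor):
        # Repeatedly refine the latent reasoning state z, conditioning on the
        # embedded question x and the current answer y.
        for _ in range(self.n_latent_updates):
            z = z + self.net(torch.cat([x, y, z], dim=-1))
        # Then refine the answer y once, using the updated latent state.
        y = y + self.net(torch.cat([x, y, z], dim=-1))
        return y, z
```

Because the same tiny network is reused for every update, additional reasoning capacity comes from iteration rather than from extra parameters.<\/p>\n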

How Recursion Replaces Scale<\/b><\/h3>\n

The core idea behind TRM is that recursion can substitute for depth and size.<\/i><\/p>\n

By iteratively reasoning over its own output, the network effectively simulates a much deeper architecture without the associated memory or computational cost. This recursive cycle, run over as many as sixteen supervision steps, allows the model to make progressively better predictions \u2014 similar in spirit to how large language models use multi-step \u201cchain-of-thought\u201d reasoning, but achieved here with a compact, feed-forward design.<\/p>\n

The simplicity pays off in both efficiency and generalization. The model uses fewer layers, no fixed-point approximations, and no dual-network hierarchy. A lightweight halting mechanism<\/b> decides when to stop refining, preventing wasted computation while maintaining accuracy.<\/p>\n
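A rough sketch of that outer loop, reusing the TinyRecursiveStep sketch above, might look like the following. The cap of 16 iterations mirrors the supervision steps mentioned earlier, while the sigmoid halting head and the 0.5 threshold are assumptions for illustration; the released code may differ in detail.<\/p>\n

```python
# Illustrative outer loop: refine up to 16 times, stopping early when a small
# halting head (assumed here to be a sigmoid scorer over z) signals confidence.
import torch
import torch.nn as nn

def refine_with_halting(step: nn.Module, halt_head: nn.Module,
                        x: torch.Tensor, y: torch.Tensor, z: torch.Tensor,
                        max_steps: int = 16, threshold: float = 0.5) -> torch.Tensor:
    for _ in range(max_steps):
        y, z = step(x, y, z)                         # one recursive refinement pass
        p_halt = torch.sigmoid(halt_head(z)).mean()  # scalar confidence in [0, 1]
        if p_halt > threshold:                       # stop refining once confident
            break
    return y

# Example wiring with hypothetical dimensions:
# step = TinyRecursiveStep(dim=512)
# halt_head = nn.Linear(512, 1)
# refined_answer = refine_with_halting(step, halt_head, x, y, z)
```

Stopping early on easy inputs is what keeps the recursion cheap: simple puzzles can halt after a few passes, while harder ones use the full budget.<\/p>\n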

Performance That Punches Above Its Weight<\/b><\/h3>\n

Despite its small footprint, TRM delivers benchmark results that rival or exceed those of models orders of magnitude larger. In testing, the model achieved:<\/p>\n