Alex L. Zhang | Recursive Language Models

Alex L. Zhang, Alex L. Zhang

October 15, 2025 at 07:43 PM

Joy (70%)

positive

Alex L. Zhang | Recursive Language Models

Key Takeaways

Introduction of Recursive Language Models (RLMs) as an inference strategy for handling unbounded context/output lengths.
RLMs mitigate 'context rot' by allowing LLMs to recursively interact with their context stored in an environment (e.g., a Python REPL).
An RLM using GPT-5-mini significantly outperformed standard GPT-5 on the OOLONG long-context benchmark while being cheaper per query.
RLMs surpassed other methods like ReAct + indexing on a new Deep Research task and maintained performance with over 10M tokens.
The authors predict RLMs will be the next significant milestone in general-purpose inference scaling after Chain-of-Thought and ReAct models.

The research introduces Recursive Language Models (RLMs), a novel inference strategy designed to enable language models to process essentially unbounded input and output context lengths while actively mitigating 'context rot,' the phenomenon where recall degrades as context grows. RLMs function as a thin wrapper around an LLM, allowing it to spawn recursive calls for intermediate computation, effectively treating the user's prompt context as a variable stored in an environment like a Python REPL. A specific implementation using GPT-5-mini demonstrated superior results, achieving more than double the correct answers compared to GPT-5 on the difficult OOLONG long-context benchmark, all while being more cost-effective per query. The RLMs also outperformed other advanced techniques, such as ReAct plus test-time indexing, on a newly constructed Deep Research task derived from BrowseComp-Plus. Crucially, the models showed no performance degradation even when tested with inference times involving 10 million or more tokens. The authors suggest that RLMs, especially those explicitly trained for recursive reasoning, are poised to become the next major paradigm shift in general-purpose inference scaling, following CoT and ReAct models.

neutral

TechCrunch Disrupt 2025 Bundle Sale Ends Tomorrow | Founder & Investor Passes

The deadline to purchase the discounted Founder and Investor Bundles for TechCrunch Disrupt 2025 is tomorrow, October 3rd, at 11:59 p.m. PT.

TechCrunch Events

Technology Conferences, Venture Capital, Startup Fundraising +2

TechCrunch

30%Oct 2

neutral

Huawei details open-source AI development roadmap at Huawei Connect 2025

Huawei announced plans to fully open-source its entire AI software stack, including the CANN toolkit and Mind series tools, by the end of 2025 to address developer friction with its Ascend infrastructure.

Dashveenjit Kaur

Artificial Intelligence, Open Source Software, Technology Policy +2

AI News

40%Sep 29

positive

Reply's pre-built AI apps aim to fast-track AI adoption

Reply has introduced 'Prebuilt' AI apps to help enterprises overcome the slow and complex challenges of large-scale AI deployment.

David Thomas

Artificial Intelligence, Enterprise Software, Digital Transformation +2

AI News

30%Sep 30

neutral

Rising AI demands push Asia Pacific data centres to adapt, says Vertiv

The rapid adoption of AI in the Asia Pacific region is drastically increasing energy and cooling demands, forcing data centre operators to move from incremental upgrades to building purpose-built "AI factory" facilities.

Muhammad Zulhusni

Data Center Infrastructure, Artificial Intelligence, Energy Consumption +2

AI News

50%Sep 30

negative

Why AI phishing detection will define cybersecurity in 2026

A Reuters/Harvard experiment showed AI chatbots can create highly effective phishing emails, highlighting the accelerating threat posed by AI-powered cybercrime.

TechForge

Cybersecurity, Artificial Intelligence, Phishing Attacks +2

AI News

70%Oct 1

Alex L. Zhang | Recursive Language Models

Key Takeaways

Related Articles

TechCrunch Disrupt 2025 Bundle Sale Ends Tomorrow | Founder & Investor Passes

Huawei details open-source AI development roadmap at Huawei Connect 2025

Reply's pre-built AI apps aim to fast-track AI adoption

Rising AI demands push Asia Pacific data centres to adapt, says Vertiv

Why AI phishing detection will define cybersecurity in 2026