Recursive Self-Improvement AI Gains Traction, Google Pours Cold Water, While DeepSeek and Others Approach the Fringes

marsbitPublished on 2026-06-06Last updated on 2026-06-06

Abstract

The term "recursive self-improvement" (RSI), where AI improves itself autonomously, is gaining momentum in the AI industry. Startups like Recursive Superintelligence and projects such as Andrej Karpathy's Auto-Research aim to create systems where AI designs, implements, and validates its own research, moving toward superintelligence. While Google CEO Sundar Pichai cautions that such exponential acceleration is not yet a reality, progress is evident. For instance, Anthropic reported its Claude Code writes nearly 100% of the team's code, though it still lacks true self-direction. Analysts frame RSI development in stages: "adequacy" (systems functioning without humans), "parity" (matching human research quality), and "supremacy" (exceeding human-AI collaboration). Reaching parity could trigger rapid, unpredictable advancement due to AI's continuous operation. In China, companies like DeepSeek and Baidu incorporate self-optimization techniques without explicitly branding them as RSI, focusing on algorithmic efficiency and reinforcement learning. However, challenges remain, including "model collapse" from training on AI-generated data and the immense computational and open-collaboration requirements. Ultimately, RSI represents a trend of increasing automation in AI development, potentially reducing human oversight in the creation process itself.

"Recursion" has recently become a hot topic in the AI community.

Two startups have directly used this term as their company names, and many labs have started inserting the three-letter acronym RSI—which stands for Recursive Self-Improvement—into their roadmaps. Much like AGI, RSI is becoming an exciting yet unsettling industry buzzword, even though a unified definition hasn't been fully established.

(Image source: X)

What is RSI? Simply put, it's about enabling AI to train itself. In the tech world, RSI has long been regarded as a major indicator of artificial intelligence progress, alongside memory, reasoning, and multimodality. The only limitation is computing power; humans are no longer a necessary condition, not even as assistants.

Sounds like science fiction, or perhaps, sounds dangerous? But upon calm reflection, this isn't the first wave of fervor in the AI industry. From AlphaGo in 2016 to ChatGPT in 2023, and now the parameter arms race among major models, the industry's nature is to chase the next "game-changer." In the view of Leike Technology AGI, RSI might be the next carnival.

RSI Gains Traction: When AI Can "Recursively" Construct Itself

In May of this year, renowned AI researcher Richard Socher high-profile founded a new company called Recursive Superintelligence, its name directly referencing RSI.

He stated: "Our core goal is to build a truly recursive self-improving superintelligence. The entire process of research conception, implementation, and validation will be automated."

Another case that has garnered more buzz within the circle is a project called Auto-Research advanced by Andrej Karpathy: using agent clusters to train language models, enabling models to perform simple research tasks and improve themselves autonomously.

Image source: github

‌

Andrej Karpathy is also a legendary figure, having left significant contributions at Tesla in autonomous driving and at OpenAI on GPT. Now he's all-in on RSI as his next stop, and he's advancing it with remarkable transparency, indicating he genuinely believes it's achievable.

Interestingly, he has been unusually candid about this project, regularly updating progress on Twitter and maintaining a public GitHub repository for the code. Of course, Andrej Karpathy himself has noted that current work is still iterating on small models at the GPT-2 level, "not yet breakthrough research (for now)," but this is enough to drive a large number of researchers to follow suit.

More importantly, Andrej Karpathy recently joined Anthropic's pretraining team. Anthropic has Claude, Karpathy has the auto-research methodology; combining the two, large model + self-training loop, if proven successful, would move beyond the small-scale experiments of GPT-2.

Image source: haimagazine

Another company called Adaption launched a tool called AutoScientist, aiming to automate the training process of frontier models. The logic aligns with Andrej Karpathy's auto-researchers: training agents for incremental improvement. However, Adaption's ambition is greater, seeking to directly establish a complete training loop for a full-scale frontier model.

These two represent two different approaches: Andrej Karpathy is verifying from the ground up piece by piece, building momentum within the community while open-sourcing; Adaption is directly targeting commercial large-model training scenarios with a stronger drive for practical application. Whichever path succeeds first will have distinctly different impacts on the entire industry.

Google CEO Pours Cold Water: We're Not There Yet

Regarding RSI, opinions among AI industry leaders vary widely.

Last month, in a podcast, Google CEO Sundar Pichai acknowledged reality with quite cautious wording: "(RSI) is a continuum. We are indeed all making progress. But if we talk about RSI in the way people describe it, that represents the next level of acceleration, with many implications, but we're not at that stage yet."

Even so, the term "continuum" here contains much food for thought.

In January this year, a programmer leading Claude Code development at Anthropic admitted that nearly 100% of the team's code was written by Claude Code, which is AI literally writing itself. It's not AI assisting engineers in writing code; it's an AI tool, to some extent, already replacing engineers in writing its own code.

Image source: Anthropic

Anthropic conducted an internal survey about the Mythos preview version: among 18 engineers, five believed that if the supporting systems were improved, this version of Mythos could replace an L4 engineer—a mid-level programmer capable of independently handling complex projects without real-time supervision.

However, the weaknesses were also clearly stated: "The main weaknesses reported by Claude include: fuzzy tasks beyond one management cycle, understanding organizational priorities, taste, verification, instruction following, and epistemology." In other words, its weaknesses lie precisely in the aspects of self-motivation, which is the foundation of RSI.

Interestingly, the Georgetown Center for Security and Emerging Technology (CSET) last year assembled a group of experts specifically to study RSI. These experts exhibited a clear split in their assessments. Some anticipated an imminent "superintelligence explosion," while others expected slower progress that would eventually reach a plateau.

But they shared one consensus: recursion makes the future exceptionally difficult to predict.

To this end, an article by METR researcher Ajeya Cotra breaks down the RSI process into several milestones. I find this to be the most useful analytical framework currently available.

The first level is called "Adequacy": After completely removing humans, the system can still conduct research—even if inferior to humans, it can function.

The second level is called "Parity": Research conducted independently by AI matches the quality of research conducted independently by humans.

The third is called "Supremacy": The performance of an independent AI system surpasses that of a human-AI collaborative system.

It's somewhat akin to L2, 3, 4, 5 in autonomous driving. Ajeya Cotra's assessment is: we are already very close to the first level. But when the second level will arrive, she didn't provide a timeline. However, she offered a clear deduction: once the second level arrives, subsequent acceleration will far exceed past rates, "potentially reaching the third level within a year."

Why so fast? Because at the moment the second level is achieved, AI becomes a research team that doesn't need sleep, meetings, or KPI alignment. It can experiment, modify, and re-experiment 24/7. Humans conducting research, no matter how efficient, only have a few hours of effective deep work per day, interspersed with countless interruptions and communication overhead. Once this bottleneck disappears, the acceleration is cliff-like.

No One in China is Shouting RSI, But DeepSeek and Others Have Already Touched the Edges

After discussing overseas progress, you might wonder: what about China?

Frankly, domestic companies rarely publicly shout about RSI. Overseas AI companies can write "recursive superintelligence" into their mission statements; such a thing is almost unimaginable in China. But when it comes to having AI improve itself, domestic players have already quietly approached the edges on different paths.

The most typical example is DeepSeek. They spend an order of magnitude less money than OpenAI, yet can compete head-on in many reasoning tasks. This is achieved through extreme optimization of algorithmic efficiency—MoE architecture, extreme compression of activation parameters, and engineering refinement of training strategies.

Although this isn't directly related to RSI, it represents a path of using smarter methods to replace brute-force computational power. And this path is precisely one of the core logics of RSI: enabling models to find smarter paths through iteration.

On the Baidu Wenxin side, reinforcement learning-driven model self-optimization has become routine. Although they don't use the RSI label, they are doing the same thing: allowing models to continuously improve through self-feedback loops on specific tasks. From this perspective, domestic companies are not ignoring RSI; they have already incorporated certain aspects of RSI into daily engineering practice, just without the label.

(Image source: gemini generation)

Of course, objective gaps exist. The talent density at OpenAI and Anthropic is still unmatched by any domestic company, meaning that in the exploration of RSI, the current status is still one of following.

But historical experience shows that domestic companies often exhibit astonishing speed in catching up once "the path is clearly defined." The RSI framework is being increasingly dissected by overseas experts, and Karpathy's code is publicly available on GitHub. Once a reproducible path is established, the cost control capabilities and application scenario density of domestic players will become a variable severely underestimated by the market.

Simultaneously, we must also pour some appropriate cold water. In reality, data generated by AI used to train the next version of AI tends to degrade in quality. The logic of RSI is that AI generates good data, which is then used to train the next generation of AI, making it stronger.

But the actual situation might be reversed. AI-generated data often mixes in its own hallucinations, biases, and quality degradation. These second-hand data are fed to the next version, which then produces even worse third-hand output. After a few cycles, the entire system collapses, akin to a photocopier repeatedly copying copies; by the tenth copy, the image is blurred beyond recognition.

Academia refers to this as model collapse, and papers have already verified this phenomenon's existence.

Furthermore, the ideal environment required for RSI doesn't exist in the real world. For this system to function, two prerequisites are indispensable: infinite computing power and a globally open, collaborative research ecosystem.

The reality is that the cost of training a frontier model has reached the billion-dollar scale. Chip capacity is limited, energy is limited, high-quality data is diminishing, and export controls and technological decoupling are fragmenting AI research into several non-interconnected circles. When people and goods can't flow freely, and these basic conditions can't be met, talking about RSI is premature.

RSI is no longer just a technical problem; it requires a sufficiently open world. Whether this prerequisite can be established is beyond the control of the tech community.

In Conclusion

Finally, here's an interesting observation I've made: Over the past five years, the industry first lured people into "parameter worship" through large-scale pretraining, then convinced them that "values can be fine-tuned" with RLHF, and now RSI tells a story of "machines completing the entire R&D chain autonomously." Each step pushes humans one step back—not out of the industry, but out of the decision-making chain.

Although this retreat isn't necessarily a bad thing, it is irreversible. Once a certain link is automated, human intuition, experience, and judgment in that link gradually deteriorate, much like how navigation ability declines when you stop using GPS.

By then, we might not even truly understand how the tools we use are made.

Weekly Token Unlock: HOME Unlocks Over 20% of Circulating Supply

Weekly Token Unlock: HOME to Release Over 20% of Circulating Supply This week, the project DeFi.app (HOME) is scheduled to unlock 790 million tokens, valued at approximately $41.26 million. DeFi.app aims to simplify DeFi trading by enabling users to purchase tokens on any blockchain, execute instant cross-chain swaps, access decentralized derivatives markets, and find yield opportunities—all without paying gas fees or using bridges, while maintaining full self-custody of assets. The accompanying chart details the specific release schedule for these tokens.

marsbit35m ago

Weekly Token Unlock: HOME Unlocks Over 20% of Circulating Supply

marsbit35m ago

US Stocks Suffer Worst Plunge Since 2025: Three Triggers Ignite Tech Stock Valuation Reset

The US stock market experienced its most severe sell-off since the 2025 tariff crisis on June 5th, 2025. The Nasdaq Composite plummeted 4.18%, the S&P 500 fell 2.64%, and the Dow Jones dropped 695 points. The panic stemmed from three converging factors. First, Broadcom's earnings report ignited fears of a slowdown in AI growth. While its AI chip revenue surged 143% YoY to $10.8B, its Q3 AI revenue guidance of $16B fell short of the $17.2B consensus. This triggered a massive sector-wide sell-off, with the Philadelphia Semiconductor Index crashing 10.26% and semiconductor stocks losing roughly $1.3 trillion in market value in a single day. Second, a shockingly strong May jobs report crushed hopes for Federal Reserve rate cuts. Non-farm payrolls added 172,000 jobs, doubling expectations. This robust data, combined with persistently high oil prices above $92/barrel due to the ongoing Iran war and blockade of the Strait of Hormuz, drastically increased market expectations for a potential Fed rate hike instead of a cut. Higher interest rates compress the valuations of growth-heavy tech stocks. Third, the prolonged Iran conflict continues to fuel inflationary pressures, complicating the Fed's policy decisions and undermining the "inflation is tamed" narrative. Together, these events challenged the twin pillars of the market rally: the "limitless AI growth" story and expectations for imminent monetary easing. The sell-off spread globally, impacting Asian and European markets and cryptocurrencies. The article posits this is likely a severe "valuation repricing" rather than the end of the AI story. The underlying demand for AI remains strong, but investor expectations for growth speed and the prices they are willing to pay are being recalibrated. Key upcoming factors include the June FOMC meeting, future AI company earnings, and developments in the Iran conflict.

marsbit2h ago

US Stocks Suffer Worst Plunge Since 2025: Three Triggers Ignite Tech Stock Valuation Reset

marsbit2h ago

From Madison Square Garden to Kalshi: Prediction Markets Break into the NBA Finals

From Madison Square Garden to Kalshi: Prediction Markets Break into the NBA Finals Prediction markets are playing a significant role in the 2026 NBA Finals, particularly around the New York Knicks' unexpected 2-0 series lead. Platforms like Kalshi and Polymarket have seen massive trading volumes, exceeding hundreds of millions of dollars on championship and related markets. Their influence extends beyond online trading. Kalshi's official partnership with Madison Square Garden has given it prominent physical branding at the arena. Furthermore, local businesses like The Jeffrey bar are using prediction market contracts to hedge the risk of game-result-based promotions, turning potential losses into manageable costs—a concept similar to the famous "Mattress Mack" strategy from traditional sports betting. These markets differentiate themselves by offering a wider, more entertainment-focused range of "event contracts" beyond typical game outcomes, such as predicting celebrity attendance. They also have broader accessibility across the U.S. compared to age- and location-restricted traditional sportsbooks. However, their rapid integration into sports raises regulatory and ethical questions. The NBA is cautiously engaging, discussing integrity frameworks with regulators like the CFTC. While the league permits minor investments like Giannis Antetokounmpo's stake in Kalshi, it advocates for strict rules to prevent insider trading. Many fans express concern on platforms like Reddit, fearing that the close ties between prediction markets, the league, and players could compromise the game's integrity. The NBA Finals has thus become a high-stakes testing ground, showcasing prediction markets' commercial potential while challenging traditional boundaries between financial trading, entertainment, and gambling.

marsbit4h ago

Anthropic Warns Globally, OpenAI Has Crossed the 'Reliability Threshold': AI Self-Acceleration Activated

Anthropic issues a global warning, urging a pause in AI research due to fears of recursive self-improvement where AI accelerates its own development, nearing a "self-building" tipping point. Simultaneously, OpenAI's Yann Dubois provides a key insight: AI's perceived "jump" in usefulness stems from crossing a "reliability threshold." Before this point, AI is an unreliable toy; after (around Dec 2023 for OpenAI), it becomes a dependable tool, triggering self-acceleration. This is evident as AI now assists in its own R&D, boosting researcher productivity. Dubois argues AI development is more "craft" than pure science, relying heavily on intuition. He highlights "the last-mile AI dividend": if current models were frozen, focused development on vertical application harnesses (orchestration systems connecting AI to real-world data and permissions) could deliver AGI-like performance in many domains. The main bottleneck isn't model intelligence, but integration—granting access, connecting data, and embedding into workflows. However, major challenges remain, like enabling continual learning so AI improves with experience rather than plateauing. For startups, the opportunity lies in solving these intricate, real-world integration problems—the hard work of bringing powerful models down to ground level.

marsbit4h ago

Anthropic Warns Globally, OpenAI Has Crossed the 'Reliability Threshold': AI Self-Acceleration Activated

marsbit4h ago

FBI Cracks Down On Alleged ISIS Crypto Donors, 3 Americans Arrested

bitcoinist5h ago

Trading

Spot

Futures

Hot Articles

Audiera: The AI Agent Network Powering the Web4 Entertainment Economy

Audiera is a dual-platform Web4 entertainment ecosystem combining a mobile rhythm experience and a lightweight Telegram mini-game, powered by AI interaction and an on-chain creator economy.

40.1k Total ViewsPublished 2026.03.11Updated 2026.03.11

Audiera: The AI Agent Network Powering the Web4 Entertainment Economy

The Cornerstone of the Autonomous AI Economy: How Talus is Reshaping On-Chain Intelligent Agents

Talus is a decentralized AI Agent framework built on the Sui, designed to solve the structural problems of current AI systems: centralization, opacity, and a lack of native economic identity.

42.9k Total ViewsPublished 2026.03.18Updated 2026.03.18

The Cornerstone of the Autonomous AI Economy: How Talus is Reshaping On-Chain Intelligent Agents

In-depth Analysis of AI and Crypto: The Era of Symbiosis between Algorithms and Ledgers

By 2026, the integration of artificial intelligence and cryptocurrency has advanced from proof-of-concept to a new stage of "system-level integration".

2.0k Total ViewsPublished 2026.03.26Updated 2026.03.26

In-depth Analysis of AI and Crypto: The Era of Symbiosis between Algorithms and Ledgers

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of AI (AI) are presented below.

Recursive Self-Improvement AI Gains Traction, Google Pours Cold Water, While DeepSeek and Others Approach the Fringes

Abstract

RSI Gains Traction: When AI Can "Recursively" Construct Itself

Google CEO Pours Cold Water: We're Not There Yet

No One in China is Shouting RSI, But DeepSeek and Others Have Already Touched the Edges

In Conclusion

Related Questions

Related Reads

Weekly Token Unlock: HOME Unlocks Over 20% of Circulating Supply

US Stocks Suffer Worst Plunge Since 2025: Three Triggers Ignite Tech Stock Valuation Reset

From Madison Square Garden to Kalshi: Prediction Markets Break into the NBA Finals

Anthropic Warns Globally, OpenAI Has Crossed the 'Reliability Threshold': AI Self-Acceleration Activated

FBI Cracks Down On Alleged ISIS Crypto Donors, 3 Americans Arrested

Trading

Hot Articles

Audiera: The AI Agent Network Powering the Web4 Entertainment Economy

The Cornerstone of the Autonomous AI Economy: How Talus is Reshaping On-Chain Intelligent Agents

In-depth Analysis of AI and Crypto: The Era of Symbiosis between Algorithms and Ledgers

Discussions

Top Questions

Hot Categories

Hot Tags