Anthropic Cries Wolf: Is the AGI Threat Real, or Just an IPO Story?

marsbitPublished on 2026-06-05Last updated on 2026-06-05

Abstract

Anthropic has published an article titled "When AI builds itself," discussing the emerging concept of "recursive self-improvement," where AI begins to actively participate in designing, training, testing, and optimizing its own subsequent versions. The company presents internal data showing that by May 2026, over 80% of code merged into its codebase was written by Claude, its AI model. Claude's capabilities have expanded to handling complex, open-ended engineering tasks, achieving a 76% success rate in such areas, and even contributing to research processes, such as optimizing code performance and conducting AI safety experiments. Anthropic outlines an evolution from human-driven development to AI-assisted workflows, culminating in the current stage where AI agents can autonomously write, run, and delegate code. The company cautions that the path toward a "closed loop," where AI continuously improves itself, is becoming visible. It calls for coordinated global mechanisms to potentially slow or pause frontier AI development to allow safety research and societal structures to catch up. However, the timing of this warning coincides with Anthropic's preparations for an IPO, framing the narrative not just as a safety concern but also as a demonstration of Claude's advanced capabilities and its integral role in accelerating Anthropic's own R&D—creating a potential "flywheel" effect for competitive advantage. This contrasts with OpenAI's recent, more policy-oriented discussion of ...

By | Alphabet AI

Anthropic published a lengthy article last night titled "When AI builds itself," which sounds like a science fiction novel by Asimov, and indeed deals with a sci-fi concept: recursive self-improvement.

Simply put, in the past, human researchers wrote code, ran experiments, and trained models to make AI stronger. But if AI starts to participate in designing, training, testing, and optimizing its own successors, then the speed of AI progress is no longer driven solely by humans—it may begin to "self-evolve."

To this end, Anthropic made a plea:

"We believe it would be beneficial for the world if there were an option to slow down or temporarily pause frontier AI development, allowing societal structures and alignment research to catch up with technological progress."

This statement sounds like a safety warning, but in the context of Anthropic preparing for an IPO, it's hard not to see it as another kind of narrative setup: Claude is so good, it's even starting to create the next generation of Claude itself.

A New Storm Has Emerged

To illustrate that AI is increasingly involved in AI research and development itself, Anthropic presented substantial internal data.

For instance, as of May 2026, over 80% of the code merged into Anthropic's codebase was written by Claude. Before the release of Claude Code, this number was only in the single digits.

By the second quarter of 2026, according to Anthropic's statistics, the daily volume of code merged by engineers was about 8 times higher than in 2024.

More notable than the code volume is that Claude is handling more open-ended engineering problems.

Anthropic stated in the article that over the past year, the frequency with which employees had to correct Claude, steer it back on track, or take over tasks mid-way has been steadily declining. This change is happening not only for simple tasks but also for the most complex, open-ended tasks.

So-called open-ended tasks are problems without clear instructions. For example, a system crash, a training task failure—issues where even engineers themselves don't know what the solution looks like initially and have to troubleshoot and make judgments on the fly.

These types of tasks historically relied most heavily on human experience. Yet, in those most open-ended tasks, Claude's success rate reached 76% by May 2026, a 50 percentage point increase within six months.

Not just writing code, Anthropic also uses Claude for code review—checking for bugs, security vulnerabilities, and other defects. Their retrospective analysis found that if every code change in the past had undergone automated review by Claude, approximately one-third of the bugs that caused incidents on claude.ai could have been caught before deployment.

Going a step further, Claude has begun to participate in the research process.

Anthropic has a standard test: give Claude code for training a small model and ask it to make the code run faster without altering the results. In May 2025, Claude Opus 4 could achieve about a 3x speedup; by April 2026, Claude Mythos Preview had pushed that number to approximately 52x.

Anthropic also mentioned an open-ended AI safety research case. They posed a question to a Claude-powered agent: Can a weaker model reliably supervise a stronger model?

This process involved proposing hypotheses, testing them, sharing findings with parallel agents, and iterating repeatedly.

Two human researchers spent a week bridging about 23% of the gap; Claude, with roughly 800 cumulative hours and about $18,000 in compute costs, bridged 97%.

This result certainly has limitations—the problem was chosen by humans, the scoring criteria were human-defined, and the findings haven't been fully migrated to production-scale models. But it still illustrates that Claude can now, within a research framework defined by humans, design experiments, execute them, and iterate on its own.

Furthermore, when human researchers "go down the wrong path," Claude can suggest a better next step.

Anthropic took 129 internal Claude Code research sessions where human researchers and Claude worked together on open-ended research problems. Anthropic identified points where "the human later proved to have taken a detour," gave the context up to that point to different versions of Claude, and asked it what it would suggest doing next. Then, another Claude judge, aware of the full session outcome, judged which was better: the model's suggestion or the human's choice at the time.

The results showed that at those points where the human researcher was later shown to have had room for improvement, Claude became increasingly able to propose a better next step.

In the past, AI model progress was primarily driven by human researchers and engineers. Humans decided what experiments to run, wrote the code, trained the models, and pushed forward AI's capabilities.

Now, more and more links in this chain are being taken over by Claude.

Anthropic presented a very intuitive stage diagram:

From 2021 to 2023, Anthropic was no different from a typical tech company—humans writing code and documentation on laptops.

From 2023 to 2025, chatbots began entering workflows. Engineers had models generate code snippets, then copied them into editors.

From 2025 to 2026, programming agents emerged. Claude began autonomously writing and modifying code, sometimes even completing entire files independently.

Today, agents can run code on their own and delegate hours-long work to other agents.

Looking ahead is the stage Anthropic is genuinely concerned about: the closed loop.

If this day arrives, subsequent versions of Claude might be continuously improved by Claude itself—this is recursive self-improvement.

Anthropic phrased it cautiously: we haven't reached that point yet, and recursive self-improvement isn't inevitable. But it still emphasizes that the path leading to that step is beginning to become visible.

That's why Anthropic discusses slowing down, even pausing, at the end of the article. Its meaning isn't that all AI companies should shut down immediately, but rather that if the risks of AI self-improvement continue to rise in the future, frontier labs need a coordinated, verifiable deceleration mechanism.

In other words, the "singularity" is approaching, and humanity must impose controls.

Unstoppable Claude

On the surface, this is a very forward-looking safety document. Anthropic is talking about recursive self-improvement, about AI potentially improving itself faster and faster, and about the need for human society to prepare deceleration and pause mechanisms in advance.

But placed in the context of Anthropic preparing for an IPO, this article takes on another layer of meaning.

In a way, Anthropic's recent moves resemble that annoyingly smug top student in class—it genuinely has the skills, but it's also quite pretentious.

What it wants to say isn't just "we have a very strong Claude"; a step beyond that, it wants to say "Claude is helping us build an even stronger Claude."

If Anthropic were merely selling a model or a tool, it would struggle to completely escape horizontal comparisons: Anthropic has Claude, OpenAI has GPT; Anthropic has Claude Code, OpenAI has Codex; Anthropic competes for enterprise clients, OpenAI competes for enterprise clients. The competition between the two companies is very tight, seeing who can tell the bigger story to the market.

It's worth noting that just three days ago, OpenAI wrote in a document about frontier AI governance:

"We are already seeing early signs of recursive self-improvement in today's systems: AI development itself is being accelerated by AI.

This will intensify competitive pressures among developers and nations, and create governance challenges that existing institutions are not equipped to handle."

Three days later, Anthropic says: The path for Claude towards recursive self-improvement is beginning to become visible.

If Claude develops as it hopes, this wouldn't be an ordinary product narrative—it would become a research and development flywheel.

Claude writes code, runs experiments, optimizes training processes, which in turn reduces incidents in Anthropic's own products… Once this system is up and running, Claude isn't just a product from Anthropic; it's a crucial production tool for Anthropic itself.

Users see the Claude product; enterprise customers buy Claude's capabilities. But what Anthropic truly wants the capital markets to notice is: Claude is already embedded in the underlying processes of frontier model development; it's been placed inside Anthropic's engine room.

Capital markets love flywheel stories, promising endless prosperity: A stronger Claude allows Anthropic's engineers to merge more code; more code enables faster product and infrastructure iteration; faster iteration allows researchers to run more experiments; more experiments in turn help the next generation of Claude become stronger. Once the next generation Claude is stronger, it continues to accelerate Anthropic's R&D.

Claude's iteration pace also supports this flywheel. Looking at public release timelines, from 2023 to early 2025, Claude's major model updates were mostly on a three-to-four month cycle. But with Claude 4, Anthropic's model updates have noticeably intensified.

Claude 4 was released in May 2025, Opus 4.1 in August, Sonnet 4.5 in September, Haiku 4.5 in October, Opus 4.5 in November.

In 2026, Opus 4.6 was released on February 5, Sonnet 4.6 on February 17, Opus 4.7 on April 15, and Opus 4.8 on May 28. The gap between Opus 4.7 and Opus 4.8 was only 42 days.

Anthropic, on the surface, is saying "this could be very dangerous, we need to prepare the brakes in advance," but it's simultaneously implying: "We've seen what happens when the accelerator is pressed."

The subtlety of the IPO narrative lies here. It describes the risks as significant while also elevating its own technological position.

Not every AI company is qualified to discuss recursive self-improvement. You first need to make the outside world believe your AI is already part of the AI R&D process to have the standing to say this might require global coordination.

OpenAI: How Could This Happen?

As mentioned earlier, just before Anthropic published this lengthy article, OpenAI had already put recursive self-improvement on the table.

But the two companies' narratives are quite different.

OpenAI's document, "Democratic Governance of Frontier AI," is a policy blueprint for Washington. It's concerned not with "how models get stronger," but with how to constrain frontier AI if it continues to surge ahead.

Most of the content in that report isn't suitable for detailed discussion here, but one key line stands out: OpenAI said that in today's systems, early signs of recursive self-improvement are already visible.

This line and Anthropic's lengthy article point in the same direction.

It's just that OpenAI talks about institutions, while Anthropic talks about itself.

OpenAI's point is: AI development is too fast; existing governance structures may not keep up, so a new set of rules is needed.

Anthropic directly showcases that system, telling the market: Claude is already in our R&D process, so we see the path to AI self-acceleration.

This move is quite clever. One imagines the grumbling inside OpenAI—this is practically idea theft! We were here first!

Just joking, but OpenAI really needs to step up its game and quickly bring GPT 5.6 to the table.

Trending Cryptos

CitreaCTR

wrapped stUSDTWSTUSDT

Velodrome FinanceVELODROME

Crypto payments for gray-market peptides up by 700% in a year – Details

The use of cryptocurrency payments, primarily Bitcoin and stablecoins, for gray-market peptides in the United States surged by 700% in a year, according to a recent report. These peptides, often sold as wellness or anti-aging aids, exist in a regulatory gray area. Facing tighter restrictions from traditional banks, vendors increasingly accept crypto, with some Chinese suppliers accepting only Bitcoin. This has attracted mainstream consumers seeking cheaper access, even those new to cryptocurrency. Data from Chainalysis shows gray-market peptide vendors received $32 million in crypto in Q1 2026, projecting an annual run rate over $100 million. TRM Labs noted a similar trend, estimating $41.4 million in crypto inflows to Chinese peptide sellers in 2025. Analysts highlight this involves many first-time crypto users, distinct from historical darknet market activity, and underscores crypto's role in facilitating cross-border value transfer. This growth occurred alongside a broader downturn in general retail crypto transaction volumes in early 2026.

ambcrypto25m ago

Crypto payments for gray-market peptides up by 700% in a year – Details

ambcrypto25m ago

human.tech Launches Clean SDK for Privacy-First Web3 Apps

human.tech has launched the Clean SDK, a toolkit enabling developers to build privacy-first Web3 applications with transparent accountability. Released alongside Aztec's version 5, the SDK provides components for integrating zero-knowledge identity verification, sanctions screening, and private transactions, without developers handling sensitive user data or building compliance infrastructure from scratch. It uses zero-knowledge proofs and programmable verification to allow apps to confirm user legitimacy and sanctions compliance while keeping identities confidential. The first application built on the SDK, Shield, a privacy bridge to Aztec, also launched. It allows users to transfer assets privately while proving a unique human is behind each transfer and that funds have passed sanctions checks, as verified by a May 2026 audit. The SDK offers three core verification techniques: Proof of Innocence (sanctions screening against 23 sources), Proof of Personhood (simpler verification via Human Passport), and Proof of Clean Hands (higher-assurance zero-knowledge government ID checks). This allows apps to authenticate users and transactions without exposing personal data. Designed for Aztec builders, the SDK lets developers add programmable privacy to decentralized apps, eliminating the need to create their own verification and ZK infrastructure. Shield demonstrates its practical use for private bridges, but the SDK aims to enable a wider ecosystem of private, accountable financial apps and services. The launch addresses growing demand for infrastructure that balances privacy and accountability. The SDK avoids traditional identity databases, storing encrypted data off-chain, screening at both entry and exit points, and including a gated disclosure mechanism for legal requests. human.tech's products, including the Clean SDK, focus on using zero-knowledge technology to enable verifiable personhood and privacy in digital systems.

TheNewsCrypto32m ago

human.tech Launches Clean SDK for Privacy-First Web3 Apps

TheNewsCrypto32m ago

Unlocking $100 Million in Liquidity? Pump.fun's New Policy Tests the 5-Minute Pump Technique

Pump.fun, a popular meme coin launchpad, has introduced a new standard mechanism called BOOST. It aims to address a significant capital efficiency issue: when a newly launched token graduates from its initial bonding curve to a liquidity pool (LP), roughly 20% of its liquidity becomes permanently locked as "dead liquidity," estimated to waste over $100 million annually. Instead of locking these funds permanently, BOOST repurposes them. Upon a token's migration, approximately 20% of the settlement funds (e.g., 17.6 SOL or ~$2516 USDC) are used to buy back the token on the open market over a 5-minute period via a Time-Weighted Average Price (TWAP) mechanism. All purchased tokens are immediately burned. This creates a brief, systematic buy pressure immediately after migration, potentially generating a short-term price surge ("pump") while permanently reducing the token's circulating supply. The goal is to enhance the immediate post-launch trading experience, potentially increasing trader retention and sustainable protocol revenue, which funds ongoing token buybacks. However, concerns exist that this artificial 5-minute boost could lower the barrier for launching low-quality tokens and lead to steeper price crashes once the buy pressure stops, if followed by large sell-offs. The feature automatically applies to tokens migrating after July 21, 2024, but not to previously migrated tokens or those launched via the Mayhem AI Agent lab.

marsbit39m ago

Unlocking $100 Million in Liquidity? Pump.fun's New Policy Tests the 5-Minute Pump Technique

marsbit39m ago

Selling Block Space is Dead: Public Chains Must Find a New Way

"Blockchain infrastructure as a neutral, generalized platform is no longer a viable business model. This article argues that simply selling block space has failed, evidenced by the fact that only one public blockchain, Hyperliquid, was among 14 crypto firms surpassing $200M in annual revenue recently. Hyperliquid's revenue vastly outpaces others like Arbitrum, highlighting a crisis. The future lies in public chains pivoting away from neutrality to become product studios, application distributors, payment rails, or vertical SaaS providers, directly serving paying customers. The piece also covers key industry events: a major exploit on Ostium, attributed to vulnerabilities in its off-chain price oracle, underscores the critical security needs for protocols bridging off-chain markets. Additionally, it discusses the necessity of abstracting complex financial instruments like options. Experts argue that to achieve mass adoption, options should be packaged as user-friendly products—such as yield vaults, binary options, or structured products—rather than marketed with their technical complexity. Finally, the radar section notes developments including Yearn's fixed-rate lending product Flex, community backlash over Base's strategy changes, the launch of a DXY perpetuals DEX called Plether, and Starknet's focus on institutional-grade privacy and quantum resistance."

marsbit59m ago

Selling Block Space is Dead: Public Chains Must Find a New Way

marsbit59m ago

Podcast Notes | Conversation with GSR Asset Management Head: To Determine if This Crypto Rally is Real, Just Watch the Lending Rates on Aave

Podcast Summary: Dialogue with GSR's Head of Asset Management: To Determine if This Crypto Rally is Real, Just Check Lending Rates on Aave Andy Baehr, Managing Director of Asset Management at GSR, discusses the current crypto market, characterizing it as stuck in a state of "ambivalence" with short-lived, unsustainable rallies. He outlines a simple framework: the market moves between "ambivalence" and "conviction" (sustained upward momentum). Currently, every rally resembles a single-stage rocket booster that quickly fizzles out. Baehr identifies three key signals to watch: 1) DeFi lending rates, 2) the potential passage of the CLARITY Act, and 3) the market forming a consensus on the "Fed hawkish peak." He emphasizes that the most immediate indicator for the sustainability of the recent CPI-triggered rally is the USDC borrowing rate on Aave, currently around 3.75%—close to U.S. Treasury yields. The absence of a credit spread indicates low leverage demand and a lack of market energy. He explains that a healthy, sustained rally requires layered buying pressure. Last year's rally progressed from an ETH short squeeze to crypto-native trader influx and finally to ETF inflows. Currently, this structure is missing. Other potential structural buyers like Digital Asset Treasury (DAT) companies are absent, and ETF flows have proven transient. Baehr notes that while small-cap crypto tokens outperformed large caps in Q2—a potential sign of capitation in major assets—capital is also flowing to more exciting opportunities like AI stocks and tech IPOs, leaving crypto sidelined. Regarding DeFi, he highlights that platforms like Aave provide a clear, real-time signal of leverage demand through their supply/demand-driven interest rates. A significant, sustained rate increase would signal genuine market conviction. He also observes the quiet emergence of fixed-income-like products and vaults in DeFi. On regulation, the probability of the CLARITY Act passing before the August 7th deadline has dropped linearly from 75% to below 40% on Polymarket. Baehr suggests its passage would be treated as a bullish surprise, a potent driver for price movement. However, political hurdles, including ethical clause debates and disclosures about the First Family's crypto profits, remain significant obstacles. Ultimately, the market awaits clarity on the Fed's terminal rate under Chair Warsh. Until the "Fed Solstice"—the point where the market collectively understands the peak of hawkish policy—sustained conviction will be difficult to achieve.

marsbit1h ago

Podcast Notes | Conversation with GSR Asset Management Head: To Determine if This Crypto Rally is Real, Just Watch the Lending Rates on Aave

marsbit1h ago

Trading

Spot

Hot Articles

How to Buy NIGHT

Welcome to HTX.com! We've made purchasing Midnight (NIGHT) simple and convenient. Follow our step-by-step guide to embark on your crypto journey.Step 1: Create Your HTX AccountUse your email or phone number to sign up for a free account on HTX. Experience a hassle-free registration journey and unlock all features.Get My AccountStep 2: Go to Buy Crypto and Choose Your Payment MethodCredit/Debit Card: Use your Visa or Mastercard to buy Midnight (NIGHT) instantly.Balance: Use funds from your HTX account balance to trade seamlessly.Third Parties: We've added popular payment methods such as Google Pay and Apple Pay to enhance convenience.P2P: Trade directly with other users on HTX.Over-the-Counter (OTC): We offer tailor-made services and competitive exchange rates for traders.Step 3: Store Your Midnight (NIGHT)After purchasing your Midnight (NIGHT), store it in your HTX account. Alternatively, you can send it elsewhere via blockchain transfer or use it to trade other cryptocurrencies.Step 4: Trade Midnight (NIGHT)Easily trade Midnight (NIGHT) on HTX's spot market. Simply access your account, select your trading pair, execute your trades, and monitor in real-time. We offer a user-friendly experience for both beginners and seasoned traders.

4.0k Total ViewsPublished 2025.12.08Updated 2026.06.02

Hot Tokens Learning Week 3: “UNIfication” Governance Proposal Will End on December 25 | NIGHT May Serve as a Growth Catalyst for Cardano in 2026

The “UNIfication” governance proposal will conclude on December 25, with community sentiment remaining strongly optimistic.

41.3k Total ViewsPublished 2025.12.23Updated 2025.12.23

Hot Tokens Learning Week 3: “UNIfication” Governance Proposal Will End on December 25 | NIGHT May Serve as a Growth Catalyst for Cardano in 2026

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of NIGHT (NIGHT) are presented below.