Breaking News: The "Worker's Edition" Claude 5 Is Here, Everyone Can Use It

marsbitPublished on 2026-07-01Last updated on 2026-07-01

Abstract

BREAKING: Claude Sonnet 5, dubbed "Fennec," is now the default model for all Free and Pro users. This mid-tier model boasts the strongest Agent capabilities in the Sonnet line yet, with performance rivaling the flagship Opus 4.8. It features autonomous planning and can utilize browser and terminal tools—capabilities previously exclusive to costly, large models. Key benchmarks highlight significant gains over its predecessor, Sonnet 4.6, in reasoning, tool use, coding, and knowledge work. Sonnet 5 scores 63.2% on SWE-bench Pro (surpassing GPT-5.5's 58.6%), 80.4% on Terminal-Bench 2.1, and 57.4% on Humanity's Last Exam (just 0.5% behind Opus 4.8). It even slightly outperforms Opus 4.8 in some knowledge tasks. Anthropic positions it as delivering ~90% of Opus's capability at a fraction of the cost. Pricing is aggressive: a limited-time promotional rate of $2 per million input tokens and $10 per million output tokens (reverting to $3/$15 after August 31). This undercuts Opus 4.8 ($5/$25) and GPT-5.5 ($5/$30). However, a new tokenizer may increase token counts by 1.0-1.35x, affecting final costs post-promotion. Notably, Sonnet 5 excels in security, with a mere 0.93% browser injection attack success rate, outperforming Mythos 5 and Opus 4.8. Its prompt injection defense matches Opus 4.8 at 0.19%. Launching amid uncertainty around the region-restricted Fable 5, Sonnet 5 is globally available. It targets the mid-market, offering near-flagship performance at a competitive price, e...

Just now, Claude Sonnet 5 has arrived!

Code name: Fennec, the fennec fox, the smallest fox in the Sahara Desert.

This is Anthropic's Sonnet model with the strongest Agent capabilities to date, with performance close to that of the flagship Opus 4.8.

Effective immediately, Sonnet 5 becomes the default model for all Free and Pro users.

It can autonomously plan and invoke browser and terminal tools.

Just a few months ago, this required spending a lot of money to invoke super-large models; now, Sonnet easily achieves it.

Compared to the previous generation Sonnet 4.6, Sonnet 5 shows significant performance improvements in reasoning, tool use, programming, and knowledge work tasks.

Key points:

SWE-bench Pro score of 63.2%, surpassing GPT-5.5's 58.6%, slightly behind Opus 4.8's 69.2%

'Humanity's Last Exam' score of 57.4%, only 0.5 percentage points behind Opus 4.8

Standard pricing: $3 per million input tokens / $15 per million output tokens, only 60% of Opus 4.8's price

Browser injection defense: 0.93% success rate, beating both Mythos 5 and Opus 4.8

Interestingly, Fable 5 was also revealed to be making a comeback on the same day. But the cost is mandatory real-name verification, and it will most likely be limited to US users.

Sonnet 5, on the other hand, promises to hold nothing back, and is available globally for all users to use openly starting today.

On Par with Opus 4.8 Across the Board, the Strongest Worker AI Launches a Surprise Attack

This sudden launch of Sonnet 5 also helps fill the void left by the unavailability of Fable 5.

For many developers, the year one of the Agent era began with Sonnet.

Claude Sonnet 3.5, 3.6, and 3.7 were among the earliest models to demonstrate astonishing abilities in writing code and using tools.

In other words, the concept of "letting AI do the work itself" was first proven feasible by the Sonnet "medium cup" series.

But over the past year or so, the most dramatic leaps in capability have been concentrated on the Opus "large cup" line. Sonnet was left directly behind by the flagship.

What Sonnet 5 aims to do is close this gap!

Anthropic sets the tone with one sentence – Claude Sonnet 5 is the most capable "worker" Sonnet in history.

Looking at its real-world performance scores best illustrates this point.

In its traditional stronghold of programming, Sonnet 5 impressively scores 63.2% on SWE-bench Pro. The previous Sonnet 4.6 only managed 58.1%, while Opus 4.8 currently leads with 69.2%.

In contrast, OpenAI's flagship GPT-5.5 only scores 58.6% on the same benchmark, and Google's Gemini 3.5 Flash scores just 55.1%.

Terminal-Bench 2.1 performance is even more ferocious. Sonnet 5 skyrockets to 80.4%, leaving Sonnet 4.6's 67.0% far behind with a huge 13 percentage point jump. It's less than 2 points away from Opus 4.8's 82.7%.

On the cross-disciplinary reasoning benchmark dubbed 'Humanity's Last Exam', Sonnet 5 with tools achieves 57.4%, compared to Opus 4.8's 57.9%—a mere 0.5 percentage point difference. GPT-5.5 scores only 52.2% on the same test, and Gemini 3.1 Pro scores 51.4%.

In computer control capabilities, Sonnet 5 scores 81.2% on OSWorld-Verified, again surpassing GPT-5.5's 78.7% and closely trailing Opus 4.8's 83.4%.

More surprisingly, in knowledge work, Sonnet 5 scores 1618 on GDPval-AA v2, directly overtaking Opus 4.8's 1615.

In agent search and tool use performance, Sonnet 5 provides Opus 4.8-level capabilities at the lowest cost.

It can be said that in almost every benchmark, Sonnet 5's performance falls within the 90% to 100% range of Opus 4.8's scores.

It's practically like paying Sonnet's price for 90% of Opus's brainpower.

$2 Limited-Time Promotion, But With a Hidden Pitfall

The price is this release's real "killer feature".

For API pricing, Anthropic is offering a limited-time promotion: $2 per million tokens for input, $10 per million tokens for output.

After August 31st, the price reverts to the original $3 for input and $15 for output.

In comparison, Opus 4.8 is priced at $5 and $25, and GPT-5.5 Standard is $5 and $30.

During the promotion period, both input and output prices are only 40% of Opus 4.8's. Even after the standard price resumes, it's only 60%.

However, while Anthropic appears full of sincerity on the surface, there's a little trick hidden in the details.

The reason is that Sonnet 5 uses a completely new tokenizer. The number of tokens for the same input text may inflate by a factor of 1.0 to 1.35.

Once the promotion period ends, the original price of $3/$15 combined with the tokenizer inflation effect will definitely make the real spending sting a bit more than using Sonnet 4.6.

But even so, compared to Opus, it's still a crushing difference.

Counterattacking All Flagship Models in the Family

The System Card hides one of Sonnet 5's most underestimated aspects.

Prompt injection attack success rate: 0.19%, on par with Opus 4.8. GPT-5.5 is at 3.08%, Gemini 3.5 Flash is at 6.66%.

In browser injection defense, the attack success rate is only 0.93%, while Mythos 5 is at 29.7% and Opus 4.8 at 31.5%.

A $2 mid-range model has counterattacked and defeated all flagship models in the family; with protective measures enabled, it drops directly to 0%.

For malicious code injection, Sonnet 4.6 had a high attack success rate of 45.26%. Sonnet 5 has reduced this to 0.29%, an improvement of 150 times.

In the Firefox 147 vulnerability exploitation test, Mythos 5 could write usable exploits 88.4% of the time, Opus 4.8 at 8.8%, and Sonnet 5 at 0.0%. It can write top-tier business code, but can't write a single usable exploit.

A side effect is a misalignment behavior score of 2.53 (out of 10), an improvement over Sonnet 4.6's 2.89, but higher than Opus 4.8's 2.10 and Mythos Preview's 1.95.

It has become stronger, and also more opinionated.

Not Competing for the Crown, Specializing in Cutting Down the Mid-Tier

Sonnet 5 occupies an incredibly precise position. Its upward-facing capabilities approach those of Opus 4.8 and GPT-5.5, while its downward-facing price is close to the level of Gemini 3.5 Flash.

Just as OpenAI doubled its prices compared to the previous generation, Anthropic turned around and pushed Sonnet 5's entry price down to $3.

Developers who were previously hesitant about paying for a flagship now have a lethally powerful alternative.

While everyone else is focusing on fighting at the top, Anthropic has fired a shot at the mid-tier.

Developer Wallets Voted Tonight

Now, Sonnet 5's performance has stepped into the flagship range; most tasks like fixing bugs, adding tests, or refactoring can be handled in one go.

The awkward situation where Opus felt too expensive to use, but Sonnet wasn't good enough, is gone as of today.

It's more cost-effective. The same budget that could previously run only one Opus-level Agent can now run two or three parallel Sonnets.

The cost barrier for multi-Agent architectures has been kicked lower by Sonnet 5.

When Fable 5 will make its kingly return is still unknown.

But Sonnet 5 is already standing firmly here right now, with its performance pushed right up to Opus's doorstep.

For the vast majority of developers, it is the most capable and most usable Claude to have on hand for quite some time to come.

References:

https://x.com/claudeai/status/2072017450611142835

https://www.anthropic.com/news/claude-sonnet-5

This article is from the WeChat public account "New Zhiyuan", author: ASI Revelation

Claude Gets a 'Staff Card' Now, From Anthropic

"Claude now has an 'official position' as Anthropic introduces 'Claude Tag', an AI team member with its own identity in Slack. Unlike traditional AI assistants that borrow user permissions, Claude Tag operates with its own dedicated accounts and audit trails across systems like Slack, GitHub, and data warehouses—a model Anthropic calls 'agent identity'. This allows the AI to act as a shared colleague within a channel, learning from team context and even proactively addressing issues in 'eavesdrop mode'. The key shift is that permissions are tied to the channel, not individual users, enabling anyone in a channel (like a PM without repo access) to leverage Claude's capabilities. Anthropic reports using this internally, with AI generating most of their product team's code. As AI agents proliferate in enterprises, this dedicated identity model addresses the scaling and security challenges of managing non-human 'employees'."

marsbit3m ago

Claude Gets a 'Staff Card' Now, From Anthropic

marsbit3m ago

Edel Finance loses $403K as flash-loan oracle exploit hits xStock lending reserves

Edel Finance, a programmable market layer for tokenized equities, suffered a $403,000 exploit. An attacker manipulated the wrapped xStocks (wGOOGLx) exchange rate via a flash loan, briefly inflating the collateral's value 78-fold to borrow far beyond its true worth. Although the protocol acted promptly to limit losses, the attack revealed critical oracle and collateral pricing vulnerabilities in tokenized lending markets. The exploit severely impacted liquidity, causing total value locked (TVL) to plummet from around $630,000 to roughly $947 as users withdrew funds. A net outflow of approximately $630,000 followed. Recovery hinges on restoring user confidence through deposit growth, stabilized TVL, and stronger oracle protections and risk management practices.

ambcrypto20m ago

Edel Finance loses $403K as flash-loan oracle exploit hits xStock lending reserves

ambcrypto20m ago

Crypto Payment Cards with $1.5 Billion Monthly Transaction Volume, Stuck in the 1990s

Monthly crypto payment card transaction volume has reached $15 billion, but the industry's development stage is comparable to debit cards in the 1990s, before they became a mainstream financial staple. A key limitation is the lack of established daily financial relationships, such as direct salary deposits and recurring bill payments, with crypto wallets. Despite annualized transaction volumes of approximately $18 billion, the market is concentrated and immature. The leading provider, RedotPay, commands over half the market share. User adoption is heavily skewed towards emerging markets like Bangladesh, India, and Nigeria, where access to USD and stable financial services is limited, rather than developed economies. The sector features four primary business models: 1) Card-issuing infrastructure providers, 2) Exchange-affiliated cards for user retention, 3) Decentralized wallet/DeFi cards with self-custody but high complexity, and 4) Stablecoin-focused digital banks, which dominate transaction volume by offering integrated financial services. The article argues that a pure payment functionality is insufficient for long-term success, mirroring the historical trajectory of traditional debit cards. Future winners will need to: 1) Control the upstream flow of funds, 2) Secure defensible niches in underserved markets, and 3) Most crucially, build core account relationships that integrate into users' daily financial lives. Without this evolution, crypto cards risk remaining niche prepaid tools rather than becoming universal financial infrastructure.

Foresight News21m ago

Crypto Payment Cards with $1.5 Billion Monthly Transaction Volume, Stuck in the 1990s

Foresight News21m ago

$7.8 Billion in Theft and Losses Reveals the Truth: Security Costs Have Become an Unavoidable Liquidity Tax for DeFi

"7.8 Billion in Thefts Reveals the Truth: Security Costs Have Become DeFi's Unavoidable 'Liquidity Tax'" A summary of Q2 2026 data reveals that security risks are now a fundamental capital cost in DeFi, directly impacting user returns and liquidity decisions. DeFiLlama recorded 88 hacking incidents with quantified losses totaling $780.3 million in Q2. April was the worst month with $644.8 million lost. DeFi protocol attacks accounted for $735.8 million, while cross-chain bridge exploits resulted in $354.4 million in losses (note: some event categorizations overlap). Cumulatively, DeFi hacks have reached $7.85 billion, with bridge losses at $3.26 billion. The quarter highlighted two primary risk categories: high-value infrastructure vulnerabilities (e.g., bridges, oracles, admin keys) causing massive single losses, and more frequent contract logic bugs. This signals a critical market shift: from post-incident analysis to preemptive pricing of risk. Users and liquidity providers now implicitly factor in the security of the entire asset pathway—not just pool APY—into their decisions. This hidden "risk premium" manifests through wider spreads, higher liquidity incentives, and capital migration towards perceived safer routes. Cross-chain bridge risks, responsible for over $353 million in Q2 losses, exemplify this change. Asset routing credibility is now part of the transaction. Following incidents like KelpDAO and THORChain, markets are demanding safer bridges, asset insurance, and clearer risk disclosure, increasing the cost of capital for riskier pathways. Consequently, security spending is transforming from a defensive cost into a core distribution cost for attracting liquidity. Protocols must invest more in audits, bug bounties, real-time monitoring, and insurance to remain competitive. Users are increasingly demanding transparency about fund flow paths, associated risks, and contingency plans. The key indicators for the industry's direction will be whether capital continues consolidating in trusted channels, if projects delay launches for enhanced audits, if insurance premiums rise, and if aggregators start displaying security risk metrics. Q2 2026 may be remembered not just as a bad period, but as the point when DeFi underwent a fundamental asset risk repricing, where security became a persistent,隐性 tax on all on-chain activity.

Foresight News50m ago

$7.8 Billion in Theft and Losses Reveals the Truth: Security Costs Have Become an Unavoidable Liquidity Tax for DeFi

Foresight News50m ago

Ethereum at $1.5K: A tense stand-off forms, with ETH shorts under pressure

Ethereum faces a tense standoff, trading around $1,500 after a nearly 50% decline in the first half of 2026. While technicals and bearish derivatives positioning signal downside risk, key on-chain data suggests a potential trap for short sellers. Large transfers to exchanges are accompanied by significant stablecoin inflows, indicating whales may be preparing to buy rather than sell. Furthermore, strong user retention and rising ETH staking activity are reducing available supply. This clash between heavy bearish bets and solid holder conviction creates conditions for a possible short squeeze, making the current consolidation risky for those positioned for further downside.

ambcrypto51m ago

Ethereum at $1.5K: A tense stand-off forms, with ETH shorts under pressure