Claude Opus 4.8 Just Launched, Immediately Claimed to Be DeepSeek and Qwen

marsbitPubblicato 2026-05-28Pubblicato ultima volta 2026-05-28

Introduzione

Anthropic has launched Claude Opus 4.8, an upgraded flagship AI model, while announcing a massive $65 billion Series H funding round, valuing the company at $965 billion. Claude Opus 4.8 offers improvements in coding, reasoning, agent tasks, and knowledge work, while maintaining previous pricing. Key enhancements include greater reliability in multi-step tasks and a significant increase in honesty—it's more likely to flag uncertainties or potential code issues. However, some user tests noted instances where the model incorrectly identified itself as other AIs like DeepSeek or Qwen, suggesting possible distillation behavior. Alongside the model, Anthropic introduced new features, most notably 'dynamic workflows' in Claude Code. This research preview allows the system to coordinate dozens to hundreds of parallel sub-agents for large-scale tasks like major code migrations, framework replacements, and security audits. The $65 billion funding round, which includes strategic investments from hyperscalers like Amazon and key hardware partners, is intended to secure massive compute capacity. This positions Anthropic to shift from a model provider to a company building a comprehensive enterprise AI infrastructure system, integrating intelligence, development tools, cloud platforms, and compute resources.

With IPO expectations heating up, Anthropic's product model release pace is also accelerating.

Just now, Anthropic released two major announcements in quick succession:first, upgrading the flagship model to Claude Opus 4.8; second, completing a $650 billion Series H funding round, with a post-money valuation reaching $9650 billion, already nearing the trillion-dollar mark.

For an AI company with a valuation already approaching a trillion dollars, the market is no longer looking just at benchmark scores, but at whether it can turn intelligence, tools, development environments, cloud platforms, and computing resources into a set of scalable, deliverable infrastructure.

In other words, Anthropic must gradually shift from being a company that delivers good models to a company attempting to reshape enterprise AI workflows.

More for the Same Price: Claude Opus 4.8 Officially Arrives

The released Claude Opus 4.8 is an upgrade to Anthropic's flagship Opus series. Given the current model release cadence, as internet users jokingly say,we are likely to see Claude Opus 6 before GTA 6 releases (if there are no delays, in November).

Anthropic states that Opus 4.8 is built on Opus 4.7, with improvements in coding, agent tasks, reasoning, and knowledge work. It's already open to users, with standard usage prices unchanged,remaining $5 per million input tokens and $25 per million output tokens.

Developers can also access claude-opus-4-8 via the Claude API.

API Price Comparison https://platform.claude.com/docs/en/about-claude/models/overview

According to officially disclosed information, Opus 4.8's improvements cover coding, agent capabilities, reasoning abilities, and practical knowledge work tasks.

Anthropic used a comparison table in the release materials to show Opus 4.8's performance compared to its predecessor Opus 4.7 and other models across various tests. However, compared to single-response quality,the model's upgrade focus is more on its performance in long-duration tasks and complex collaboration.

In real-world workflows, models often need to handle multi-step tasks consecutively, call tools, check intermediate results, and proceed based on feedback. Anthropic states that early testers found Opus 4.8 more reliable when executing agent tasks, with clearer judgment.

Improvements in honesty are a highlight for Opus 4.8.

A common problem with AI models is making judgments prematurely with insufficient evidence and confidently claiming progress. Anthropic says Opus 4.8 is more willing to explain uncertainties in its work and less likely to make unsupported assertions.

Coding tasks particularly reflect this change.

Internal evaluations show that Opus 4.8 is about one-fourth as likely as its predecessor to let defects in its own written code pass without explanation. In other words, the new model is more likely to alert users when it identifies risks, rather than leaving issues for later testing or production environments.

Regarding alignment and safety, Anthropic continues its core narrative. Opus 4.8's occurrence of misalignment behaviors like deception and cooperation with abuse is significantly lower than Opus 4.7, approaching one of the currently best-aligned models, Claude Mythos Preview.

Safe, reliable, and controllable remain a set of keywords Anthropic uses to differentiate itself. As Claude integrates deeper into enterprise processes, these keywords are starting to carry more commercial significance.

Interestingly, after Opus 4.8's release, netizens noticed something odd.

https://x.com/realNyarime/status/2060059543820963975

Many users testing it found that when they repeatedly questioned Opus 4.8 about its model identity, its answers weren't always Claude.

Sometimes it would identify itself as Qwen, other times it would state the name DeepSeek, suggesting possible distillation behavior.

When users asked the same question in the official Claude client, such responses were usually harder to reproduce. The reason likely lies in the more complete system prompts and product-layer constraints within the client.

Dynamic Workflows Launched, Claude Code Moves Towards Multi-Agent Collaboration

Alongside Claude Opus 4.8, several product and developer features also went live.

Among them, the one most directly impacting the Claude user experience is effort control, or thinking intensity adjustment.

Located next to the model selector, as the name suggests, users can decide how much reasoning compute power Claude invests in a single task. At higher intensity, Claude performs more reasoning to achieve better response quality; at lower intensity, Claude responds faster, and usage quota consumption is slower.

Anthropic states that Opus 4.8 defaults to high effort. Users can also choose extra (corresponding to xhigh in Claude Code), or max, allowing the model to invest more tokens. Anthropic recommends using extra for difficult tasks and long-running asynchronous workflows.

What truly influences Claude Code's product form is dynamic workflows.

This feature is currently in research preview, aiming to enable Claude Code to handle large-scale tasks that previously required longer engineering cycles. Work planned quarterly in the past might now be completed in just a few days.

The core mechanism of dynamic workflows is that Claude dynamically writes orchestration scripts based on user tasks, running dozens to hundreds of parallel subagents in a single session. The model first plans the task, then assigns it to multiple subagents, checks the returned results, and finally reports back to the user. With Opus 4.8's launch, these agents can also run for longer durations.

This feature primarily targets complex, large, or historically burdened codebases. Typical scenarios include bug hunting across entire services, performance optimization audits, security audits, large codebase migrations, framework replacements, API deprecation migrations, language porting, and multi-angle validation of critical solutions.

Regarding usage, Anthropic recommends enabling auto mode in dynamic workflows. Users can directly ask Claude to create a workflow or enable ultracode within Claude Code. Ultracode sets the thinking intensity to xhigh and lets Claude automatically judge whether the current task is suitable for using a workflow.

Dynamic workflows are now available in Claude Code CLI, Desktop, and VS Code extension, for Max, Team, and Enterprise plans. For Enterprise, it's disabled by default at launch and requires administrator enablement in Claude Code settings.

The feature is also usable via Claude API, Amazon Bedrock, Vertex AI, and Microsoft Foundry. For Max, Team users, and users accessing Claude Code via API, dynamic workflows are enabled by default.

Anthropic demonstrated the upper limit of dynamic workflows with the Bun migration case. Jarred Sumner used this feature to port Bun from Zig to Rust, ultimately generating approximately 750,000 lines of Rust code, achieving a 99.8% pass rate on existing test suites, taking about 11 days from first commit to merge.

The entire migration was completed through multiple workflows: first mapping Rust lifetimes for struct fields in the Zig codebase, then generating behaviorally consistent .rs files for each .zig file, with hundreds of agents working in parallel, each file reviewed by two reviewers. Subsequently, a fix loop continuously ran builds and test suites until builds and tests passed. After migration, overnight workflows handled unnecessary data copying issues, opening PRs for each problem type for final review.

Beyond Claude Code, Anthropic also updated the Messages API. Now, the Messages API can accept system entries within the messages array.

Developers can update Claude's instructions during task execution without breaking prompt cache or needing to pass updates through a user turn. This capability can be used to update permissions, token budgets, or environmental context during agent runtime.

Next, Anthropic plans to introduce a new model category with higher intelligence levels than Opus.Yes, the incredibly powerful Claude Mythos Preview, expected to be available to all customers within the coming weeks.

When the time comes, we'll also be among the first to give it a try.

Behind the Near-Trillion-Dollar Valuation, Claude Needs a Larger Computing Foundation

The other news announced on the same day as Claude Opus 4.8is Anthropic's completion of a $650 billion Series H funding round. This round was led by Altimeter Capital, Dragoneer, Greenoaks, and Sequoia Capital, resulting in a post-money valuation of $9650 billion.

This funding round also includes $150 billion in existing committed investments from hyperscalers, including $50 billion from Amazon. Strategic infrastructure partners like Micron, Samsung, and SK hynix also joined. Anthropic states these companies play crucial roles in global memory, storage, and logic chip supply, helping it scale computing power as Claude's demand grows.

Computing power expansion is the key backdrop behind this funding round. Anthropic disclosed several infrastructure agreements: an agreement with Amazon for up to 5 gigawatts of new capacity; agreements with Google and Broadcom for 5 gigawatts of next-generation TPU capacity; and an agreement with SpaceX for access to GPU capacity in Colossus 1 and Colossus 2.

Anthropic also emphasizes that Claude is the first frontier model simultaneously available on all three major cloud platforms: AWS, Google Cloud, and Microsoft Azure. However, AWS remains Anthropic's primary cloud provider and training partner.

Behind the funding lies a shift in Anthropic's commercial positioning. Early large model companies competed on model capabilities and general chat experience. Now, enterprise customers care more about whether AI can enter core processes, handle complex tasks, and be integrated into development environments, cloud platforms, and internal systems.

Claude Code, Cowork, effort control, dynamic workflows, and Messages API updates all revolve around this direction.

Looking at product releases and funding together, Anthropic is simultaneously expanding three types of capabilities.

The first is model capabilities.Opus 4.8 improves coding, reasoning, agent tasks, and knowledge work performance, and strengthens the expression of uncertainty.

The second is workflow capabilities.Dynamic workflows move Claude Code from single-instance code assistance towards more complex engineering execution and review.

The third is infrastructure capabilities.The $650 billion funding, hyperscaler committed investments, memory and chip partner inclusion, and computing power agreements with Amazon, Google, Broadcom, and SpaceX provide resources for future model training and inference needs.

This is also the core logic behind Anthropic's near-trillion-dollar valuation. Claude is no longer just an AI chat window; it is becoming a work system connecting models, code, enterprise processes, cloud platforms, and computing infrastructure.

Opus 4.8 is the latest model foundation in this system. Dynamic workflows represent the product form for complex engineering tasks. The $650 billion funding and computing power expansion are prerequisites for pushing this system to larger-scale customers.

The AI tide has pushed Anthropic to the crest of the wave. From this height, moving forward means riding the wind; falling back means sinking. There is no third posture.

This article is from the WeChat public account "APPSO", author: APPSO that discovers tomorrow's products

Domande pertinenti

QWhat are the two main announcements made by Anthropic as reported in the article?

AThe two main announcements are the upgrade of its flagship model to Claude Opus 4.8 and the completion of a $65 billion Series H funding round, resulting in a post-money valuation of approximately $965 billion.

QWhat unusual behavior did users observe when questioning Claude Opus 4.8 about its identity?

AUsers observed that when asked about its model identity, Claude Opus 4.8 sometimes identified itself as other models like Qwen or DeepSeek, suggesting potential distillation behavior.

QWhat is the 'dynamic workflows' feature introduced with Claude Opus 4.8, and what is its primary goal?

AThe 'dynamic workflows' feature allows Claude Code to handle large-scale, complex tasks that previously required longer engineering cycles. Its primary goal is to orchestrate parallel subagents to complete major projects, such as code migrations or audits, potentially reducing the timeline from quarters to days.

QAccording to the article, what is the core shift in Anthropic's business positioning as its valuation approaches $1 trillion?

AThe core shift is from being a company that delivers good AI models to one that aims to reshape enterprise AI work practices. This involves integrating AI intelligence, tools, development environments, cloud platforms, and computing resources into a scalable infrastructure.

QHow does the 'effort control' feature, also mentioned in the article, work for Claude users?

AThe 'effort control' feature allows users to adjust the amount of reasoning compute power Claude invests in a task. Higher settings (like extra or max) result in more reasoning for better answer quality but consume usage quota faster, while lower settings provide faster responses with slower quota consumption.

Letture associate

AI Relay Stations Spark Heated Debate on Zhihu: Behind Cheap Tokens, What Are Users Really Worried About?

A discussion on Zhihu about "AI relay stations" shifted the niche developer topic of "cheap tokens" into broader user awareness. Users moved beyond simply questioning the legitimacy of these services to focus on practical concerns: Where do cheap tokens truly come from? Is the model being accessed the real one? Can relay stations see prompts, code, and API keys? For occasional users, are the risks worth it? The core debate centered less on price and more on trust. A primary worry is model authenticity—the risk of "model swapping," where users paying for a premium model might be routed to a cheaper one, creating an information asymmetry. Others argued that cost comparisons matter; while cheaper than official pay-as-you-go APIs, relay stations may not be the lowest-cost option versus subscriptions, domestic models, or free tiers, making user needs assessment crucial. Speculation about token sources ranged from legitimate bulk discounts to gray-area methods like account sharing or exploiting regional pricing. This opacity makes risk assessment difficult for users. Data security emerged as a critical concern, especially for enterprise use. When processing sensitive information like code, contracts, or client data, the inability to verify a relay station's data handling, retention, or access policies poses significant compliance and confidentiality risks. The evolving consensus suggests relay stations can be used cautiously for low-sensitivity, disposable tasks (e.g., summarizing public info, simple translation). However, they should not be the default for sensitive, professional, or production workflows involving proprietary data, Agents, or automated systems. Recommendations include avoiding large prepayments, not relying on a single service, using test prompts to monitor quality, anonymizing data where possible, and keeping official channels as backups. Ultimately, the discussion framed tokens not just as a billing unit but as a measure of real cost encompassing price, model integrity, data security, and service stability. The popularity of relay stations highlights user demand for affordable access, but the debate underscores a key trade-off: the savings from cheap tokens may come at the price of trust, transparency, and control over one's data and AI experience.

marsbit14 min fa

AI Relay Stations Spark Heated Debate on Zhihu: Behind Cheap Tokens, What Are Users Really Worried About?

marsbit14 min fa

In-Depth Research Report on TradFi: The Convergence Wave of Crypto and Traditional Finance

In 2026, the crypto industry is undergoing a profound infrastructure-level transformation—TradFi assets are migrating on-chain at an unprecedented pace. According to CoinGecko's Q1 2026 report, the total value locked (TVL) of tokenized real-world assets (RWA) has surpassed $31 billion, a nearly 4x increase from $7.8 billion at the beginning of 2025, with the sector’s aggregate market capitalization reaching $19.3 billion. Among these, the market cap of tokenized stocks surged from $2 million to $486 million, with Q1 spot trading volume reaching $15.1 billion—a single quarter already surpassing the entire second half of 2025. RWA perpetual contract Q1 trading volume reached a staggering $524.8 billion, far exceeding the $313 billion for all of 2025. Meanwhile, BlackRock's BUIDL fund has reached $2.3 billion in scale and has filed for two new tokenized funds, signaling that the world's largest asset manager's tokenization strategy is evolving from pilot to product suite expansion. HTX, as a core participant in the crypto exchange sector, officially launched TradFi perpetual futures products including NVDA, AAPL, MSFT, META, and SPY in 2026, enabling crypto users to gain 24/7 trading access to core U.S. equities. Boston Consulting Group predicts that global tokenized asset scale could reach $16 trillion by 2030, while McKinsey offers a conservative estimate of approximately $2 trillion. The on-chain migration of TradFi assets is no longer a "future narrative" but a structural transformation unfolding in real time, as crypto exchanges evolve from single crypto asset trading platforms toward "multi-asset-class trading infrastructure."

HTX Learn17 min fa

In-Depth Research Report on TradFi: The Convergence Wave of Crypto and Traditional Finance

HTX Learn17 min fa

Blocked Its Own Treasure, WeChat AI Steps Up

Tencent's stock surged over 10% on June 2nd amid reports that WeChat, with 1.43 billion monthly users, is finalizing tests for a native AI Agent. The reported feature, accessible by swiping right from the main interface, allows users to issue commands in natural language. The AI then decomposes tasks and automatically calls upon relevant Mini Programs within WeChat to complete actions like ordering food, booking tickets, or making payments, creating a closed-loop service execution system. This strategic shift follows the internal conflict and subsequent "blocking" of Tencent's standalone AI app, Yuanbao, by WeChat for violating sharing rules during a 2026 Spring Festival promotion. The incident highlighted a lack of internal consensus and exposed the weakness of competing in the standalone AI assistant arena against rivals like ByteDance's Doubao (345M MAU) and Alibaba's Qianwen. The new WeChat AI Agent aims to leverage WeChat's unique assets—its massive user base, standardized Mini Program APIs, WeChat Pay, and identity system—to move from simple content generation to actual task execution. Analysts note this changes the competitive landscape from model benchmarks to which AI can connect to more real-world services. However, success depends on key variables: the capability of Tencent's underlying Hunyuan model, managing massive inference costs, and redesigning incentives for Mini Program developers whose traffic might be bypassed. The move is seen as an attempt to keep user service intent within WeChat's ecosystem as AI begins to redefine how users access services.

marsbit1 h fa

Blocked Its Own Treasure, WeChat AI Steps Up

marsbit1 h fa

ByteDance Adopts Arm CPUs, Jensen Huang: So Sad I Didn't Buy Arm

**Summary:** At Computex 2026, Arm CEO Rene Haas announced that ByteDance and Oracle have adopted Arm's self-designed Arm AGI data center CPU. The company expects significant revenue growth from this product, projecting $20 billion in demand for the 2027/2028 fiscal years. Haas noted that restricting AI-capable CPUs from the US to China is nearly impossible due to their widespread applications. Arm's stock has surged dramatically this year, notably rising 16% after NVIDIA's Arm-based Vera CPU and RTX Spark announcements. A highlight was the informal, humorous on-stage conversation between Haas and NVIDIA CEO Jensen Huang. Huang joked about NVIDIA's failed attempt to acquire Arm and playfully lamented selling his Arm shares. Both executives showed a clear sense of camaraderie and shared regret over the missed merger. Key technical topics were discussed: 1. **AI PC Design:** Huang explained NVIDIA's RTX Spark superchip (with a 20-core Arm CPU) is designed for future AI agents that will autonomously run and use tools on PCs, blending local and cloud processing. 2. **Agent vs. OS:** Huang emphasized the operating system remains crucial, as AI agents rely on its APIs and tools to function. 3. **Growth Constraints:** He identified the shift to "useful AI" that generates profitable tokens as a primary driver for immense, almost limitless, computational demand. Haas outlined Arm's strategy across PC and data centers. For PCs, Arm collaborates with partners like NVIDIA and MediaTek, offering its compute subsystem (CSS) for custom SoCs. In data centers, its Arm AGI CPU (built on TSMC's 3nm process) has gained major partners including OpenAI, Meta, and now ByteDance and Oracle. Arm presented a multi-year roadmap for its in-house CPU line. The article concludes that while GPUs dominated the AI training race, the explosion of AI agents is shifting significant focus to CPUs for inference, state management, and tool orchestration. The industry is trending towards vertical integration, with companies like cloud providers designing chips and chip/IP firms offering full solutions, all competing to deliver more efficient computing per watt.

marsbit1 h fa

ByteDance Adopts Arm CPUs, Jensen Huang: So Sad I Didn't Buy Arm

marsbit1 h fa

Trading

Spot
Futures
活动图片