Microsoft is Afraid of Being Marginalized by AI Giants

marsbit2026-06-03 tarihinde yayınlandı2026-06-03 tarihinde güncellendi

Özet

Microsoft, once the defining force of the PC era, now faces a familiar challenge in the AI age: the risk of being relegated to a profitable but invisible infrastructure provider. This anxiety was laid bare at Build 2026, where CEO Satya Nadella unveiled a major strategic pivot. The catalyst was a quiet April agreement that dissolved Microsoft's exclusive licensing and cloud-hosting deal with OpenAI, its once-vital partner. This erased Microsoft's key AI moat. With OpenAI and Anthropic defining AI applications and gaining enterprise traction—even within Microsoft's own ranks—Nadella had to answer: without exclusivity, what is Microsoft's role? The answer was a suite of seven in-house AI models, a developer-focused AI workstation (Surface RTX Spark Dev Box), and, most crucially, the Agent 365 platform for enterprise AI governance. The models, notably targeting Anthropic's strengths in coding and enterprise, signal a defensive move. However, the broader strategy is to make the models themselves less decisive. Financially, Microsoft's AI revenue is strong, driven largely by Azure running others' models. Yet its user-facing products like Copilot show weak penetration and engagement. Microsoft earns infrastructure money but lacks direct user mindshare. Nadella's core fear is being "hollowed out." As OpenAI and Anthropic prepare for IPOs and gain financial independence, they may build their own infrastructure, threatening Azure's lucrative AI revenue stream. Microsoft's window i...

Once upon a time, OpenAI needed Microsoft.

Today, Microsoft needs to prove it doesn't need OpenAI.

On June 2nd, Build 2026 opened. Microsoft CEO Satya Nadella unveiled seven in-house models, an AI workstation for developers, an enterprise Agent governance platform, and a quantum chip in one go. The information density was off the charts, with every move pointing to the same thing:

Microsoft is parting ways with its closest ally.

But if we zoom out a bit.

You'll find that the situation Microsoft faces today is not unfamiliar.

Thirty years ago, Microsoft defined the PC era. Twenty years ago, the internet rose, Google defined search, Facebook defined social media, Apple defined mobile internet, while Microsoft gradually retreated to the background, becoming the most profitable yet least imaginative infrastructure company of the era.

Later, Satya Nadella spent a decade bringing Microsoft back to center stage.

Today, as AI becomes the new operating system, Microsoft suddenly finds itself standing in that familiar position again.

OpenAI and Anthropic are defining AI, Cursor is defining development, Google is back at the AI table.

Microsoft seems to be selling cloud services again.

I. April 27th, That Crack

The turning point of the story isn't Build itself. It's a deliberately low-key revised agreement from two months prior.

On April 27th, Microsoft and OpenAI simultaneously issued statements announcing their cooperation was entering its next phase. The wording was gentle, the content dramatic:

Microsoft's intellectual property license for OpenAI's models and products changed from exclusive to non-exclusive.

OpenAI can offer products through any cloud service provider, no longer bound to Azure.

Microsoft will no longer pay OpenAI a revenue share.

To put it bluntly, the breakup papers are signed, but they're still living together.

This means the biggest moat Microsoft bought starting in 2019 with $13 billion—that it was the only one globally who could run OpenAI models—was broken overnight.

OpenAI can partner with AWS, with Google Cloud, with anyone.

Microsoft went from being the AI era's exclusive partner to one of its primary cloud service providers.

This is why Build 2026 became Nadella's most crucial keynote. He had to answer a question: Without exclusive rights to OpenAI, what makes Microsoft still the protagonist of the AI era?

II. The Real Problem Behind the Seven Models

The seven models released on Build day—MAI Thinking 1 (Reasoning), MAI Code 1 Flash (Code), MAI Image 2.5 (Image), MAI Voice 2 (Voice), MAI Transcribe 1.5 (Transcription)—cover almost all core capabilities of the AI product chain. This is also the largest simultaneous release of in-house models in Microsoft's history.

But what's truly noteworthy about these models isn't their parameter count or benchmark scores; it's who they're benchmarking against.

Microsoft's AI lead, Mustafa Suleyman, said one thing in an interview: We're focusing more on an Anthropic-style direction: enterprise, developers, and coding.

This directly named the opponent: it's Anthropic, no question.

MAI Code 1 Flash was directly compared to Claude Haiku 4.5, scoring 51.2% on SWE Bench Pro versus Haiku's 35.2%. MAI Thinking 1 is benchmarked against Claude Sonnet 4.6.

Why is the opponent Anthropic and not OpenAI?

The answer is obvious.

According to the Ramp AI Index, in April 2026, Anthropic's enterprise paid adoption rate reached 34.4%, surpassing OpenAI's 32.3% for the first time. In about 70% of initial AI service procurement head-to-heads, enterprises signed with Claude, not ChatGPT. Claude Code holds 54% of the AI programming tools market, while GitHub Copilot has fallen to about 25%.

More awkwardly, Microsoft's own engineers are also using Claude.

According to media reports, Microsoft's internal evaluation found that core developer satisfaction with its own Copilot was lower than with external competitors. The development culture is being infiltrated by external tools.

This is the truth behind the seven models. It's not that Microsoft is also awesome; it's that Microsoft must save itself.

Over the past three years, the entire AI industry believed the model was everything. But at Build 2026, Nadella repeatedly emphasized not one model, but 11,000 models.

Behind this is a classic Microsoft logic.

The future won't have just one super model. Models will become increasingly like databases, servers, and cloud resources—a standard capability.

Microsoft releasing seven models isn't just to prove it can make models. It's also an attempt to make the models themselves less important.

III. Making Money ≠ Being the Protagonist

If you just look at the financial reports, Microsoft's AI business looks great.

In Q3 FY2026, Azure grew 40%, AI business annualized revenue run-rate hit $37 billion, up 123% year-over-year. This is real money.

But behind these numbers lies an awkward truth.

The vast majority of that $37 billion comes from running models *for others*. OpenAI runs on Azure, part of Anthropic's compute is on Azure.

Microsoft is making infrastructure money, not application money.

What about Copilot, which directly faces users?

In the paid AI assistant market, based on Recon Analytics statistics for US paid subscribers, Copilot's market share fell from 18.8% in July 2025 to 11.5% in January 2026, shrinking 7.3 percentage points in half a year, a relative drop of 39%.

In overall workplace office hours, AI tool actual usage time remains consistently at only 1%, showing almost no increase for several consecutive years. There's neither explosive adoption nor a cliff-like collapse; it's stuck on the edge of mainstream workflows.

M365 Copilot reached 15 million paid seats at the beginning of the year, but a Stackmatix survey shows that among employees with product access, the regular active conversion rate is only 35.8%. Many enterprises bulk purchase licenses, but daily employee adoption rates are far below the seat count numbers. Far fewer people are actually using it for daily work than the numbers suggest.

This is the real dilemma Microsoft faces: making money and being the protagonist are two different things.

Azure makes the most money, but ChatGPT and Claude directly own the users. Nadella knows that if Copilot's penetration rate stays in the single digits, Microsoft in the AI era will be like AWS: a massive, profitable, brandless pipe.

Nobody remembers what AWS looks like. But everyone remembers ChatGPT. This is precisely the problem Nadella must solve.

IV. The Subtlety of Jensen Huang's Appearance

One detail at Build is worth noting.

Jensen Huang appeared via video link at the keynote, showing support for the Surface RTX Spark Dev Box. He said the PC is transitioning from personal computer to personal AI.

Two days earlier in Taipei, he had just launched the RTX Spark chip, announcing NVIDIA's formal entry into the PC processor market. Microsoft is his most important partner on the PC front.

But looking closely at this relationship reveals a more subtle conflict.

In the data center market, NVIDIA's biggest customers are Azure, AWS, Google Cloud. Whoever buys the chips is the winner in AI infrastructure. NVIDIA doesn't pick sides; it sells products to everyone.

In the PC market, the RTX Spark chip is allied with Microsoft and MediaTek. The first OEMs are Dell, Lenovo, HP, Asus. On the surface, Microsoft is the beneficiary; every AI PC runs Windows.

But NVIDIA's real goal in entering the PC market isn't to help Microsoft sell Windows; it's to sell local AI compute. RTX Spark's core selling point is 1 petaflop of AI inference capability—running models locally, running Agents locally. If local compute is strong enough, why would users need to go to the cloud? Why would they need Azure?

While helping Microsoft on stage, Jensen Huang is also loosening the foundation of Microsoft's cloud business.

This is the most surreal aspect of today's tech industry: everyone is a partner, and everyone is a potential competitor.

V. Why Build Downplayed Consumers

Looking at the two-day Build 2026 agenda, it almost entirely downplayed the average consumer.

There was no announcement of new Copilot features. No demos of "AI changing your daily life." Even the AI PC concept, which Jensen Huang shouted so loudly about in Taipei two days prior, was deliberately downplayed by Microsoft at Build.

What replaced it were product and capability pitches aimed at developers and enterprises.

Surface RTX Spark Dev Box: 1 petaflop local AI compute, 128GB unified memory, a workstation built for developers.

Agent 365: An enterprise Agent governance platform managing identity, permissions, access control, and compliance, integrating with all major clouds.

MXC (Microsoft Execution Containers): Underlying containers adding safety guardrails to Agents.

OpenClaw on Windows: Enabling personal Agents to securely enter enterprise environments.

The logic behind this is crystal clear: in the consumer market, Microsoft can't win anymore. Tools like ChatGPT, Claude Code, Cursor engage users directly; Microsoft's Copilot can't break in.

But the enterprise market is different. Enterprises need compliance, identity management, auditing, data isolation, multi-Agent permission controls. These are things ChatGPT and Claude can't do; they are inherently personal products.

Nadella's bet is that the AI era needs an enterprise operating system. This operating system isn't the model itself, but the entire platform that lets models run safely and compliantly within enterprises.

Whoever controls this platform controls the entry point for enterprise AI.

This also explains why Build's final headliner wasn't a model launch, but Agent 365—an AI management platform for managing AI.

VI. What Nadella Truly Fears

What exactly is Nadella anxious about?

It's not that Azure isn't profitable, nor that they can't build models, nor that competitors are too strong. He fears that after OpenAI and Anthropic go public, they won't need Microsoft anymore, and he'll be marginalized.

Anthropic secretly filed an S-1 with the SEC on June 1st, valuation $965 billion. OpenAI is also preparing a confidential filing, expected to submit in the second half of the year.

What happens after they go public?

With their own money, they'll use it to buy compute, build data centers. AWS is already deeply tied with Anthropic. SpaceX supplies Anthropic with $1.25 billion in GPU compute monthly, an annual compute bill approaching $15 billion.

When OpenAI and Anthropic no longer need to run models on Azure, how much of that $37 billion AI annualized revenue can Microsoft hold onto?

This is Nadella's real time window.

He must transform Microsoft from being the infrastructure that runs models for others into the enterprise platform all AI must go through, before OpenAI and Anthropic achieve independence.

Models can be swapped. Clouds can be swapped.

But if your identity system, compliance framework, audit logs, and security containers all run on Microsoft's Agent 365, you can't swap out Microsoft.

This is what Build 2026 is really doing: laying a layer of infrastructure only Microsoft can build, beneath all the models.

【Beyond the Layout】's Words:

Another trend revealed by Microsoft Build 2026 is that models are gradually shifting from being the protagonist to becoming infrastructure.

As more and more enterprises use OpenAI, Anthropic, Gemini, and various open-source models simultaneously, what truly matters isn't choosing which model, but who manages these models.

If in 2019, Microsoft leveraged OpenAI to get a first-class ticket to the AI era.

Then in 2026, Microsoft is doing something else. Nadella wants to prove he's not just a passenger, but the driver.

He understands better than anyone that Microsoft once missed mobile internet and once retreated from center stage.

This time, he doesn't want to experience that again.

This article is from the WeChat public account "Beyond the Layout", author: Hua Hua

İlgili Sorular

QAccording to the article, what is the core strategic shift Microsoft is attempting at Build 2026?

AMicrosoft is shifting its focus from being a provider of AI infrastructure (like running OpenAI's models on Azure) to becoming the essential platform layer for enterprise AI. This means emphasizing products like Agent 365 for governance, security, compliance, and management of multiple AI models and agents, aiming to make its platform indispensable regardless of which specific AI model a company uses.

QWhy is Microsoft's relationship with OpenAI described as changing from 'exclusive' to 'non-exclusive'?

AA revised agreement in April 2026 removed key exclusive terms. Microsoft's license for OpenAI's intellectual property became non-exclusive, OpenAI was no longer bound to run its products solely on Azure, and Microsoft stopped paying OpenAI a revenue share. This broke Microsoft's primary competitive moat, allowing OpenAI to partner with other cloud providers like AWS and Google Cloud.

QWhat major concern does the article highlight regarding Microsoft's Copilot business?

AWhile Azure's AI revenue is high, it largely comes from infrastructure services. The direct-to-user Copilot business faces challenges: declining market share among paid AI assistants, low active user conversion rates within enterprises (around 35.8%), and AI tools being stuck at the edge of mainstream workflows, accounting for only about 1% of workplace time.

QWhat is the paradoxical nature of NVIDIA's partnership with Microsoft as described in the article?

ANVIDIA partners with Microsoft in the PC market (e.g., RTX Spark chips for AI PCs running Windows), but its strategy ultimately undermines Microsoft's core cloud business. By pushing powerful local AI inference capabilities (1 petaflop), NVIDIA reduces the need for users to rely on cloud services like Azure for AI processing, making partners also potential competitors.

QWhat future event is Nadella most anxious about, according to the article's analysis?

ANadella is most anxious about OpenAI and Anthropic going public. Once they have their own capital from IPOs, they will be less dependent on Microsoft's Azure for compute resources (e.g., buying their own GPUs, building data centers, partnering with AWS/SpaceX). This threatens the sustainability of a significant portion of Microsoft's $37 billion AI revenue run rate, which is heavily tied to providing them cloud infrastructure.

İlgili Okumalar

Blocked Its Own Treasure, WeChat AI Steps Up

Tencent's stock surged over 10% on June 2nd amid reports that WeChat, with 1.43 billion monthly users, is finalizing tests for a native AI Agent. The reported feature, accessible by swiping right from the main interface, allows users to issue commands in natural language. The AI then decomposes tasks and automatically calls upon relevant Mini Programs within WeChat to complete actions like ordering food, booking tickets, or making payments, creating a closed-loop service execution system. This strategic shift follows the internal conflict and subsequent "blocking" of Tencent's standalone AI app, Yuanbao, by WeChat for violating sharing rules during a 2026 Spring Festival promotion. The incident highlighted a lack of internal consensus and exposed the weakness of competing in the standalone AI assistant arena against rivals like ByteDance's Doubao (345M MAU) and Alibaba's Qianwen. The new WeChat AI Agent aims to leverage WeChat's unique assets—its massive user base, standardized Mini Program APIs, WeChat Pay, and identity system—to move from simple content generation to actual task execution. Analysts note this changes the competitive landscape from model benchmarks to which AI can connect to more real-world services. However, success depends on key variables: the capability of Tencent's underlying Hunyuan model, managing massive inference costs, and redesigning incentives for Mini Program developers whose traffic might be bypassed. The move is seen as an attempt to keep user service intent within WeChat's ecosystem as AI begins to redefine how users access services.

marsbit36 dk önce

Blocked Its Own Treasure, WeChat AI Steps Up

marsbit36 dk önce

ByteDance Adopts Arm CPUs, Jensen Huang: So Sad I Didn't Buy Arm

**Summary:** At Computex 2026, Arm CEO Rene Haas announced that ByteDance and Oracle have adopted Arm's self-designed Arm AGI data center CPU. The company expects significant revenue growth from this product, projecting $20 billion in demand for the 2027/2028 fiscal years. Haas noted that restricting AI-capable CPUs from the US to China is nearly impossible due to their widespread applications. Arm's stock has surged dramatically this year, notably rising 16% after NVIDIA's Arm-based Vera CPU and RTX Spark announcements. A highlight was the informal, humorous on-stage conversation between Haas and NVIDIA CEO Jensen Huang. Huang joked about NVIDIA's failed attempt to acquire Arm and playfully lamented selling his Arm shares. Both executives showed a clear sense of camaraderie and shared regret over the missed merger. Key technical topics were discussed: 1. **AI PC Design:** Huang explained NVIDIA's RTX Spark superchip (with a 20-core Arm CPU) is designed for future AI agents that will autonomously run and use tools on PCs, blending local and cloud processing. 2. **Agent vs. OS:** Huang emphasized the operating system remains crucial, as AI agents rely on its APIs and tools to function. 3. **Growth Constraints:** He identified the shift to "useful AI" that generates profitable tokens as a primary driver for immense, almost limitless, computational demand. Haas outlined Arm's strategy across PC and data centers. For PCs, Arm collaborates with partners like NVIDIA and MediaTek, offering its compute subsystem (CSS) for custom SoCs. In data centers, its Arm AGI CPU (built on TSMC's 3nm process) has gained major partners including OpenAI, Meta, and now ByteDance and Oracle. Arm presented a multi-year roadmap for its in-house CPU line. The article concludes that while GPUs dominated the AI training race, the explosion of AI agents is shifting significant focus to CPUs for inference, state management, and tool orchestration. The industry is trending towards vertical integration, with companies like cloud providers designing chips and chip/IP firms offering full solutions, all competing to deliver more efficient computing per watt.

marsbit56 dk önce

ByteDance Adopts Arm CPUs, Jensen Huang: So Sad I Didn't Buy Arm

marsbit56 dk önce

New Wall Street Play: Yen Shorts Still Adding, But Japan Stocks Don't Rely on Carry Trade Unwinding

On June 3rd, USD/JPY hit 160.44, its highest level since July 2024, while the Nikkei 225 surged past 68,000 points. Contrary to popular narratives of an imminent "carry trade unwind" akin to August 2024, data reveals a more complex picture. Speculative net short positions in yen futures have actually increased, reaching -114,667 contracts by late May, suggesting traders are doubling down rather than retreating. Meanwhile, Japan's Finance Ministry conducted its largest-ever single-round FX intervention (11.73 trillion yen) in April-May but failed to hold the 160 yen line. The Nikkei's rally is not driven by carry trade dynamics. Foreign investors are aggressively buying Japanese stocks, with net purchases in 2026 running nearly 16 times higher than 2025 levels. This inflow is concentrated in AI and semiconductor-related stocks like SoftBank and Socionext, fueled by positive sector outlooks, rather than being a flight from unwinding yen shorts. Furthermore, the Nikkei has continued climbing despite the Bank of Japan's (BOJ) rate hikes to 0.75%. This disconnect exists because the current equity boom is fueled by AI-driven foreign investment, not reliant on cheap yen funding. However, this relationship remains fragile. Should the BOJ hike rates further (e.g., to 1.0%) while dollar weakness increases carry trade costs, the trajectories of the yen and Japanese stocks could reconverge, potentially triggering volatility.

marsbit1 saat önce

New Wall Street Play: Yen Shorts Still Adding, But Japan Stocks Don't Rely on Carry Trade Unwinding

marsbit1 saat önce

Broadcom's Q3 Guidance Misses Expectations by $12 Billion, After-Hours Trading Plummets Over 13%, AI Narrative "Cooling"?

On June 3, Broadcom released record Q2 FY26 results with revenue of $22.19B, up 48% YoY, and AI chip sales of $10.8B, up 143%. Adjusted EPS of $2.44 beat estimates. However, its Q3 AI semiconductor revenue guidance of $16B, while up over 200% YoY, fell roughly $1.2B (7%) short of analyst consensus expectations of $17.2B. This miss, coupled with slightly weaker-than-expected software revenue, triggered a severe market reaction. CEO Hock Tan maintained the FY26 AI revenue outlook of over $100B but did not raise it, disappointing investors who had priced in more robust growth. The stock plummeted over 13% in after-hours trading, erasing roughly $270B in market cap. The sell-off extended to peers like Marvell. A key concern for markets, particularly for Chinese optical module suppliers, was Tan's comment that the contribution of AI networking (e.g., Ethernet switches, optical interconnect chips) to AI revenue, currently near 40%, is expected to normalize to around 30% over time, signaling a potential peak in growth for that segment. Despite the guidance shortfall, Tan reiterated that AI demand remains "insatiable" and reaffirmed the long-term target of exceeding $100B in AI revenue by FY27. The reaction highlights the heightened sensitivity and premium valuation placed on AI-exposed stocks, where anything less than stellar guidance can prompt significant profit-taking. The broader question is whether this represents a cooling AI narrative or a correction in overstretched valuations.

marsbit1 saat önce

Broadcom's Q3 Guidance Misses Expectations by $12 Billion, After-Hours Trading Plummets Over 13%, AI Narrative "Cooling"?

marsbit1 saat önce

İşlemler

Spot
Futures
活动图片