Why Did OpenAI Decide to Make a Phone? ChatGPT Is Taking the Permissions Apple Won't Give

marsbit · Published 2026-05-18 · Last updated 2026-05-18

Summary

The article discusses OpenAI's surprising move into developing its own AI-powered smartphone, reportedly targeting a 2027 launch. Initially driven by faith that superior AI models alone would secure its dominance—evidenced by ChatGPT's viral success—OpenAI now faces a strategic pivot. Key challenges include slower-than-expected revenue growth and competition from rivals like Anthropic's Claude Code, which successfully monetized a specific, high-value user base (developers) by deeply integrating into workflows. OpenAI recognizes that for ChatGPT to evolve from a conversational tool into a true "AI Agent" that completes tasks (e.g., booking travel, managing files), it needs direct system-level permissions and a default user interface. Currently, as a service integrated into platforms like Apple's iOS and Microsoft's Windows, ChatGPT lacks the necessary access and control ("sovereignty") over hardware, data, and user interactions. Building its own device is seen as a way to give ChatGPT its "first body"—a dedicated terminal where it can operate with full autonomy, bypassing the limitations imposed by partner ecosystems. This shift underscores a broader realization: in the AI Agent era, owning the end-user device and experience is critical to capturing value and maintaining competitive advantage, even if it means directly competing with former allies like Apple.


The last thing OpenAI should do right now is probably make a phone. But Sam Altman doesn't seem to think so.

In the first quarter of this year, OpenAI's revenue and user growth both fell short of expectations. Its rival Anthropic, with Claude Code, snatched away the group most willing to pay. Following this script, OpenAI should now be consolidating, focusing, and first proving it can make money, gearing up for an IPO by the end of this year or early next.

But the supply chain tells a different story: OpenAI is preparing to challenge the world's most mature, most closed, and most profitable consumer electronics category, the iPhone.

According to leaks, OpenAI is accelerating development of its first AI Agent phone, with mass production as early as the first half of 2027 and a target of shipping 30 million units over the next two years.

Is it crazy?

Probably not. OpenAI has likely already seen a more dangerous problem clearly: ChatGPT is smart, but it doesn't have hands.

It can answer you, but it has a hard time completing tasks for you. It lives in other people's systems—Apple's, Microsoft's, operating systems, browsers—so it can't get the real permissions.

What we want to discuss next isn't why OpenAI wants to build a phone. It's how this company came to realize, step by step, that without its own terminal device, ChatGPT can never truly break free.

ChatGPT's Success is Also a Form of Path Dependency


OpenAI's initial belief wasn't in phones, browsers, or any particular App. It believed in the model—or more precisely, in intelligence itself.

In its worldview, as long as the model is powerful enough, the entry point, product, and business model will all be pushed forward by intelligence.

This wasn't just talk. In 2020, OpenAI published its Scaling Laws paper, which has been cited frequently ever since. It established a relatively optimistic belief: by scaling up the model, data, and compute together, intelligence would improve in a predictable way.
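As a rough sketch of what "predictable" means here: the paper's headline result is usually summarized as a power law, in which test loss falls as a power of the quantity being scaled, provided the other two factors are not the bottleneck. The symbols $X_c$ and $\alpha_X$ below stand for empirically fitted constants; their values are left out rather than quoted from memory.

$$
L(X) \approx \left(\frac{X_c}{X}\right)^{\alpha_X}, \qquad X \in \{N,\ D,\ C\}
$$

Here $L$ is the test loss, $N$ the number of model parameters, $D$ the dataset size, and $C$ the training compute. Because the relationship is a straight line on a log-log plot, the gain from a tenfold increase in any one factor can be estimated before the model is trained.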

In other words, the most important thing wasn't to grab an entry point first, but to make the model even stronger. With strong enough intelligence, the world would naturally make way.

This belief was realized on November 30, 2022.

That day ChatGPT launched. It had no flashy interface, no hardware, no platform pre-installation, just an input box on a webpage. But it gave ordinary people an experience they'd never had before: you type a sentence, and it replies like a human.

The awe wasn't just that AI could talk, but that it had barely relied on any traditional entry point. No phone manufacturers pushed it, no operating system placed it in a prominent spot; users found it themselves.

Two months, 100 million monthly active users—the fastest-growing consumer application in human history.

OpenAI seemed to be right. Microsoft immediately doubled down, embedding its capabilities into Copilot, Office, and Bing; Apple also integrated ChatGPT into Apple Intelligence at WWDC 2024.

At that moment, OpenAI stood at the center of the era. The strongest model, the most users, the deepest partnerships.

But that's precisely where the problems started.

ChatGPT's success was too dazzling. So dazzling that it easily made OpenAI believe: the model itself is the entry point. It didn't need to own a phone first, nor control an operating system first—if the intelligence was impressive enough, users would come find it.

The real cracks that emerged later also started here.

Claude Code Rewrote the Rules of Monetization

The first crack came from Anthropic.

In May 2025, it released Claude Code. No flashy demo, no blockbuster launch event. This product simply entered developers' terminals, codebases, and Git workflows, helping engineers get the job done.

Six months after launch, Claude Code reached a $1 billion annual run rate; in less than a year, over $2.5 billion. By April 2026, Anthropic's overall annualized revenue crossed $30 billion.

Meanwhile, OpenAI was at $2 billion monthly revenue, or roughly $24 billion annualized.
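The annualized figure is simple arithmetic: monthly revenue multiplied by twelve. A minimal sketch, using only the numbers quoted in this article; the function name is hypothetical, and the Anthropic figure is treated as a lower bound because the article says only that it "crossed" $30 billion.

```python
def annualized_run_rate(monthly_revenue_billion: float) -> float:
    """Annualize a monthly revenue figure by multiplying it by 12."""
    return monthly_revenue_billion * 12


# Figures as quoted in the article, in billions of US dollars.
openai_monthly = 2.0          # "$2 billion monthly revenue"
anthropic_annualized = 30.0   # "crossed $30 billion" (lower bound)

openai_annualized = annualized_run_rate(openai_monthly)
gap = anthropic_annualized - openai_annualized

print(f"OpenAI annualized: ${openai_annualized:.0f}B")
print(f"Anthropic lead (at least): ${gap:.0f}B")
```

A run rate extrapolates a single month, so it says nothing about seasonality or churn; it is a snapshot, not a forecast.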

Anthropic achieved higher revenue with far fewer users than ChatGPT. This is what OpenAI should truly be worried about.

The reason is simple—it captured a group of people most willing to pay.

The question is, why was OpenAI a step slower?

Not because it couldn't see Agents. It was because ChatGPT's success was too dazzling, so dazzling that OpenAI continued moving forward with its original inertia: make a stronger model, expand the user base, find the next universal entry point.

So over these past two years, you've seen OpenAI attempt many 0-to-1 projects—GPT Store, Sora, Operator, Deep Research—all born from this line of thinking. They collectively point to one judgment: as long as the model is strong enough, new products, new entry points, new business models will naturally emerge.

But Anthropic chose a different path. It didn't first create a super entry point for everyone. Instead, it nailed Claude Code into developers' workflows, repeatedly polishing one thing—letting AI finish the job.

This is where OpenAI was slow. It wasn't that it didn't make new products, but that it didn't immediately take a single high-paying use case from 1 to 100.

Sora is a classic example. It stunned the world upon release, but video generation consumed vast amounts of compute, and user retention and the business model weren't clear enough. When OpenAI later shut down Sora, it was, in a sense, a pruning—it began to realize that creating a stunning AI demo and penetrating a high-value workflow are two different things.

Model capabilities can create highlights, but commercial efficiency comes from consistently delivering results.

At this point, OpenAI finally realized: Agent isn't an add-on feature; it's the core of the next phase of AI commercialization. ChatGPT can't just prove it's smart; it must prove it can complete tasks for users.

But when it truly starts taking over tasks, what it hits isn't the ceiling of model capability, but the ceiling of permissions.

900 Million Users: How to Turn Them into Money

OpenAI is, of course, catching up. In May 2025, it launched Codex, directly responding to Claude Code. By April 2026, Codex reached 3 million weekly active users.

But in the coding battle, OpenAI will find it hard to win back ground in the short term. Anthropic has already captured the mindshare for coding Agents, leaving the follower to play catch-up.

This is also why OpenAI began reallocating resources: shifting attention away from projects that easily create hype but struggle to close the commercial loop, and towards Agents, the enterprise market, and more fundamental research.

But what it really needs to look at is its bigger card—900 million weekly active users.

These people aren't programmers; they won't pay for code. But they all have needs: writing emails, making proposals, researching, booking travel, shopping, organizing files.

If ChatGPT can evolve from a "conversational" entry point to a "task-completing" entry point, that would be OpenAI's true commercial capability.

Imagine this scenario: you want to buy a plane ticket, tell ChatGPT the time, budget, preferences. It searches flights, compares prices, checks hotels for you, and finally gives you a confirmation button.

At that moment, part of Ctrip's value is bypassed. Price comparison, ad slots, commissions, and influence over the user's decision would all be redistributed. Buying insurance, paying credit card bills, and settling utility bills follow the same logic. As long as the Agent can complete tasks for you, OpenAI has a chance to take a cut of every transaction commission and every ad placement along the way.

This is where the true value of 900 million users lies—ChatGPT no longer just answers questions, but starts taking over task and transaction entry points.

But once AI starts handling tasks, it's no longer just a model in a chatbox. It needs to know where you are, see what's happening on your screen, and access your files, calendar, emails, and payments.

The problem thus shifts from "Is the model strong enough?" to "Who has the permissions?"

And permissions are precisely what OpenAI lacks.
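The shift from "is the model strong enough?" to "who has the permissions?" can be sketched as a set-difference check. This is an illustrative toy, not how any real operating system grants access: the permission names come from the list above, while the `Task` type, the helper functions, and both grant sets are hypothetical.

```python
from dataclasses import dataclass

# Permission names taken from the article's list; the set itself is illustrative.
ALL_PERMISSIONS = {"location", "screen", "files", "calendar", "email", "payments"}


@dataclass(frozen=True)
class Task:
    """A task an agent might be asked to complete, with the access it needs."""
    name: str
    needs: frozenset


def missing_permissions(task: Task, granted: set) -> set:
    """Return the permissions the task needs that the platform has not granted."""
    return set(task.needs) - set(granted)


def can_complete(task: Task, granted: set) -> bool:
    """A task is completable only when nothing it needs is missing."""
    return not missing_permissions(task, granted)


book_trip = Task("book travel", frozenset({"calendar", "email", "payments"}))

# Hypothetical grants: a guest app on someone else's platform vs. an own device.
hosted_grant = {"calendar"}
own_device_grant = set(ALL_PERMISSIONS)

print(can_complete(book_trip, hosted_grant))
print(sorted(missing_permissions(book_trip, hosted_grant)))
print(can_complete(book_trip, own_device_grant))
```

However capable the model behind `book_trip` is, the first check fails for reasons that have nothing to do with intelligence, which is the article's point about the ceiling of permissions.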

ChatGPT Lives in Someone Else's House

OpenAI initially thought partnerships could solve the entry point problem. Apple gave it the iPhone, Microsoft gave it Office, Windows, and enterprise customers. At the time, this looked like a victory for OpenAI's faith in models.

But with the advent of the Agent era, the problem changed.

In Apple's world, ChatGPT is an external expert that gets called upon. It can answer questions, but it cannot truly take over the screen, camera, notifications, payments, and files—Apple won't hand over these permissions. Otherwise, the iPhone's "soul" would no longer belong to Apple.

It's similar with Microsoft. In the past, OpenAI provided the model, and Microsoft was responsible for integrating AI into entry points like Office. But when OpenAI itself started making Codex and enterprise Agents, it stepped onto Microsoft's turf—Agents inherently enter workflows, write code, process files, and complete tasks for employees, which is exactly the core of Microsoft's sovereignty.

So, the relationship between OpenAI and Microsoft hasn't broken immediately, but the boundaries have shifted. In April 2026, the two parties renegotiated their agreement, with Microsoft's exclusive licensing becoming non-exclusive, allowing OpenAI to serve customers on any cloud.

The meaning of this is clear: OpenAI doesn't want to be just a supplier within the Microsoft ecosystem. It wants to face customers itself, deliver Agents itself, and capture entry points itself.

At this point, its relationships with Apple and Microsoft become delicate. Because what an Agent needs isn't a showcase spot, but the default entry point, system permissions, and the intelligent terminal the user interacts with first every day.

These things, Apple won't give, and Microsoft won't either. They can't.

Ultimately, ChatGPT is strong, but it always lives in someone else's house—Apple's house, Microsoft's house, the browser's house, the operating system's house. It can be called upon, integrated, and be a great supplier, but it can't decide when it appears, nor what permissions it gets.

Of all the possible entry points, the phone is the one closest to OpenAI's existing strengths. 900 million weekly active users are already willing to hand questions to ChatGPT, and migrating that habit to a device is a shorter path than building an operating system or a browser from scratch.

It's not trying to make another iPhone filled with Apps, but a phone dedicated to Agents—a body that allows ChatGPT to see, call upon, and execute tasks.

This is also why, in May 2025, OpenAI spent approximately $6.5 billion to acquire Jony Ive's hardware company. Ive was the industrial designer behind the original iPhone and one of the most important figures beside Steve Jobs. OpenAI sought him out not just to make a beautiful piece of hardware, but to redefine personal devices for the AI era.

Returning to the opening question: why would a large language model company make a phone?

What OpenAI wants isn't a phone; it's sovereignty.

It wants to find a default entry point that belongs to ChatGPT itself. But making a phone essentially pushes OpenAI into direct opposition with Apple. In the past, Apple could treat ChatGPT as a supplier; if OpenAI truly makes a phone for the AI era, it's no longer a supplier but a competitor to Apple for personal entry points.

Looking back over these past few years, OpenAI's story has actually undergone a reversal.

It once believed that if the model was strong enough, the world would actively reorganize itself around intelligence. ChatGPT's explosion did prove this—it had no hardware, no pre-installation, just a webpage input box, and pulled hundreds of millions of users into the AI era.

But when the Agent era arrived, OpenAI found it still lacked the most crucial thing: sovereignty.

ChatGPT's success was a victory, but also a form of path dependency. It made OpenAI believe for too long that the model itself was the answer. It wasn't until Claude Code reached a $2.5 billion annual run rate, and until Apple and Microsoft were unwilling to hand over system permissions, that OpenAI realized: no matter how strong the model is, it still needs to capture the entry point, permissions, and tasks.

So, when OpenAI makes a phone, what it truly wants to make isn't a phone; it's ChatGPT's first body.

This article is from the WeChat public account "Pixel 301", author: Pixel 301

Related Questions

Q: What is the primary reason OpenAI is reportedly developing an AI phone, according to the article?

A: To gain sovereign control over a default entry point and system permissions for ChatGPT, allowing it to function as a true AI Agent that can execute tasks, rather than relying on other platforms like Apple or Microsoft, which limit its access.

Q: What key competitor move exposed a flaw in OpenAI's growth strategy, and how did it outperform ChatGPT financially?

A: Anthropic's Claude Code targeted the high-paying developer workflow and passed a $2.5 billion annual run rate in less than a year by focusing on completing tasks. By April 2026, Anthropic's overall annualized revenue had crossed $30 billion, while OpenAI, with far more users, was at roughly $24 billion annualized. This highlights that user scale doesn't automatically translate into commercial efficiency in specific high-value workflows.

Q: What strategic shift did OpenAI's acquisition of Jony Ive's hardware firm represent?

A: It represented a move beyond being just a model provider towards redefining personal AI devices. OpenAI sought Ive's expertise to build not just a smartphone, but a dedicated "body" or terminal for its AI Agent, ChatGPT, giving it its own sovereign interface.

Q: How did the success of ChatGPT create a "path dependency" or strategic blind spot for OpenAI?

A: ChatGPT's viral, hardware-independent success led OpenAI to believe that superior model intelligence alone was sufficient to attract users and define new entry points. This made it slower to prioritize deep, commercially closed-loop Agent capabilities for specific high-value tasks, as Anthropic demonstrated.

Q: Why can't partnerships with Apple or Microsoft solve OpenAI's need for system-level permissions in the Agent era?

A: Because neither Apple nor Microsoft will relinquish core system permissions (such as screen control, notifications, payments, or deep file access) to an external AI. These permissions define platform sovereignty. For ChatGPT to act as a true task-executing Agent, it needs these permissions, which requires its own controlled hardware entry point.
