ByteDance Adopts Arm CPUs, Jensen Huang: So Sad I Didn't Buy Arm

marsbit2026-06-04 tarihinde yayınlandı2026-06-04 tarihinde güncellendi

Özet

**Summary:** At Computex 2026, Arm CEO Rene Haas announced that ByteDance and Oracle have adopted Arm's self-designed Arm AGI data center CPU. The company expects significant revenue growth from this product, projecting $20 billion in demand for the 2027/2028 fiscal years. Haas noted that restricting AI-capable CPUs from the US to China is nearly impossible due to their widespread applications. Arm's stock has surged dramatically this year, notably rising 16% after NVIDIA's Arm-based Vera CPU and RTX Spark announcements. A highlight was the informal, humorous on-stage conversation between Haas and NVIDIA CEO Jensen Huang. Huang joked about NVIDIA's failed attempt to acquire Arm and playfully lamented selling his Arm shares. Both executives showed a clear sense of camaraderie and shared regret over the missed merger. Key technical topics were discussed: 1. **AI PC Design:** Huang explained NVIDIA's RTX Spark superchip (with a 20-core Arm CPU) is designed for future AI agents that will autonomously run and use tools on PCs, blending local and cloud processing. 2. **Agent vs. OS:** Huang emphasized the operating system remains crucial, as AI agents rely on its APIs and tools to function. 3. **Growth Constraints:** He identified the shift to "useful AI" that generates profitable tokens as a primary driver for immense, almost limitless, computational demand. Haas outlined Arm's strategy across PC and data centers. For PCs, Arm collaborates with partners like NVIDIA and Medi...

Chip World, June 3rd report, Arm CEO Rene Haas delivered a keynote speech during Computex 2026 yesterday, announcing that ByteDance and Oracle have adopted Arm's self-developed data center CPU chip, Arm AGI.

Last month, Arm doubled its demand forecast for the Arm AGI CPU, expecting it to reach $20 billion (approximately RMB 135 billion) in fiscal years 2027 and 2028, while also projecting the product will generate $150 billion (approximately RMB 1,016 billion) in annual revenue in about five years.

In an interview with foreign media yesterday, Rene Haas stated that it's 'almost impossible' for the U.S. to block AI CPU exports to China because AI CPUs are widely used, making it difficult to determine which CPUs are specifically for AI, unlike AI chips where specific performance thresholds and memory bandwidth limits can be set.

On Monday, NVIDIA released the Arm-based RTX Spark superchip and Vera data center CPU. Arm's stock price surged that night, closing Tuesday with a 16% gain. So far this year, Arm's stock price has accumulated a 263% increase.

NVIDIA founder and CEO Jensen Huang also popped into Rene Haas's Tuesday speech. Taking the stage, he joked, 'Look at his stock price. Every time I release a product, his stock goes up, while mine does nothing.'

Rene Haas cleverly retorted, 'You were a shareholder, and then you sold your shares.'

Jensen Huang immediately took the bait, 'Yes, yes, oh man, I needed cash.'

The two, evidently old friends, chatted warmly for 15 minutes, frequently improvising comedic sketches, throwing and catching jokes that had the audience roaring with laughter, both often laughing wide enough to show all their teeth.

This was truly the liveliest atmosphere I've seen in a recent tech industry conversation.

For example, after lavishing praise on Arm CPUs, Jensen Huang summarized, 'The keywords are 'Arm is perfect.'

Rene Haas responded, 'Another keyword is 'thank you.''

Jensen Huang immediately switched to Mandarin, 'Nǎlǐ, nǎlǐ, bùyào kèqi la.' (Meaning 'Not at all, not at all, don't be polite.')

Then Rene Haas quipped, 'Now this competition is unfair.' (Implying it's unfair for Jensen to speak Mandarin)

Jensen Huang then considerately added in English, 'You're welcome.'

Jensen Huang also joked that 'One of Arm's greatest advantages is not having to worry about supply chain issues.' The supply chain for IP is electrons, and you can have as many electrons as you want.

'So I love their business model,' Jensen Huang began reminiscing. 'You know, I tried. I tried to become Arm. Rene and I worked together before, and then we tried to partner again, but that's no big deal. I'm still very sad.'

Rene Haas said, 'If the two companies merged, we would become the world's largest company.'

'I like that,' Jensen Huang laughed. 'That's a great idea.'

It seems both are full of regret about NVIDIA's failed acquisition of Arm.

Finally, during the gift-giving segment, Rene Haas played a 'nostalgia card,' giving Jensen Huang a Microsoft Surface RT laptop equipped with an NVIDIA Tegra 3 chip, even mimicking Jensen Huang's signature on it.

The NVIDIA Tegra 3 was the world's first Arm mobile quad-core processor launched by NVIDIA years ago.

Jensen Huang pointed at the photo on the big screen and boasted, 'What happened when we were young? I have to say, I think I look younger. Do you agree? I think I've aged rather well.'

Rene Haas laughed until his image blurred.

Then Jensen Huang snatched the gift, his tone rising, 'This is for me? If I sign it and give it back, it becomes a treasure.'

Rene Haas said, 'No, you sign it and give it back to me, there's a contract, an invoice here, we can't do that. We know that game.'

Returning to serious industry topics, during this speech, Rene Haas asked Jensen Huang several key questions:

1. Why make the RTX Spark?

2. How to balance local agents and cloud agents?

3. Can agents truly work independently, detached from the underlying OS?

4. What does Jensen Huang see as growth constraints in the coming years?

Jensen Huang also painted a grand picture for market development: currently, the computer industry is limited by the number of people using computers. With agents that can autonomously use computers, we will no longer have a billion people using computers, but tens of billions, possibly even more agents, robots, and autonomous vehicles using computers than humans.

So the question is, how big can the computer product scale become?

'I feel that, by now, the outcome is decided. This multi-trillion-dollar industry might be ten times larger; we are on our way.' Jensen Huang said.

Rene Haas also shared the latest progress and future plans for Arm in the Agent PC and data center CPU fields.

He mentioned in passing that he chatted with TSMC Chairman and CEO C.C. Wei and Senior Vice President & COO C.J. Chou this week, and they said they have never seen the semiconductor industry cycle so prosperous for four consecutive years.

01. Jensen Huang's Mini-Lecture: How to Design Agent PCs?

Jensen Huang answered Rene Haas's key questions one by one. These insights are quite instructive for the upcoming development of AI PCs and chip design thinking.

1. Why make the RTX Spark product?

PCs and operating systems have existed for 40 years. Manual programming will be replaced by agent applications that will use tools on the PC. So, how should we restructure the architecture, change the OS, and reinvent the computer in the future?

NVIDIA realized that agent systems need excellent CPUs, which is why Arm was adopted.

The RTX Spark superchip comes with a 20-core CPU, excellent single-thread performance, and memory needs to store many parameters.

So, NVIDIA created a new data format called NVFP4 to compress the massive language, build as many models as possible, and integrate very intelligent AI into the system memory.

NVIDIA also wanted to combine CUDA and CUDA Tile for accelerated computing, integrating tensor core processing into a single processor.

2. How to balance locally running agents and cloud-running agents?

These Arm PCs will become autonomous agents that run continuously.

Today, if you leave your laptop at home or in a hotel, you can't use it.

But in the future, you can just pick up your phone, remotely talk to your PC, and command the agent to work.

Jensen Huang said, 'The essence of personal computing devices is that you can do anything with that device without spending time.'

If you need to use some cloud APIs, just call the cloud APIs. Whatever can be done locally, do it on the computer.

3. Is the operating system important for running agents? If we view the agent as the OS, can it truly work independently and not heavily rely on the underlying OS?

The importance of the operating system is not diminished at all; it might even be more important than before.

This is also a controversial point often mentioned when AI appears—'software is dead.' Jensen Huang thinks nothing could be further from the truth.

People might only know 10-20% of many tool functions.

But now, you can tell the agent what you want.

The agent knows very well how to use these tools because it has read the Skills file. Skills essentially involve reading the user manual for that tool, so it now uses the MCP or CLI connected to this tool, unlocking all these tools to meet your needs.

These tools will be more valuable than ever. They run on an OS, so we need Windows, we need these APIs and tools for a long time to come.

4. What are the constraints on growth in the coming years?

'We see constraints in almost every direction.' Jensen Huang said NVIDIA planned ahead, did supply chain planning well, grew nearly 100% year-over-year this year, will achieve very rapid growth next year, and the supply chain can support NVIDIA's growth.

But demand is actually higher.

Jensen Huang talked about how the new computing application paradigm truly requires a new architecture. Now, a major breakthrough is that agents can produce practical AI, which is why everyone's growth is so incredible.

When AI becomes practical, the tokens generated can bring profit. When token generation is profitable, everyone wants to generate trillions more tokens.

Now, AI is not just a chatbot that can answer questions; it can think, use tools, read, continue thinking, plan, try—the number of tokens that need to be generated has increased massively. The profitability of tokens is driving compute demand, creating a compounding effect.

02. Arm PC Chips: Apple, Google, Qualcomm All Say Good, Close Collaboration with NVIDIA and MediaTek

In the PC field, Google, Apple, NVIDIA, Qualcomm, etc., have developed PC chips based on the Arm architecture. Arm has collaborated with Apple, Google, Microsoft, and others for decades.

Rene Haas discussed that Arm is honored to collaborate with NVIDIA on developing the Arm-based RTX Spark superchip. The custom Grace CPU in this chip has 20 cores, each based on the Arm architecture.

'I believe this is the highest number of CPU cores in any laptop on the market today.' Rene Haas said that when you pair it with the Blackwell GPU, this chip achieves 1 PFLOPS of FP4 AI performance, along with 128GB of unified memory capacity, full native support for the Arm platform's Windows OS.

Arm's role here is to closely collaborate with NVIDIA and MediaTek using Arm's Compute Subsystem (CSS) strategy.

The Compute Subsystem combines all the components needed to build a custom SoC (CPU, GPU, system IP, memory controller) to form a complete end-solution system.

Arm collaborated with MediaTek to accomplish this, and MediaTek can provide the complete solution.

Rene Haas also displayed Arm's CSS roadmap for Agent PCs. The next generation will optimize custom CPU cores specifically designed for PCs.

03. Arm's Self-Developed Agent CPU: OpenAI, ByteDance Are Partners

Rene Haas said over 25 billion Arm chips have been made in Taiwan. The Arm AGI, Arm's first self-developed CPU launched in March this year, is produced by Taiwan's TSMC.

The Arm AGI CPU is specifically built for AI agent infrastructure. It uses TSMC's 3nm process, a dual-chiplet design, with a single CPU integrating 136 high-performance Arm Neoverse V3 cores, equipped with 2MB L2 cache per core, supporting up to 3.7 GHz frequency, providing 6GB/s memory bandwidth per core, memory latency below 100ns, featuring 96 lanes of PCIe Gen 6 interface, supporting CXL 3.0 protocol, with a TDP of 300W.

Arm AGI CPU partners include OpenAI, Meta, Cerebras, SAP, SK telecom, Rebellions, etc. Rene Haas announced that market demand for this chip is stronger than at launch, with Oracle and ByteDance also joining the family, validating that the Arm AGI CPU can solve real-world problems.

Of course, not all companies want to buy the Arm AGI CPU. For companies interested in developing their own chips, Arm offers a variety of IP and Compute Subsystems (CSS), committed to providing any solution customers would like to see succeed.

In the data center, the Axion CPU connected to Google's latest AI chips TPU 8t and TPU 8i is a chip using Arm Neoverse technology, reducing power consumption by up to 60% compared to x86 CPUs with equivalent performance.

Amazon's self-developed CPU, Graviton, also uses Arm architecture. Amazon CEO Andy Jassy once revealed, 'Two large customers asked if they could buy all our Graviton instances for 2026.'

NVIDIA also just released the new Vera CPU based on Arm this week.

Arm plans to make self-developed CPUs a long-term endeavor and displayed a three-year roadmap.

The second-generation Arm AGI CPU is already in development, featuring more cores, higher energy efficiency, and better performance than the previous generation.

The third-generation Arm AGI CPU is also coming soon.

All of these are based on the Compute Subsystem Arm intends to deliver alongside the chips.

04. Conclusion: After Agent Explosion, Compute Competition Spotlight Turns to CPU

This week's speeches by chip industry leaders like Jensen Huang, Lip-Bu Tan, and Rene Haas reflect some common trends in the CPU industry—agents are changing computing logic, opening a brand new door of market opportunities for CPUs.

In recent years, the focus of compute competition has mainly been on GPUs, indispensable for AI training. However, with the explosion of agent applications this year, agent inference demand is booming, requiring massive state management, tool calling, and process orchestration—these are CPU-intensive tasks.

Intel and AMD continue to consolidate x86 processors' advantages in the data center. Emerging players, including Amazon, Google, NVIDIA, etc., are largely betting on the Arm CPU route. Even Arm made a decision 'against its ancestors,' formally entering the data center CPU market this year.

A very interesting phenomenon is the formation of a new vertical integration trend in the chip industry.

Chip giants with rich product lines like NVIDIA, Intel, and AMD are increasingly emphasizing their full-stack nature. The advantages they tout for their solutions also converge: higher energy efficiency, having everything you need, and saving more money.

Leading companies across industries are also 'crossing boundaries': cloud giants extend downwards to self-develop chips, chip companies extend upwards to make system solutions, and semiconductor IP companies extend upwards to develop chips.

In the context where tokens become the new competitive currency and compute demand grows explosively, how to generate more effective compute per watt will be the main theme of chip competition going forward.

This article is from the WeChat public account 'Chip World' (芯东西), author: ZeR0, editor: Mo Ying

İlgili Sorular

QWhich two major companies have adopted Arm's self-developed data center CPU chip Arm AGI, as announced by Arm CEO Rene Haas?

AByteDance and Oracle.

QWhat humorous exchange did Jensen Huang and Rene Haas have regarding Arm's stock price and NVIDIA's potential acquisition of Arm?

AJensen Huang joked that every time he releases a new product, Arm's stock price rises while his (NVIDIA's) does nothing. Rene Haas responded by noting that Huang was once a shareholder but sold his shares. Huang quipped back, "Yes, yes, well, I needed the cash," indicating mutual regret over NVIDIA's failed attempt to acquire Arm.

QWhat are the four key questions Rene Haas asked Jensen Huang during their conversation?

A1. Why did NVIDIA create RTX Spark? 2. How to balance local agents and cloud agents? 3. Can agents truly work independently, without relying on the underlying operating system? 4. What does Jensen Huang see as the constraints on growth in the coming years?

QWhat is the strategic significance of Arm entering the data center CPU market with its self-developed Arm AGI CPU, according to the article?

AThe article suggests that the explosion of AI agent applications has shifted the focus of computing competition. While GPUs were central for AI training, agent inference requires significant state management, tool invocation, and process orchestration, which are CPU-intensive tasks. Arm's entry into the data center CPU market, alongside other players like NVIDIA, Amazon, and Google betting on Arm architecture, highlights a new vertical integration trend and a key industry focus on maximizing effective computation per watt.

QWhat are some of the technical specifications of the first-generation Arm AGI CPU mentioned in the article?

AThe Arm AGI CPU is manufactured using TSMC's 3nm process, features a dual-chiplet design, integrates 136 Arm Neoverse V3 high-performance cores, has 2MB L2 cache per core, supports a 3.7GHz clock speed, provides 6GB/s memory bandwidth per core, has memory latency below 100ns, uses a 96-lane PCIe Gen 6 interface, supports CXL 3.0 protocol, and has a TDP of 300W.

İlgili Okumalar

Ten-Thousand-Word Analysis: From $10 to $290, MRVL Wins the Entire AI Era by 'Not Making GPUs'

Marvell Technology's stock price surged from under $10 in 2016 to a record $290 in June 2026, fueled not by making GPUs, but by dominating AI infrastructure connectivity. This analysis argues the market misvalues MRVL as merely a smaller Broadcom in custom AI chips, overlooking its true, unique position. Marvell's core strength lies in enabling high-speed data flow for AI clusters through three interconnected businesses. First, it holds a commanding ~70% market share in high-speed optical DSPs (essential for data center light modules), a deep-moat business with accelerating growth. Second, its custom AI chip design business serves hyperscalers like AWS, Microsoft, and Google, with a significant revenue pipeline despite lower margins. Third, stable cash flows come from Ethernet switch chips and enterprise storage controllers. Together, they form a full-stack "AI data movement" platform. CEO Matt Murphy's transformative leadership since 2016, involving strategic divestments, key acquisitions (like Inphi for optical DSPs), and securing long-term agreements with major cloud providers, repositioned the company. A pivotal $2 billion strategic investment from NVIDIA in 2026 underscored Marvell's critical role in the AI ecosystem, particularly through collaborations like NVLink Fusion. While Marvell faces risks—including client concentration (losing the Amazon Trainium3 design), lower-margin business mix, competitive threats, insider selling, and complex supply chains—its fundamentals remain strong. The optical interconnect moat is widening with the acquisition of Celestial AI (photonics fabric), and financial metrics show accelerating revenue growth and operating leverage. With a PEG ratio suggesting undervaluation relative to its growth, the thesis is that the market undervalues Marvell's monopolistic position in AI "plumbing" while overemphasizing its competitive custom chip segment. The story transcends investing, symbolizing how in any complex system—from the internet to AI—the value of "connection" ultimately surpasses that of individual "nodes."

marsbit27 dk önce

Ten-Thousand-Word Analysis: From $10 to $290, MRVL Wins the Entire AI Era by 'Not Making GPUs'

marsbit27 dk önce

AI Relay Stations Spark Heated Debate on Zhihu: Behind Cheap Tokens, What Are Users Really Worried About?

A discussion on Zhihu about "AI relay stations" shifted the niche developer topic of "cheap tokens" into broader user awareness. Users moved beyond simply questioning the legitimacy of these services to focus on practical concerns: Where do cheap tokens truly come from? Is the model being accessed the real one? Can relay stations see prompts, code, and API keys? For occasional users, are the risks worth it? The core debate centered less on price and more on trust. A primary worry is model authenticity—the risk of "model swapping," where users paying for a premium model might be routed to a cheaper one, creating an information asymmetry. Others argued that cost comparisons matter; while cheaper than official pay-as-you-go APIs, relay stations may not be the lowest-cost option versus subscriptions, domestic models, or free tiers, making user needs assessment crucial. Speculation about token sources ranged from legitimate bulk discounts to gray-area methods like account sharing or exploiting regional pricing. This opacity makes risk assessment difficult for users. Data security emerged as a critical concern, especially for enterprise use. When processing sensitive information like code, contracts, or client data, the inability to verify a relay station's data handling, retention, or access policies poses significant compliance and confidentiality risks. The evolving consensus suggests relay stations can be used cautiously for low-sensitivity, disposable tasks (e.g., summarizing public info, simple translation). However, they should not be the default for sensitive, professional, or production workflows involving proprietary data, Agents, or automated systems. Recommendations include avoiding large prepayments, not relying on a single service, using test prompts to monitor quality, anonymizing data where possible, and keeping official channels as backups. Ultimately, the discussion framed tokens not just as a billing unit but as a measure of real cost encompassing price, model integrity, data security, and service stability. The popularity of relay stations highlights user demand for affordable access, but the debate underscores a key trade-off: the savings from cheap tokens may come at the price of trust, transparency, and control over one's data and AI experience.

marsbit57 dk önce

AI Relay Stations Spark Heated Debate on Zhihu: Behind Cheap Tokens, What Are Users Really Worried About?

marsbit57 dk önce

In-Depth Research Report on TradFi: The Convergence Wave of Crypto and Traditional Finance

In 2026, the crypto industry is undergoing a profound infrastructure-level transformation—TradFi assets are migrating on-chain at an unprecedented pace. According to CoinGecko's Q1 2026 report, the total value locked (TVL) of tokenized real-world assets (RWA) has surpassed $31 billion, a nearly 4x increase from $7.8 billion at the beginning of 2025, with the sector’s aggregate market capitalization reaching $19.3 billion. Among these, the market cap of tokenized stocks surged from $2 million to $486 million, with Q1 spot trading volume reaching $15.1 billion—a single quarter already surpassing the entire second half of 2025. RWA perpetual contract Q1 trading volume reached a staggering $524.8 billion, far exceeding the $313 billion for all of 2025. Meanwhile, BlackRock's BUIDL fund has reached $2.3 billion in scale and has filed for two new tokenized funds, signaling that the world's largest asset manager's tokenization strategy is evolving from pilot to product suite expansion. HTX, as a core participant in the crypto exchange sector, officially launched TradFi perpetual futures products including NVDA, AAPL, MSFT, META, and SPY in 2026, enabling crypto users to gain 24/7 trading access to core U.S. equities. Boston Consulting Group predicts that global tokenized asset scale could reach $16 trillion by 2030, while McKinsey offers a conservative estimate of approximately $2 trillion. The on-chain migration of TradFi assets is no longer a "future narrative" but a structural transformation unfolding in real time, as crypto exchanges evolve from single crypto asset trading platforms toward "multi-asset-class trading infrastructure."

HTX Learn1 saat önce

In-Depth Research Report on TradFi: The Convergence Wave of Crypto and Traditional Finance

HTX Learn1 saat önce

İşlemler

Spot
Futures

Popüler Makaleler

CHIP Nasıl Satın Alınır

HTX.com’a hoş geldiniz! USD.AI (CHIP) satın alma işlemlerini basit ve kullanışlı bir hâle getirdik. Adım adım açıkladığımız rehberimizi takip ederek kripto yolculuğunuza başlayın. 1. Adım: HTX Hesabınızı OluşturunHTX'te ücretsiz bir hesap açmak için e-posta adresinizi veya telefon numaranızı kullanın. Sorunsuzca kaydolun ve tüm özelliklerin kilidini açın. Hesabımı Aç2. Adım: Kripto Satın Al Bölümüne Gidin ve Ödeme Yönteminizi SeçinKredi/Banka Kartı: Visa veya Mastercard'ınızı kullanarak anında USD.AI (CHIP) satın alın.Bakiye: Sorunsuz bir şekilde işlem yapmak için HTX hesap bakiyenizdeki fonları kullanın.Üçüncü Taraflar: Kullanımı kolaylaştırmak için Google Pay ve Apple Pay gibi popüler ödeme yöntemlerini ekledik.P2P: HTX'teki diğer kullanıcılarla doğrudan işlem yapın.Borsa Dışı (OTC): Yatırımcılar için kişiye özel hizmetler ve rekabetçi döviz kurları sunuyoruz.3. Adım: USD.AI (CHIP) Varlıklarınızı SaklayınUSD.AI (CHIP) satın aldıktan sonra HTX hesabınızda saklayın. Alternatif olarak, blok zinciri transferi yoluyla başka bir yere gönderebilir veya diğer kripto para birimlerini takas etmek için kullanabilirsiniz.4. Adım: USD.AI (CHIP) Varlıklarınızla İşlem YapınHTX'in spot piyasasında USD.AI (CHIP) ile kolayca işlemler yapın.Hesabınıza erişin, işlem çiftinizi seçin, işlemlerinizi gerçekleştirin ve gerçek zamanlı olarak izleyin. Hem yeni başlayanlar hem de deneyimli yatırımcılar için kullanıcı dostu bir deneyim sunuyoruz.

294 Toplam GörüntülenmeYayınlanma 2026.04.21Güncellenme 2026.06.02

CHIP Nasıl Satın Alınır

Tartışmalar

HTX Topluluğuna hoş geldiniz. Burada, en son platform gelişmeleri hakkında bilgi sahibi olabilir ve profesyonel piyasa görüşlerine erişebilirsiniz. Kullanıcıların CHIP (CHIP) fiyatı hakkındaki görüşleri aşağıda sunulmaktadır.

活动图片