Stop Staring at GPUs: Intel Makes a Major Move. Can It End Nvidia's Compute Monopoly?

marsbit | Published 2026-06-03 | Updated 2026-06-03

Over the past two years, AI hardware has essentially had one core focus: the GPU.

From large-model training to inference clusters, and from edge to cloud compute, the industry has been preoccupied with who can secure more GPUs and who can pack more accelerator cards into data centers. The entire AI industry has revolved around GPUs, which has in turn driven Nvidia's stock price to record highs.

However, at COMPUTEX 2026, Intel presented a different perspective: AI's next stage should not focus solely on GPUs. The core of this argument is the keyword repeatedly emphasized by CEO Pat Gelsinger in his keynote speech: Agentic AI, which we commonly refer to as agents.

Image Source: Intel

Agents Are Changing the Computing Ecosystem

The difference between agents and traditional AI is actually quite significant. Traditional AI operates like a turn-based Q&A machine, while agents are meant to integrate into real-world workflows, proactively completing cycles of "thinking, planning, acting, and reflecting." In other words, they must learn to read data, call tools, execute tasks, check results, and continuously adjust their next steps based on feedback.
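The loop described above can be sketched in a few lines of Python. This is only an illustration of the "plan, act, reflect" cycle: the function `run_agent`, its `plan`/`act`/`reflect` callables, and the toy counting task are hypothetical and are not taken from Intel's presentation.

```python
from typing import Callable, List, Tuple

def run_agent(
    goal: int,
    plan: Callable[[int, int], int],
    act: Callable[[int, int], int],
    reflect: Callable[[int, int], bool],
    max_steps: int = 10,
) -> Tuple[int, List[int]]:
    """Repeat plan -> act -> reflect until the goal is met or steps run out."""
    state = 0
    history: List[int] = []
    for _ in range(max_steps):
        step = plan(state, goal)   # think/plan: decide the next action
        state = act(state, step)   # act: call a tool or execute the task
        history.append(state)
        if reflect(state, goal):   # reflect: check the result against the goal
            break
    return state, history

# Toy task (hypothetical): reach a target number by adding at most 3 per step.
final_state, trace = run_agent(
    goal=7,
    plan=lambda state, goal: min(3, goal - state),
    act=lambda state, step: state + step,
    reflect=lambda state, goal: state == goal,
)
print(final_state, trace)
```

Even this toy task takes three passes through the loop rather than a single question and answer, which is the property that changes the load profile.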

This means AI inference is no longer a one-off request but a continuously running system that makes decisions and reasons on its own, which fundamentally changes how compute is used. Intel's core message this time is therefore that agentic AI will reshape the ratio of compute resources within data centers.

Currently, in frontier model training, the CPU-to-GPU ratio can approach 1:8, with GPUs bearing the vast majority of the computational load. In agentic inference, however, CPUs must handle task orchestration, tool invocation, data movement, and system coordination. In this scenario, the CPU-to-GPU ratio gradually moves toward 1:1, and may even call for higher CPU density so that tasks can be decomposed quickly.
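To make the shift concrete, the following sketch computes how many CPUs a fleet would need at each ratio. The 1:8 and 1:1 ratios come from Intel's framing above; the 1,024-GPU fleet size and the function name `cpus_needed` are assumptions chosen only for illustration.

```python
def cpus_needed(gpus: int, cpu_part: int, gpu_part: int) -> int:
    """CPUs required for `gpus` GPUs at a cpu_part:gpu_part ratio, rounded up."""
    return -(-gpus * cpu_part // gpu_part)  # ceiling division on integers

GPUS = 1024  # hypothetical fleet size, not a figure from Intel

training_cpus = cpus_needed(GPUS, 1, 8)  # training-style 1:8 ratio
agentic_cpus = cpus_needed(GPUS, 1, 1)   # agentic-inference 1:1 ratio

print(training_cpus, agentic_cpus, agentic_cpus // training_cpus)
```

Under these assumptions, the same GPU fleet would need eight times as many CPUs once it moves from a 1:8 to a 1:1 ratio.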

In fact, when an agent not only generates an answer but also needs to continuously invoke models, tools, and external systems, its operational state is completely different from that of traditional AI. Intel cited a statistic in the presentation: compared with single-turn inference, an agent's token consumption can increase by up to 1,000 times.

Image Source: Intel

In other words, agents do not simply increase inference volume; they bring more complex, higher-frequency, and more fragmented system loads. Handing all of these loads to GPUs would be inefficient and expensive.

The Xeon 6+ processor launched by Intel is built on the Intel 18A process, featuring up to 288 efficiency cores and equipped with up to 576MB of L3 cache. Targeting cloud-native, Agentic AI, and network-intensive workloads, it promises higher energy efficiency and more stable sustained performance.

In Intel's proposed solution, a single liquid-cooled rack with 32U of compute space provides 36,864 cores at a rack power of only about 100 kW, enough to support high-density agent deployment. While 100 kW may sound daunting, it is a significant reduction compared with earlier server racks of equivalent performance.
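The rack figures can be cross-checked with simple arithmetic. The sketch below uses only the numbers quoted above, and assumes that every core in the rack sits on a 288-core part and that the full 100 kW is divided evenly across cores (in practice the rack budget also covers memory, networking, and cooling).

```python
CORES_PER_RACK = 36_864   # from the article
CORES_PER_CPU = 288       # top Xeon 6+ core count, from the article
RACK_UNITS = 32           # compute space in U, from the article
RACK_POWER_W = 100_000    # about 100 kW, from the article

cpus_per_rack = CORES_PER_RACK // CORES_PER_CPU   # processors per rack
cpus_per_u = cpus_per_rack / RACK_UNITS           # processors per rack unit
watts_per_core = RACK_POWER_W / CORES_PER_RACK    # whole-rack power per core

print(cpus_per_rack, cpus_per_u, round(watts_per_core, 2))
```

Under these assumptions the rack holds 128 processors, or four per rack unit, at roughly 2.7 W of rack power per core.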

Beyond Xeon 6+, there's something even more noteworthy: Intel's restructuring of the inference architecture.

In the presentation, Intel announced a partnership with SambaNova, Vista Equity Partners, Cambium Capital, and others to officially launch a new, fully disaggregated inference solution. This solution runs on the Vector Core Compute Agent Cloud, where Intel Xeon 6 processors handle orchestration and execution, Nvidia Blackwell GPUs handle prefill, and SambaNova SN40 RDUs handle decoding.

Image Source: Intel

This new architecture is designed specifically for agentic workloads. Many past AI systems offloaded most of the inference pipeline to GPUs; in this system, CPUs, RDUs, and GPUs each have a specific role (system scheduling, decoding, and prefill, respectively), so that each inference phase runs on the most suitable hardware and overall efficiency is maximized.
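The division of labor can be pictured as a lookup from pipeline stage to device class. The stage-to-hardware assignments below follow the article (prefill is the stage the text calls pre-filling); the `Stage` enum, the `route` function, and the sample request are hypothetical, and a real scheduler would also weigh load, memory, and interconnect cost.

```python
from enum import Enum
from typing import List, Tuple

class Stage(Enum):
    ORCHESTRATION = "orchestration"
    PREFILL = "prefill"
    DECODE = "decode"

# Assignments as described in the article; the device labels are plain strings.
STAGE_TO_DEVICE = {
    Stage.ORCHESTRATION: "Xeon 6 CPU",
    Stage.PREFILL: "Blackwell GPU",
    Stage.DECODE: "SN40 RDU",
}

def route(stages: List[Stage]) -> List[Tuple[str, str]]:
    """Pair each stage of a request with the device class assigned to it."""
    return [(stage.value, STAGE_TO_DEVICE[stage]) for stage in stages]

# Hypothetical agent turn: plan, read the prompt, generate, then call a tool.
request = [Stage.ORCHESTRATION, Stage.PREFILL, Stage.DECODE, Stage.ORCHESTRATION]
schedule = route(request)
for stage_name, device in schedule:
    print(f"{stage_name:>13} -> {device}")
```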

Following the introduction of Xeon 6+, the recently launched 3rd Gen Core Ultra processors made another appearance. They represent another link in Intel's AI ecosystem: the core of edge-side AI. The hybrid local server that Intel and Perplexity demonstrated in the presentation was built on the 3rd Gen Core Ultra together with Xeon 6+ cloud servers.

Image Source: Intel

It can dynamically allocate workloads between the local device and the cloud based on device capability and functional requirements, further reducing reliance on cloud compute. This is also the ideal form for future AI PCs: by distributing work dynamically, it lowers token costs while preserving responsiveness and data privacy.
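A placement policy of this kind might look like the sketch below. The article does not describe the actual rules used in the Intel and Perplexity demo, so everything here is an assumption for illustration: the `Task` fields, the `place` function, and the 4,000-token local budget are all hypothetical.

```python
from dataclasses import dataclass

@dataclass
class Task:
    name: str
    est_tokens: int   # rough size of the job
    private: bool     # True if the data must not leave the device

def place(task: Task, local_token_budget: int = 4_000) -> str:
    """Return "local" or "cloud" for a task under a simple, hypothetical policy."""
    if task.private:
        return "local"   # privacy requirement wins over everything else
    if task.est_tokens <= local_token_budget:
        return "local"   # small enough for the device, so no cloud tokens are spent
    return "cloud"       # too large for the device

tasks = [
    Task("summarize personal notes", 20_000, private=True),
    Task("rename files", 500, private=False),
    Task("research report", 50_000, private=False),
]
placements = {task.name: place(task) for task in tasks}
print(placements)
```

In this sketch only the large, non-private job is sent to the cloud, which is how such a policy would cut token spend without giving up privacy.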

Beyond PCs, Intel is extending the 3rd Gen Core Ultra to gaming handhelds and edge computing. The newly announced Arc G3 series processors, based on the same architecture, are optimized for handheld gaming devices and will be available later this month, bringing the integrated graphics that handheld gamers have been waiting for.

From General-Purpose to Custom, Intel Aims to Be 'Everywhere'

Beyond general-purpose processors, Intel also emphasized custom chips this time, a business segment CEO Pat Gelsinger has been championing since taking the helm.

Intel believes the custom chip market will be vast in the future because as AI penetrates various industries, customers will become increasingly dissatisfied with general-purpose compute power. In pursuit of higher efficiency and performance, they will gradually lean towards custom chips to maintain their competitiveness.

In the presentation, Intel cited collaborations such as its work with Google on IPUs (infrastructure processing units), chips that cloud service providers rely on to improve infrastructure performance. Intel is also partnering with telecom clients like Ericsson to provide advanced wireless infrastructure chips globally.

This actually reveals another theme of Gelsinger's speech: Intel is no longer relying on a single, general-purpose chip to win the market. Instead, it's packaging chips, systems, software, and industry partnerships into a complete set of solutions that can be freely customized according to the needs of different enterprises, thereby maximizing Intel's advantages.

Image Source: Intel

From the perspective of Lei Technology, Intel is essentially redefining its position in the ecosystem: data centers need CPUs for agent orchestration; inference systems require heterogeneous disaggregation to reduce costs; PCs need local AI to handle privacy and compliance; edge and embodied intelligence require high-efficiency chips; and industry clients need customized chips.

By meeting the needs of enterprises across different fields and various points in the value chain, Intel aims to become even more "everywhere" than Nvidia.

Of course, the pressure on Intel remains immense. Nvidia's advantages in AI accelerators and software ecosystems are still evident, and AMD continues its offensive in server CPUs and AI chips. Whether Intel can successfully navigate this path ultimately depends on how quickly the 18A process reaches mass production, whether the Xeon 6+ rack-level solutions can be deployed quickly, and whether customers see significant benefits from the new architecture.

But at least this time, Intel's direction is clearer than before.

As AI enters the era of agents, competition is no longer just about the peak performance of a single chip. Instead, it is about how efficiently the entire computing system works together. GPUs remain important, but CPUs, edge devices, local AI, and custom chips are also regaining critical importance.

And what Intel aims to seize is precisely this window of opportunity where AI infrastructure is undergoing a re-division of labor.
