Stop Staring at GPUs, Intel Unleashes a Major Move—Can It End Nvidia's Computing Power Monopoly?

marsbitОпубликовано 2026-06-03Обновлено 2026-06-03

Over the past two years, AI hardware has essentially had one core focus: the GPU.

From large model training to inference clusters, and from edge to cloud computing power, the entire industry has been discussing who can secure more GPUs and who can pack more compute cards into data centers. It's fair to say the entire AI industry has been revolving around GPUs, which has also driven Nvidia's stock price to record highs.

However, at COMPUTEX 2026, Intel presented a different perspective: AI's next stage should not focus solely on GPUs. The core of this argument is the keyword repeatedly emphasized by CEO Pat Gelsinger in his keynote speech: Agentic AI, which we commonly refer to as agents.

Image Source: Intel

Agents Are Changing the Computing Ecosystem

The difference between agents and traditional AI is actually quite significant. Traditional AI operates like a turn-based Q&A machine, while agents are meant to integrate into real-world workflows, proactively completing cycles of "thinking, planning, acting, and reflecting." In other words, they must learn to read data, call tools, execute tasks, check results, and continuously adjust their next steps based on feedback.

This means AI inference is no longer a "one-off deal" but becomes a continuously running system of self-decision-making and self-reasoning, fundamentally changing how computing power is utilized. Therefore, Intel's core message this time is: Agentic AI will reshape the compute power ratio within data centers.

Currently, in the cutting-edge model training phase, the CPU-to-GPU ratio can approach 1:8, with GPUs bearing the vast majority of the computational load. However, upon entering an agentic inference mode, CPUs need to handle task orchestration, tool invocation, data migration, and system coordination. In this scenario, the CPU-to-GPU ratio will gradually move towards 1:1, or may even require higher CPU density to rapidly decompose tasks.

In fact, when an agent not only generates an answer but also needs to continuously invoke models, tools, and external systems, its operational state is completely different from that of traditional AI. Intel mentioned a statistic in the presentation: compared to single-turn inference, an agent's Token consumption can increase by up to 1000 times.

Image Source: Intel

In other words, agents don't bring simple growth in inference volume, but rather more complex, higher-frequency, and more fragmented system loads. Throwing all these loads at GPUs for resolution would be inefficient and expensive.

The Xeon 6+ processor launched by Intel is built on the Intel 18A process, featuring up to 288 efficiency cores and equipped with up to 576MB of L3 cache. Targeting cloud-native, Agentic AI, and network-intensive workloads, it promises higher energy efficiency and more stable sustained performance.

In Intel's proposed solution, a single liquid-cooled rack occupying 32U of compute space can provide 36,864 cores; the rack power consumption is only about 100kW, sufficient to support high-density agent deployment. While 100kW may sound daunting, compared to previous server racks with equivalent performance, power consumption has already been significantly reduced.

Beyond Xeon 6+, there's something even more noteworthy: Intel's restructuring of the inference architecture.

In the presentation, Intel announced a partnership with SambaNova, Vista Equity Partners, Cambium Capital, and others to officially launch a new, fully disaggregated inference solution. This solution runs on the Vector Core Compute Agent Cloud, where Intel Xeon 6 processors handle orchestration and execution, SambaNova SN40 RDUs are responsible for decoding, and finally, NVIDIA Blackwell GPUs handle pre-filling.

Image Source: Intel

This new architecture is specifically designed for agentic workloads. Unlike many past AI systems that tended to offload most of the work in the inference pipeline to GPUs, in this system, CPUs, RDUs, and GPUs will each have their specific roles—handling system scheduling, decoding, pre-filling, and other different stages respectively—ensuring each inference phase runs on the most suitable hardware to maximize efficiency.

Following the introduction of Xeon 6+, the recently launched 3rd Gen Core Ultra processors also made another appearance. They represent another link in Intel's AI ecosystem—the core of edge-side AI. In the presentation, the hybrid local server demonstrated by Intel and Perplexity was precisely built on the 3rd Gen Core Ultra and Xeon 6+ cloud servers.

Image Source: Intel

It can dynamically allocate workloads between the local device and the cloud based on device capability and functional requirements, further reducing reliance on cloud computing power. This is also the ideal form for future AI PCs: by dynamically distributing performance, it lowers Token costs while ensuring task immediacy and data privacy.

Beyond PCs, Intel is extending the 3rd Gen Core Ultra to gaming handhelds and edge computing. The newly announced Arc G3 series of processors are optimized for handheld gaming devices based on the same architecture and will be available later this month (the integrated graphics card most anticipated by handheld gamers is coming).

From General-Purpose to Custom, Intel Aims to Be 'Everywhere'

Beyond general-purpose processors, Intel also emphasized custom chips this time, a business segment CEO Pat Gelsinger has been championing since taking the helm.

Intel believes the custom chip market will be vast in the future because as AI penetrates various industries, customers will become increasingly dissatisfied with general-purpose compute power. In pursuit of higher efficiency and performance, they will gradually lean towards custom chips to maintain their competitiveness.

In the presentation, Intel mentioned collaborations, such as with Google to launch IPUs—chips crucial for cloud service providers to enhance infrastructure performance. Intel is also partnering with telecom clients like Ericsson to provide advanced wireless infrastructure chips globally.

This actually reveals another theme of Gelsinger's speech: Intel is no longer relying on a single, general-purpose chip to win the market. Instead, it's packaging chips, systems, software, and industry partnerships into a complete set of solutions that can be freely customized according to the needs of different enterprises, thereby maximizing Intel's advantages.

Image Source: Intel

From the perspective of Lei Technology, Intel is essentially redefining its position in the ecosystem: data centers need CPUs for agent orchestration; inference systems require heterogeneous disaggregation to reduce costs; PCs need local AI to handle privacy and compliance; edge and embodied intelligence require high-efficiency chips; and industry clients need customized chips.

By meeting the needs of enterprises across different fields and various points in the value chain, Intel aims to become even more "everywhere" than Nvidia.

Of course, the pressure on Intel remains immense. Nvidia's advantages in AI accelerators and software ecosystems are still evident, and AMD continues its offensive in server CPUs and AI chips. For Intel to successfully navigate this path, it ultimately depends on the mass production speed of the 18A process, whether the Xeon 6+ rack-level solutions can be deployed quickly, and whether customers can truly see significant benefits from this new architecture.

But at least this time, Intel's direction is clearer than before.

It can be said that as AI enters the era of agents, competition is no longer just about comparing the peak performance of a single chip. Instead, it involves the collaborative efficiency optimization of the entire computing system. GPUs remain important, but CPUs, edge devices, local AI, and custom chips are also regaining critical importance.

And what Intel aims to seize is precisely this window of opportunity where AI infrastructure is undergoing a re-division of labor.

Похожее

After Marvell's 32% Surge, the Chinese Chip Family Behind It Emerges

The stock price of Marvell Technology surged 32.5% on June 2nd, driven by NVIDIA CEO Jensen Huang highlighting its custom ASICs and optical interconnects as core to AI data center architecture. This event brought attention to the Chinese semiconductor family behind Marvell: the Dai siblings. The story centers on three siblings, all UC Berkeley graduates, whose three-decade entrepreneurial journey aligns with major semiconductor industry shifts. In 1995, youngest sister Dai Wei Li co-founded Marvell with her husband Sehat Sutardja and his brother, focusing on storage controllers. Eldest brother Dai Wei Min founded EDA company Ultima, later sold to Cadence, and later founded VeriSilicon (芯原) in China, becoming a leading semiconductor IP provider. Second brother Dai Wei Jin co-founded EDA firm Silicon Perspective (sold to Cadence) and GPU IP company Vivante, later acquired by VeriSilicon. The combined "Dai-Sutardja" family network extends beyond Marvell. Their ventures and investments form a comprehensive ecosystem for the post-Moore's Law, chiplet era. Key holdings include: Dream Big Semiconductor (AI SuperNICs, acquired by Arm), Alphawave (high-speed SerDes IP, acquired by Qualcomm), and Silicon Box (a chiplet advanced packaging foundry). VeriSilicon itself thrives on the AI ASIC and IP boom in China. Collectively, the family's AI infrastructure-related portfolio is estimated at over $22 billion. Their strategy represents a distinct path: building critical components for open standards and key manufacturing capacity in the chiplet era, rather than pursuing standalone AI chip dominance. While this path may not create the next NVIDIA, it has enabled repeated successful exits and sustained influence within the global semiconductor industry.

marsbit40 мин. назад

After Marvell's 32% Surge, the Chinese Chip Family Behind It Emerges

marsbit40 мин. назад

Microsoft is Afraid of Being Marginalized by AI Giants

Microsoft, once the defining force of the PC era, now faces a familiar challenge in the AI age: the risk of being relegated to a profitable but invisible infrastructure provider. This anxiety was laid bare at Build 2026, where CEO Satya Nadella unveiled a major strategic pivot. The catalyst was a quiet April agreement that dissolved Microsoft's exclusive licensing and cloud-hosting deal with OpenAI, its once-vital partner. This erased Microsoft's key AI moat. With OpenAI and Anthropic defining AI applications and gaining enterprise traction—even within Microsoft's own ranks—Nadella had to answer: without exclusivity, what is Microsoft's role? The answer was a suite of seven in-house AI models, a developer-focused AI workstation (Surface RTX Spark Dev Box), and, most crucially, the Agent 365 platform for enterprise AI governance. The models, notably targeting Anthropic's strengths in coding and enterprise, signal a defensive move. However, the broader strategy is to make the models themselves less decisive. Financially, Microsoft's AI revenue is strong, driven largely by Azure running others' models. Yet its user-facing products like Copilot show weak penetration and engagement. Microsoft earns infrastructure money but lacks direct user mindshare. Nadella's core fear is being "hollowed out." As OpenAI and Anthropic prepare for IPOs and gain financial independence, they may build their own infrastructure, threatening Azure's lucrative AI revenue stream. Microsoft's window is to entrench itself deeper: not as the model creator, but as the indispensable platform for securely deploying, managing, and governing all AI models within the enterprise through Agent 365. Build 2026 revealed Microsoft's bet: in the AI era, the ultimate power lies not in any single model, but in the enterprise "operating system" that controls them. Nadella is determined to ensure Microsoft is the driver of this new era, not just a passenger.

marsbit53 мин. назад

Microsoft is Afraid of Being Marginalized by AI Giants

marsbit53 мин. назад

US Stocks Surge 16% in Two Months: Only 4 Occurrences in History, the Most Recent Was Just Before the 1987 Crash

U.S. stocks have surged 16% over April and May, a gain seen only four times since WWII, according to Deutsche Bank strategist Henry Allen. Three prior instances followed recessions, but the sole non-recession precedent was in early 1987, months before the "Black Monday" crash. The current rally is underpinned by AI enthusiasm and strong data, but its speed without an economic recovery is historically unprecedented. Meanwhile, credit spreads remain near record lows despite growing consumer strain—the U.S. savings rate hit just 2.6% in April, a level last seen before the Global Financial Crisis, and consumer sentiment is at its lowest since 1952. Market risks are concentrated. While stocks and credit markets appear immune to geopolitical tensions, sovereign bonds tell a different story. Treasury yields have tracked oil prices and recently hit multi-year highs, creating a widening divergence with equities. Surprisingly stable oil prices, despite the prolonged closure of the Strait of Hormuz, have been a key pillar for risk assets. However, if the blockage persists, this support could fade, raising the risk of a stagflation shock not currently priced in.

marsbit54 мин. назад

US Stocks Surge 16% in Two Months: Only 4 Occurrences in History, the Most Recent Was Just Before the 1987 Crash

marsbit54 мин. назад

CPU, Quietly Returning to the Center of the AI Computing Power Stage

Over the past three years, AI computing power narratives have been dominated by GPUs. However, starting in 2026, this story began to shift. While training large models remains GPU-intensive, the rapid growth of inference and AI agent workloads, which require high levels of task orchestration, concurrency, and data flow management, has highlighted a renewed critical role for CPUs. These are tasks GPUs are not designed to handle. Intel's recent launch of the Xeon 6+ processor, built on its Intel 18A process and featuring up to 288 efficiency cores (E-cores), exemplifies this strategic pivot. It is positioned not as a mere companion to GPUs but as the essential "control plane" for AI infrastructure, optimized for high-density, energy-efficient, and high-throughput workloads characteristic of AI agents and inference. This "CPU resurgence" is not about CPUs outperforming GPUs in raw computation. It reflects a systemic bottleneck: as AI scales from training single models to deploying countless intelligent agents, the demand for coordination and data handling surges. Major cloud providers are also developing their own high-density ARM-based server CPUs for similar workloads. However, Intel's success with this strategy faces significant challenges. Competition includes NVIDIA's integrated CPU-GPU solutions, the expanding adoption of cloud vendors' in-house ARM CPUs, and the crucial market test of Intel's 18A manufacturing process against rivals like TSMC's N2. In conclusion, CPUs are indeed reclaiming a central, though redefined, role in AI compute—managing the complex orchestration that enables massive-scale AI deployment. While the trend is clear, which company will ultimately lead this CPU resurgence remains an open question to be decided in the data centers of 2027 and beyond.

marsbit1 ч. назад

CPU, Quietly Returning to the Center of the AI Computing Power Stage

marsbit1 ч. назад

TON Rebrands Native Token As Gram, Reviving Original White Paper Name

The native token of the Toncoin (TON) network is being rebranded to 'Gram,' reviving its original name from the project's first white paper. This change, announced by Telegram co-founder and CEO Pavel Durov, is the fourth step in his "Make TON Great Again" (MTONGA) initiative. The transition period is expected to take about three weeks, and a new logo has been previewed. The rebrand follows Telegram's official return to the ecosystem in May as its largest validator, after a six-year absence that began with a 2020 legal dispute with the SEC. Durov stated that the move "returns to our roots" and paves the way for future developments, with three more steps remaining in the MTONGA roadmap. At the time of the announcement, Gram was trading around $2.02.

bitcoinist1 ч. назад

TON Rebrands Native Token As Gram, Reviving Original White Paper Name

bitcoinist1 ч. назад

Торговля

Спот

Фьючерсы

Обсуждения

Добро пожаловать в Сообщество HTX. Здесь вы сможете быть в курсе последних новостей о развитии платформы и получить доступ к профессиональной аналитической информации о рынке. Мнения пользователей о цене на S (S) представлены ниже.

Stop Staring at GPUs, Intel Unleashes a Major Move—Can It End Nvidia's Computing Power Monopoly?

Agents Are Changing the Computing Ecosystem

From General-Purpose to Custom, Intel Aims to Be 'Everywhere'

Похожее

After Marvell's 32% Surge, the Chinese Chip Family Behind It Emerges

Microsoft is Afraid of Being Marginalized by AI Giants

US Stocks Surge 16% in Two Months: Only 4 Occurrences in History, the Most Recent Was Just Before the 1987 Crash

CPU, Quietly Returning to the Center of the AI Computing Power Stage

TON Rebrands Native Token As Gram, Reviving Original White Paper Name

Торговля

Популярные статьи

Как купить S

Sonic: Обновления под руководством Андре Кронье – новая звезда Layer-1 на фоне спада рынка

HTX Learn: Пройдите обучение по "Sonic" и разделите 1000 USDT

Обсуждения

Топ вопросы

Популярные категории

Популярные теги