# Nvidia Related Articles

HTX News Center provides the latest articles and in-depth analysis on "Nvidia", covering market trends, project updates, tech developments, and regulatory policies in the crypto industry.

CPU Makes a Comeback to the Table, A $170 Billion "Power Seizure" Drama Begins

A new era is dawning for the server CPU (Central Processing Unit), driven by the shift from AI model training to large-scale reasoning and the rise of Agentic AI. This article explores how the CPU is reclaiming a central role in the AI data center. For years, the focus has been on the GPU (Graphics Processing Unit) for AI training. However, as AI moves to the inference and Agent phase—where tasks involve complex, multi-step reasoning, tool calls, and data management—the workload balance is flipping. Studies show CPUs now handle over 70% of the workload in Agentic AI, up from 10-30% in training. This is because Agent tasks generate massive intermediate data (KV Cache) that exceeds GPU memory, forcing it to be offloaded to the CPU's larger, more scalable memory pools. This increased importance is translating into market changes. Major players are taking note: NVIDIA launched its first standalone CPU line, Vera, based on ARM architecture and optimized for Agent performance. AMD doubled its server CPU market forecast to over $1200 billion by 2030. Analyst reports project the total server CPU market could reach $1700 billion by 2030, with AI-driven demand being a primary driver. Furthermore, the classic ratio of CPUs to GPUs in AI servers is rapidly changing, converging from 1:8 toward 1:1 for Agent deployments. This surge in demand has led to a rare industry-wide price increase of 10-15% for server CPUs from Intel and AMD, breaking a decade-long trend of "more performance for the same price." Demand is bifurcating into high-core-count CPUs for in-rack GPU support and moderate-core CPUs for standalone Agent task orchestration. In China, this global trend presents an opportunity for domestic CPU manufacturers like Hygon (海光信息) and Huawei Kunpeng, who are bolstered by both growing AI infrastructure needs and national policies promoting technological self-reliance ("xin chuang"). The maturity of their software ecosystems is also accelerating, evidenced by faster adaptation to new AI models. In conclusion, the narrative is shifting from a GPU-centric view to one where CPU-GPU synergy is critical. The CPU is no longer a peripheral component but a performance-defining bottleneck and a key growth driver in the AI hardware stack, opening a massive new market estimated in the hundreds of billions of dollars.

marsbit06/19 13:41

CPU Makes a Comeback to the Table, A $170 Billion "Power Seizure" Drama Begins

marsbit06/19 13:41

Dylan Patel: Founder of SemiAnalysis, Praised by Jensen Huang, is a 'Beekeeper' and 'Forum Enthusiast'

Dylan Patel, founder of the independent research firm SemiAnalysis, has an unconventional background. A former beekeeper from rural Georgia, he entered the semiconductor world as a self-taught "forum warrior," discussing chip technology anonymously online from a young age. He launched the SemiAnalysis blog in May 2020, which later transitioned to a paid subscription model. The firm has grown from a one-person operation to a global team of around 60, with a dedicated teardown lab. Its detailed, technically-focused analysis on semiconductor supply chains, AI infrastructure, and products has earned significant industry recognition. Notably, NVIDIA founder Jensen Huang has publicly cited their reports. In a landmark case, a critical 2024 report on AMD's MI300X GPU software stack led to a 90-minute call with AMD CEO Lisa Su, who thanked him for the constructive feedback. SemiAnalysis later acknowledged AMD's improvements. The firm's influence on markets was seen when a report on NVIDIA's Rubin memory configuration was partially shared, affecting memory stock prices. Dylan Patel emphasized the importance of context, contrasting the shared excerpt with the report's actual title. SemiAnalysis, now a multi-faceted consultancy with revenue projected to reach $100 million, is known for its deep technical insights that influence major industry players and investment decisions.

marsbit06/19 01:08

Dylan Patel: Founder of SemiAnalysis, Praised by Jensen Huang, is a 'Beekeeper' and 'Forum Enthusiast'

marsbit06/19 01:08

Dylan Patel: SemiAnalysis, Praised by Jensen Huang, is Founded by a 'Beekeeper and Forum Warrior'

Dylan Patel, founder of the independent research firm SemiAnalysis, has an unconventional background. Growing up in rural Georgia, he later worked as a beekeeper in Minnesota. His entry into semiconductors began as a self-taught "forum warrior," engaging anonymously in online tech communities from a young age. In May 2020, he started the SemiAnalysis blog on WordPress, later moving it to Substack as a paid subscription service. The firm has since evolved from a one-person operation into a global company with around 60 employees, featuring a dedicated chip teardown lab. Its revenue, reaching $20 million last year, is projected to surpass $100 million this year. SemiAnalysis is highly regarded in the AI and semiconductor industry for its deep technical analysis. NVIDIA founder Jensen Huang has publicly praised its reports. In a notable instance, a critical report on AMD's MI300X GPU software shortcomings prompted a 90-minute call with CEO Lisa Su, who thanked Patel for the "constructive feedback." A later report acknowledged AMD's subsequent improvements. The firm's analyses have significant market impact. For example, a June report discussing potential memory configuration changes in NVIDIA's next-generation servers was cited as a factor in pressure on memory-related stocks. Patel plans to establish a venture capital firm, having already made personal investments in about 20 startups. SemiAnalysis combines roles as a consultancy, model platform, and tech lab, focusing on the practical bottlenecks in AI infrastructure.

Odaily星球日报06/19 01:03

Dylan Patel: SemiAnalysis, Praised by Jensen Huang, is Founded by a 'Beekeeper and Forum Warrior'

Odaily星球日报06/19 01:03

NVIDIA CPU Advances, China's RISC-V Responds: Semiconductor Deep Dive - Part Four

NVIDIA is set to launch its new Vera AI data center CPU in China as early as August, with high pricing. While this move offers a new option, it highlights China's continued dependence on foreign-controlled Arm architecture. In response, the Chinese semiconductor industry is increasingly turning to RISC-V as a strategic alternative for achieving high-performance computing autonomy. The article explores the concept of the "impossible triangle" in CPU development—balancing prosperity, control, and autonomy—and posits that RISC-V's open-source, modular nature offers a unique path to achieving all three. While RISC-V is already dominant in embedded systems, the focus is now shifting to data centers and AI workloads. China has become a global hotspot for RISC-V development, driven by AI-driven compute demand, supply chain concerns from export controls, cost benefits of open-source, and strong policy support. Multiple Chinese companies have reportedly crossed the key performance threshold of 15 SPECint per GHz, a benchmark for entering the high-performance CPU club. Progress extends beyond single-core benchmarks. Companies are developing complete computing subsystems, including commercial-grade coherent network-on-chip (NoC) technology and server processors with up to 40 cores that strictly adhere to the RVA23 standard to ensure software compatibility. Real-world applications are emerging in areas like video transcoding and edge AI. However, significant challenges remain. The RISC-V ecosystem faces fragmentation, immature toolchains and verification processes, and gaps in single-core performance and energy efficiency compared to mature x86 and Arm architectures. The formidable software moat, epitomized by NVIDIA's CUDA, is a long-term hurdle. In conclusion, while RISC-V cannot immediately replace offerings like NVIDIA's Vera, it represents a viable long-term path for China to develop a self-sufficient, high-performance CPU ecosystem. The journey is acknowledged to be long and arduous, requiring sustained effort to overcome technical and ecosystem challenges.

marsbit06/18 17:38

NVIDIA CPU Advances, China's RISC-V Responds: Semiconductor Deep Dive - Part Four

marsbit06/18 17:38

qinbaFrank: Review and Outlook of the AI Computing Power Wave — From the Three Debates on NVIDIA to Optical Interconnect and SpaceX IPO, How is Capital Rotating?

**Summary: Retrospective and Outlook on the AI Computing Wave - A Framework for Capital Rotation** Based on a presentation by investor qinbaFrank, this analysis reviews the AI computing market trajectory since 2023 and outlines a forward-looking framework. **Key Phases and Market Debates:** The AI bull market progressed through three major debates: 1) The necessity of massive capital expenditure (late 2023). 2) The sustainability of tech giants' spending (early 2024-early 2025). 3) Potential overestimation of compute needs (early 2025). Consensus solidified in late 2025 as model capabilities and utility demonstrably improved. **Core Thesis: Penetration Rate Drives Commercialization.** Unlike the 2000 dot-com bubble, the current AI wave benefits from mature digital infrastructure, enabling faster adoption. The critical threshold is 10% penetration; surpassing it (with recent enterprise intent surveys showing ~18%) indicates entry into a rapid growth "golden period" where user scale and willingness to pay increase simultaneously. **AI vs. Internet: A Fundamental Difference.** While the internet enhanced connection efficiency, AI directly substitutes human cognition and labor. Once AI performance exceeds the "societal average" human level, its commercial value scales exponentially as payment shifts from human labor costs to AI service fees. **Investment Logic Evolution in the Compute Chain.** The focus has expanded from GPUs to a systemic re-rating of the entire hardware stack: storage/HBM, CPUs, interconnects, power, and advanced packaging. The framework is: **short-term "scarcity pricing," mid-term "upgrade pricing" (e.g., optical interconnects, power networks), and long-term "Physical AI" pricing** (edge computing, robotics). **Market Focus Shift and Adjustment Framework.** The market is transitioning from "hardware scarcity" to "commercialization validation." The ultimate anchor for the narrative is sustained high growth in model providers' Annual Recurring Revenue (ARR) and cloud business revenue, which justifies continued capital expenditure. Adjustments are categorized into three levels: * **L1 (Minor):** Driven by valuation compression or macro noise (e.g., single CPI print). Fundamentals intact. * **L2 (Moderate):** Triggered by significant macro events requiring risk repricing. Requires new data for confidence restoration. * **L3 (Major):** Involves a reset of the core industrial narrative or macro regime (e.g., AI commercialization growth stalling). The **crucial dividing line** is whether AI commercialization growth slows. Without a slowdown, pullbacks are likely L1/L2 "repricing" events. A genuine growth deceleration would signal an L2/L3 narrative reset. **Conclusion: A Foundational Civilizational Leap.** AI represents a foundational upgrade to "intelligence" itself—akin to humanity mastering fire—rather than a single-point industrial revolution. This底层能力跃迁 (underlying capability leap) will spawn successive waves of innovation (Agent, robotics, industry workflow重构). The journey will be波浪式的 (wavelike), driven by cycles of scarcity, technological upgrades, and远期兑现 (long-term realization).

marsbit06/17 11:28

qinbaFrank: Review and Outlook of the AI Computing Power Wave — From the Three Debates on NVIDIA to Optical Interconnect and SpaceX IPO, How is Capital Rotating?

marsbit06/17 11:28

AMD Launches Compact AI Host, Directly Challenging NVIDIA DGX Spark

In June 2026, AMD announced the Ryzen AI Halo, a compact AI developer desktop to rival NVIDIA's DGX Spark. Both feature 128GB unified memory for running 200B+ parameter models locally. Priced from $2,949 to $3,999, AMD undercuts NVIDIA's $3,999+ DGX Spark. The core divergence lies in architecture and philosophy. Ryzen AI Halo uses an x86-based Ryzen AI Max+ 395 APU (CPU+GPU+NPU), runs standard Windows/Linux, and emphasizes general-purpose PC flexibility. DGX Spark uses an ARM-based Grace Blackwell Superchip, runs a custom DGX OS, and includes a high-speed ConnectX-7 NIC for cluster prototyping, anchoring it to NVIDIA's full-stack CUDA ecosystem. AMD's ROCm software has improved, with simpler installation and support for major frameworks, but still lags behind CUDA's 17-year maturity in community support and cutting-edge library availability. AMD's broader strategy focuses on becoming a viable second-source supplier. Key moves include acquiring design capabilities via ZT Systems (while outsourcing manufacturing) and securing two major 6GW GPU supply deals with OpenAI and Meta in late 2025/early 2026. These contracts validate AMD's role in diversifying the AI supply chain, rather than outright beating NVIDIA. NVIDIA counters with a tightly integrated stack from desktop (DGX Spark) to data center, emphasizing seamless scalability and enterprise software subscriptions (AI Enterprise). In summary, Ryzen AI Halo represents AMD's pragmatic path: offering a cost-effective, open-ecosystem alternative for developers wary of vendor lock-in, while its large data center contracts aim to capture share from customers seeking a second GPU supplier. The choice boils down to a familiar, flexible PC environment with potential software gaps (AMD) versus a premium, optimized, but locked-in ecosystem (NVIDIA).

marsbit06/16 09:14

AMD Launches Compact AI Host, Directly Challenging NVIDIA DGX Spark

marsbit06/16 09:14

Xpeng and NIO Compete on Computing Power, Li Auto Shifts Architecture

On June 15, 2026, Li Auto unveiled details of its self-developed chip, Mahe M100, for its new L9 Livis model. CTO Xie Yan stated the goal was not just a faster chip, but a fundamentally different one, targeting the chip architecture itself. While competitors like NIO, Xpeng, and Huawei highlight TOPS (computing power) figures for their self-developed chips, Li Auto’s Mahe M100 focuses on redesigning the underlying architecture. It employs a "dynamic data flow architecture" to address memory bandwidth bottlenecks in large model inference, claiming up to 3x the effective computing power of Nvidia's Thor U for its specific workloads and a 40% reduction in latency. The chip's design was peer-reviewed and accepted at ISCA 2026. However, this performance is highly optimized for Li Auto's own VLA2.1 algorithm, meaning it may not generalize as well to other tasks. Li Auto aims to achieve full-stack in-house development with Mahe M100, covering chip, compiler, OS, AI algorithms, and domain controller—a level of vertical integration few competitors match. Beyond the chip, CEO Li Xiang introduced a new strategic narrative: the "embodied intelligent vehicle," defined as an integration of an EV, a professional driver, an AI computer, and a life assistant. This shifts competition from features like large screens to systemic AI capabilities. A key commitment was that Li Auto's Mahe VLA autonomous driving model will match Tesla's FSD V14 by Q4 2026, with specific OTA milestones set for July, September, and December. Financially, Li Auto faces pressure with declining revenue and vehicle gross margins since Q4 2025, while maintaining high R&D investment (approx. ¥12B in 2026, 50% AI-related). Its 2026 sales target is 550,000 vehicles, up from 406,000 in 2025. The new L9 Livis garnered over 10,000 pre-orders in two weeks. The effectiveness of these strategic moves—new products, OTAs, and the novel chip architecture—will begin to show in Q3 2026 financial results, with the year-end FSD V14 benchmark being the ultimate test.

marsbit06/16 04:52

Xpeng and NIO Compete on Computing Power, Li Auto Shifts Architecture

marsbit06/16 04:52

活动图片