# Chip Related Articles

HTX News Center provides the latest articles and in-depth analysis on "Chip", covering market trends, project updates, tech developments, and regulatory policies in the crypto industry.

GPU Rental Prices Drop 30% in Three Weeks: AI Value Chain Migrating from Nvidia to Memory Chips

GPU rental prices for Nvidia's flagship B200 chip have fallen by approximately 30% over three weeks, dropping from a high of $6.11/hour to $4.22/hour. This decline signals a potential easing of the "compute scarcity" narrative that has long supported AI hardware valuations. Concurrently, the semiconductor market is witnessing a significant divergence: while the VanEck Semiconductor ETF (SMH) has risen 15% in the past month, with memory giants Micron and SanDisk each surging nearly 60%, Nvidia's stock has declined about 3% over the same period. Analysts suggest this shift indicates that the AI value chain's bottleneck and profits are migrating from compute (GPUs) to memory. Demand for high-bandwidth memory (HBM) remains intensely strong, with contract prices soaring over 100% in H1 2026, granting memory manufacturers significant pricing power. In contrast, increased B200 supply from improved manufacturing yields and competitive pressure from new cloud providers are softening GPU rental rates. While long-term contracts, like SpaceX's $30 billion deal with Google, show sustained large-scale demand for Nvidia hardware, the softening spot prices pressure the margins of cloud providers and could eventually impact Nvidia's order flow if chip prices don't adjust. The key takeaway for investors is not a weakening AI thesis, but a recalibration within the sector: pricing power appears to be strengthening for memory chipmakers while showing signs of strain for leading GPU suppliers.

marsbit6h ago

GPU Rental Prices Drop 30% in Three Weeks: AI Value Chain Migrating from Nvidia to Memory Chips

marsbit6h ago

Xpeng and NIO Compete on Computing Power, Li Auto Shifts Architecture

On June 15, 2026, Li Auto unveiled details of its self-developed chip, Mahe M100, for its new L9 Livis model. CTO Xie Yan stated the goal was not just a faster chip, but a fundamentally different one, targeting the chip architecture itself. While competitors like NIO, Xpeng, and Huawei highlight TOPS (computing power) figures for their self-developed chips, Li Auto’s Mahe M100 focuses on redesigning the underlying architecture. It employs a "dynamic data flow architecture" to address memory bandwidth bottlenecks in large model inference, claiming up to 3x the effective computing power of Nvidia's Thor U for its specific workloads and a 40% reduction in latency. The chip's design was peer-reviewed and accepted at ISCA 2026. However, this performance is highly optimized for Li Auto's own VLA2.1 algorithm, meaning it may not generalize as well to other tasks. Li Auto aims to achieve full-stack in-house development with Mahe M100, covering chip, compiler, OS, AI algorithms, and domain controller—a level of vertical integration few competitors match. Beyond the chip, CEO Li Xiang introduced a new strategic narrative: the "embodied intelligent vehicle," defined as an integration of an EV, a professional driver, an AI computer, and a life assistant. This shifts competition from features like large screens to systemic AI capabilities. A key commitment was that Li Auto's Mahe VLA autonomous driving model will match Tesla's FSD V14 by Q4 2026, with specific OTA milestones set for July, September, and December. Financially, Li Auto faces pressure with declining revenue and vehicle gross margins since Q4 2025, while maintaining high R&D investment (approx. ¥12B in 2026, 50% AI-related). Its 2026 sales target is 550,000 vehicles, up from 406,000 in 2025. The new L9 Livis garnered over 10,000 pre-orders in two weeks. The effectiveness of these strategic moves—new products, OTAs, and the novel chip architecture—will begin to show in Q3 2026 financial results, with the year-end FSD V14 benchmark being the ultimate test.

marsbit06/16 04:52

Xpeng and NIO Compete on Computing Power, Li Auto Shifts Architecture

marsbit06/16 04:52

SemiAnalysis Dissects Huawei's Kirin 9030: Process Technology Halted, So They Folded the Chip

SemiAnalysis has published a detailed teardown report on the HiSilicon Kirin 9030 Pro chipset found in Huawei's Mate 80 Pro. Fabricated using SMIC's most advanced N+3 node without EUV lithography, the analysis reveals significant technical achievements and strategic shifts. The report indicates SMIC's N+3 has achieved transistor density comparable to TSMC's N6 (113.4 vs 107.7 MTr/mm²), primarily through aggressive use of Self-Aligned Quadruple Patterning (SAQP) for its metal layers. This results in a notably small 32.5nm M0 metal pitch. However, SemiAnalysis notes this achievement comes with significantly higher process complexity, cost, and potential yield challenges compared to competitors using more advanced tools. The Kirin 9030 design maximizes this constrained density. While its GPU performance has improved ~70% and matches Qualcomm's 2022 flagship level, the CPU core's IPC lags behind current top-tier designs from Apple and Qualcomm, a gap attributed to the underlying manufacturing technology rather than design capability. Facing long-term restrictions on advanced tools, Huawei is charting a new path. The report highlights the company's "LogicFolding" roadmap, a 3D stacking technique aimed at shortening signal paths to boost performance and efficiency. The goal is to reach 5GHz frequency and a projected density of 295 MTr/mm² by 2031. SemiAnalysis concludes that export controls have not halted China's chip progress but have fundamentally altered its trajectory, making it more expensive and complex. This has spurred innovation in alternative areas like 3D stacking and domestic EDA tool development, with Huawei's supply chain also beginning to integrate Chinese memory from CXMT.

marsbit06/15 06:52

SemiAnalysis Dissects Huawei's Kirin 9030: Process Technology Halted, So They Folded the Chip

marsbit06/15 06:52

Microsoft Announces Commercial-Grade Quantum Computer to be Completed in Three Years: Will the Boots Land?

Microsoft announces plans to build a commercially viable quantum computer by 2029, a significant acceleration from the previous industry consensus of a decade. The breakthrough is fueled by their new Majorana 2 quantum chip, which boasts a record-breaking average qubit lifetime of 20 seconds—a 1,000-fold reliability improvement over its predecessor. This leap was achieved by leveraging topological qubits, a theoretically more stable technology using Majorana zero modes, and switching the core superconducting material from aluminum to lead. Crucially, Microsoft's "Discovery" agentic AI platform accelerated the R&D process. AI agents autonomously analyzed vast experimental data, optimized manufacturing parameters (like the lead alloy composition), and solved issues like "ghost noise," dramatically speeding up experimentation. While the 20-second coherence time is a landmark, challenges remain: scaling from 12 qubits to the millions needed for practical applications, managing compilation costs, and verifying quantum results. Skeptics call for peer-reviewed data, and questions persist about whether even 20 seconds is sufficient for complex algorithms like breaking RSA encryption. The race is on with other approaches (superconducting, trapped ions), but Microsoft's confidence in its topological roadmap signals a potential shortcut to a scalable quantum future.

marsbit06/15 03:30

Microsoft Announces Commercial-Grade Quantum Computer to be Completed in Three Years: Will the Boots Land?

marsbit06/15 03:30

Crossing the 'Memory Wall': The Wafer-Level Revolution and Computing Power Routes in the AI Inference Era

In 2026, a historic shift occurred in AI as major cloud providers' inference spending surpassed training spending for the first time, signaling a move from "building large models" to "using large models." This shifts the core challenge from computing power to the "memory wall"—the bottleneck of data movement (model weights, activations, KV Cache) between external DRAM and processors, where energy and latency from data transfer far exceed computation itself. Companies like Nvidia face GPU idle time due to bandwidth limits. In contrast, Cerebras Systems adopts a radical "wafer-scale" approach with its Wafer-Scale Engine (WSE). Instead of cutting a silicon wafer into many chips, Cerebras uses almost the entire wafer as one massive chip (WSE-3). This design provides 44GB of on-chip SRAM, delivering memory bandwidth thousands of times higher than traditional HBM (e.g., 21 PB/s vs. Nvidia B200). For LLM inference, weights are streamed layer-by-layer from external MemoryX storage to the chip, avoiding HBM bottlenecks. This results in token generation speeds 1.5–5 times faster than Nvidia's B200 in some models and significant advantages in first-token latency and long-context tasks. Additionally, Cerebras's architecture offers much lower interconnect power consumption (0.15 pJ/bit vs. GPU's ~10 pJ/bit). However, Cerebras faces challenges: SRAM scaling has slowed with advanced nodes, limiting future capacity gains; the chip requires specialized liquid cooling and custom software stacks; and its external I/O bandwidth (150 GB/s) is low compared to NVLink, hindering multi-system scaling for very large models. Competition is intensifying. Major players are pursuing three paths: 1) Developing proprietary inference ASICs (e.g., Google TPU, Microsoft Maia), 2) Leveraging advanced packaging (e.g., TSMC's SoW) to democratize wafer-scale-like integration, potentially eroding Cerebras's process advantage within a few years, and 3) Exploring optical interconnects for ultimate bandwidth. Commercially, Cerebras is transitioning from a hardware vendor to a service provider, facing the immense challenge of building high-power, specialized data centers to meet large contracts (e.g., 250MW/year from 2026–2028). In conclusion, the AI inference era presents a fundamental architectural trade-off. Cerebras opts for extreme physical optimization for low-latency, single-task performance, while Nvidia prioritizes versatility and massive cluster throughput. The path forward remains uncertain, with technology and business models still evolving in the race toward advanced AI.

marsbit06/05 11:07

Crossing the 'Memory Wall': The Wafer-Level Revolution and Computing Power Routes in the AI Inference Era

marsbit06/05 11:07

Dalio Warns: AI Boom Shows Signs of a Bubble, Day of Reckoning Will Be the Time of Burst

Ray Dalio, founder of Bridgewater Associates, warns that the current artificial intelligence investment boom shows classic signs of a bubble, which he expects will eventually burst. In a Bloomberg Television interview, he noted that great technological revolutions often lead to capital inflows that create bubbles, making it difficult for investors and companies to calibrate their spending accurately—either overspending to capture market share or underspending and losing their competitive position. This caution comes amid significant rallies in AI-related assets, particularly chipmakers, driven by soaring demand for data centers and high-bandwidth chips, raising debates about overheating valuations. In contrast, Nvidia CEO Jensen Huang recently asserted that investors embracing the AI wave would see "crazy" returns and dismissed concerns over return on investment for data center spending as outdated. Dalio, however, focuses on the risks in the profit realization phase. He argues that bubbles tend to show signs of破裂 when markets transition from investment to the need for tangible returns, describing the burst as a process of converting paper wealth into cash. While acknowledging AI's intrinsic value, he expressed concern over the future profitability of some AI companies, suggesting the market is repeating a familiar pattern. The 76-year-old billionaire, who fully exited Bridgewater in 2025, has a net worth estimated at $21.5 billion according to the Bloomberg Billionaires Index.

marsbit06/05 10:17

Dalio Warns: AI Boom Shows Signs of a Bubble, Day of Reckoning Will Be the Time of Burst

marsbit06/05 10:17

AI PC Battle: Bet on the Toll Booth, Not the Camp

**Title:** The AI PC Battle: Don't Bet on Sides, Bet on the Tollbooth **Summary:** The AI PC competition is moving beyond simple "x86 vs. Arm" narratives. The core investment thesis should focus on identifying which players can sustain margins, cash flow, and pricing power throughout the upgrade cycle, rather than backing a particular architecture. The opportunity is analyzed in three layers: 1. **The Advanced Foundry Tollbooth:** TSMC is positioned to collect "tolls" regardless of which chip designer wins, due to its dominant ~70% share in advanced semiconductor manufacturing, which is essential for high-end AI PC chips. 2. **Compute & Platform Spillover:** AMD represents an offensive in the x86 CPU+GPU space, while NVIDIA leverages its GPU and CUDA software stack dominance. Both benefit from the demand for increased local AI compute. 3. **Architecture Diffusion & Turnaround Plays:** ARM and Intel offer potential for significant upside (elasticity), but investments here require stricter discipline due to higher execution risks and competitive challenges. The industry is transitioning from concept to shipment validation. While short-term forecasts for AI PC adoption have been revised down slightly due to tariffs and procurement delays, the long-term trend towards AI becoming a standard PC feature remains intact. The key driver for upgrade cycles will be whether compelling enterprise applications (e.g., privacy-sensitive computing, low-latency inference) emerge beyond consumer-focused features like meeting summarization. Investment strategy should prioritize companies with platform-level advantages and recurring revenue streams. TSMC offers high certainty as the foundational tollbooth. AMD presents a strong offensive play within the established ecosystem. ARM and Intel are higher-risk, higher-potential-reward turnaround bets. The report cautions against chasing short-term hype and emphasizes a disciplined, long-term approach focused on buying ecosystem strength and cash-flow certainty after market enthusiasm subsides. **Key Risks:** Underwhelming AI PC applications slowing upgrade cycles; slow improvement in Windows on Arm compatibility; macro/tariff impacts on PC demand; potential advanced node supply-demand mismatches affecting TSMC; high overall AI sector valuations making stocks vulnerable to a risk-off shift in markets.

marsbit06/04 07:10

AI PC Battle: Bet on the Toll Booth, Not the Camp

marsbit06/04 07:10

活动图片