The Underlying Logic of Bottleneck Propagation in the AI Computing Power Industry Chain

Q: What are the four sequential bottleneck stages in the AI computing power supply chain as described in the article, and which one is identified as the 'ultimate bottleneck'?

The four sequential bottleneck stages are: 1) GPU/Computing, 2) Memory (HBM), 3) Optical Interconnect, and 4) Power + Liquid Cooling. The article identifies the fourth stage, Power and Liquid Cooling, as the 'ultimate bottleneck' or final physical constraint, as even if all other components are ready, a lack of power and cooling prevents the AI clusters from running.

Q: Why did High Bandwidth Memory (HBM) become a critical bottleneck after the initial GPU shortage was alleviated?

HBM became the critical bottleneck because as GPU computing power increased to handle massive AI models with trillions of parameters, the need for faster data transfer (memory bandwidth) created a 'memory wall.' HBM, which is much faster than traditional DDR memory, is essential for feeding data to these powerful GPUs. Its complex manufacturing process (involving TSVs and stacking) and limited suppliers (SK Hynix, Samsung, Micron) made its supply unable to keep up with explosive demand, delaying entire AI cluster deployments even when GPU chips were available.

Q: According to the article, what is the fundamental reason the industry is transitioning from copper cables to optical interconnects for scaling AI clusters?

The fundamental reason is the physical limitations of copper cables. While usable within a single server rack, copper cables face severe signal attenuation, excessive weight (e.g., over 1.36 tons for an NVL72 rack), high power consumption for signal integrity, and distance constraints when scaling to multi-rack clusters with thousands of GPUs. Optical interconnects (like CPO and silicon photonics) offer higher bandwidth density, lower power per bit, and longer transmission distances, making them a necessity for breaking the performance ceiling of large-scale AI data centers.

Q: How does the article characterize the nature of bottlenecks in the AI computing power supply chain, and what investment shift does this logic explain?

The article characterizes the bottlenecks as forming a system-level 'Leontief production function,' where components like GPU, HBM, interconnect, power, and cooling are complementary constraints—the system's capacity is determined by the lowest-performing (most bottlenecked) component. This logic explains the shift in investment focus from earlier leaders like NVIDIA and TSMC to companies in subsequent bottleneck areas: HBM suppliers (SK Hynix, etc.), optical component makers (Lumentum, Coherent), and power/cooling infrastructure providers (Vertiv, power companies), as each bottleneck转移 reshapes value distribution in the产业链.

Q: What specific data points from major investment banks does the article cite to illustrate the scale and unpredictability of current AI infrastructure demand?

The article cites several independent data points: Morgan Stanley noted a 2.5x increase in global weekly LLM token consumption in 3 months. J.P. Morgan identified a 122 GW financing gap for data center projects over 5 years and that 44% of new U.S. power projects face over 4-year grid connection waits. Bank of America significantly raised Alphabet's 2026 CAPEX forecast to $181.5 billion (a doubling year-over-year), expecting a 62% drop in free cash flow. These figures from different research paths collectively show AI demand has exceeded all traditional planning models for power, semiconductor equipment, and memory pricing.

marsbitPublished on 2026-05-22Last updated on 2026-05-22

Abstract

The article analyzes the evolving bottleneck progression within the AI compute supply chain. Initially constrained by GPU chip and advanced packaging capacity (2022-2024), the primary bottleneck shifted to HBM memory (2024-2025) due to massive model parameter growth. As cluster scale expands, physical limits of copper interconnects are making optical interconnect technologies the next critical phase (2025-2026). The ultimate, emerging constraint is power delivery and advanced liquid cooling (from 2026 onward), driven by skyrocketing rack power densities exceeding traditional infrastructure limits. The core thesis is that AI compute demand follows a "Leontief" production function where solving one bottleneck immediately exposes the next in the sequence: Compute (GPU) → Memory (HBM) → Interconnect (Optics) → Power & Cooling. Each shift reallocates value and investment across the semiconductor and infrastructure landscape.

Author: qinbafrank

In February, in the article "What Does This War of Capital Expenditure Mean?", it was discussed that key segments in the computing power industry chain can still capture the greatest value: chips, packaging & testing, memory, optical modules, etc. Those with capacity that is difficult to expand rapidly or those with extremely high moats will enjoy the红利 of massive capital expenditures.

There is still significant room for efficiency optimization: Distillation, quantization, MoE, dedicated chips, liquid cooling, nuclear fusion (long-term) on the inference side may reduce the energy consumption and cost per unit of computing power by another 10–100 times. Opportunities should be sought in these segments.

Recently, multiple investment banks including Morgan Stanley, J.P. Morgan, Bank of America, Goldman Sachs, UBS, Citi, Bernstein, and HSBC have published update reports on AI/semiconductors/power/memory. The bottlenecks for AI hardware have expanded from the single dimension of "GPU supply" to collective tension across five dimensions: power, chips, memory, equipment, and materials.

The scale of AI demand has broken through the forecast intervals of all traditional power planning, semiconductor equipment capacity, memory price models, and robot installation assumptions.

Morgan Stanley's global thematic research review points out that the global weekly large language model token consumption soared from 6.4 trillion to 22.7 trillion within 3 months, an increase of 2.5 times. The U.S. data center power gap for 2025-28 is 55 GW; J.P. Morgan's inaugural coverage of data center high-performance computing project debt directly gives a "122 GW financing gap in the next 5 years" figure. U.S. 5-year power planning has surged from 101 GW to 230 GW, with 44% of new projects experiencing grid connection wait times exceeding 4 years; Bank of America's latest target price report for Alphabet directly revises its 2026 capital expenditure upward to $181.5 billion, doubling year-on-year, with free cash flow declining 62%. These three sets of data are not outputs from the same framework, but independent portraits from three separate institutions on different research paths.

The evolution of bottlenecks in the semiconductor industry chain (especially in the AI computing power field) precisely progresses in this clear sequential order: "Computing (GPU) → Memory (HBM, etc.) → Optical Interconnect → Power/Liquid Cooling". This is the industry consensus for 2025-2026. As AI training/inference clusters scale from single cabinets (dozens of GPUs) to super-large scale (thousands to hundreds of thousands of GPUs), each time a bottleneck in one segment is resolved, the next physical/supply chain constraint is immediately exposed, forming "Leontief-style" complementary constraints (if one is missing, nothing can be shipped).

It is necessary to understand why this evolution occurs, the current status, and the underlying physical/engineering reasons:

1. First Phase Bottleneck: GPU Computing (Dominant from 2022-2024) Core Constraint:

High-end GPU (e.g., NVIDIA Hopper H100 → Blackwell B200 → Rubin) wafer capacity itself + advanced packaging.

Why it was the bottleneck: AI large models require massive parallel computing. TSMC's 4nm/3nm/2nm logic processes + CoWoS (2.5D/3D packaging) capacity once became the biggest choke point. Even if front-end wafers were sufficient, the back-end capability to package logic chips + HBM stacks couldn't keep up, preventing the entire GPU from being produced.

Easing situation: TSMC aggressively expanded CoWoS (capacity doubling 2024-2025), NVIDIA Blackwell is shipping in large volumes. But this only unlocked the "computing" segment, immediately exposing new problems.

2. Second Phase Bottleneck: Memory (HBM High Bandwidth Memory, becoming the tightest from 2024-2025)

Core Constraint: HBM3/HBM3e/HBM4 capacity.

Why it became the next bottleneck: GPU computing power increased, but model parameters exploded (trillions to tens of trillions of parameters), making data movement (memory bandwidth) the "memory wall." HBM can transmit several TB of data per second, over 20 times faster than conventional DDR memory. Because HBM is adjacent to the logic chip, data doesn't need to travel far, thus saving energy.

A single B200 GPU requires 192GB+ of HBM3e. A single cabinet (NVL72) HBM total capacity has reached 30-40TB, and bandwidth demands far exceed traditional DRAM.

Supply chain status: Only SK Hynix, Samsung, and Micron can mass-produce HBM, with complex processes (TSV + stacking). 2025 supply is already sold out, 2026 remains in short supply, with prices soaring 246% year-on-year. Even if GPU chips are ready, without HBM, assembly and delivery are impossible, causing delays in entire AI cluster deployments.

Result: Memory transformed from a "commodity" into a strategic choke point, potentially accounting for 30% of capital expenditures.

3. Third Phase Bottleneck: Optical Interconnect (Transition underway in 2025-2026)

Core Constraint: Physical limits of copper cables (NVLink/NVSwitch) in bandwidth, distance, power consumption, and weight.

Why a shift to optics is inevitable: Copper can still work within a single cabinet (72 GPUs), but when scaling to multi-cabinet or even thousands of GPU interconnects, copper cable attenuation is severe (effective distance <1 meter at 1.8TB/s bandwidth), weight explodes (NVL72 cabinet copper cables exceed 5,000, total weight 1.36 tons), and power consumption is high (replaceable optical modules replacing copper add an extra 20,000W). Signal integrity, latency, and cooling cannot support larger clusters.

Solution: Shift to optical interconnect (CPO Co-Packaged Optics + Silicon Photonics). Embedding optical engines directly next to the GPU/ASIC, using fiber optics for scale-out, achieving higher bandwidth density, lower per-bit power consumption, and longer distances.

NVIDIA heavily bet on this at GTC 2026, having invested in optical companies. Demand for 800G/1.6T optical modules is exploding. Companies like Lumentum, Broadcom, Coherent, Ayar Labs become new winners.

Current progress: Copper has reached its limit. Optics are shifting from "optional" to "mandatory," breaking through AI data center performance ceilings.

4. Fourth Phase Bottleneck (The Current Frontier): Power + Liquid Cooling (Becoming the ultimate physical constraint from 2026 onwards) Core Constraint: Power Wall + Cooling Wall + Grid Access.

Why it's the ultimate bottleneck: Each GPU's power consumption rose from 300W→700-1200W. Single cabinet power surged from 10-20kW (CPU era) to 120-200kW+ or even higher. Traditional air cooling has a physical limit of only 20-50kW, with unacceptable noise, airflow, and energy consumption.

Power side: Data centers require GW-level power supply, with grid connection queues potentially lasting years. Delivery cycles for transformers, solid-state transformers, and other equipment are extending to 100 weeks. Microsoft's CEO once bluntly stated, "We have GPUs but no electricity to plug them into."

Liquid cooling side: Must switch to Direct-to-Chip liquid cooling or immersion cooling, combined with microfluidics, cold plates, and other technologies. TSMC has demonstrated silicon-based liquid cooling on the CoWoS platform, supporting >2.6kW TDP. Liquid cooling/thermal management companies like Vertiv (VRT) are becoming new infrastructure core players.

Chain reaction: PUE (Power Usage Effectiveness) requirements are <1.2. Waste heat recovery, nuclear/new energy grid integration have become new topics. Even if all previous segments are solved, without power and cooling, cabinets cannot be racked and operated.

The Essential Logic of AI Computing Power Industry Chain Bottleneck Shifts AI computing power is not a "single-point" issue, but a systemic Leontief production function — GPU, HBM, interconnect, power, cooling must match based on the lowest-capacity component. Hyperscalers (Google, Microsoft, Meta, etc.) each time they solve one, immediately push capital and innovation to the next segment.

Currently (2026), we are in the transition period of "accelerated optical interconnect deployment + large-scale commercialization of power/liquid cooling." New bottlenecks may yet emerge (e.g., lasers, fiber materials, or grid transformers), but this chain of "computing → memory → optics → power/cooling" has become the recognized industry path.

This also explains why the investment logic is shifting from NVIDIA/TSMC to the HBM trio (SK Hynix, etc.), optical manufacturers (Lumentum, Coherent), and liquid cooling/power infrastructure companies (Vertiv, related power supply companies).

Every bottleneck shift is reshaping the value distribution across the entire semiconductor + data center industry chain.

Trending Cryptos

CitreaCTR

wrapped stUSDTWSTUSDT

Velodrome FinanceVELODROME

BrevisBREV

PancakeSwapCAKE

JUSTJST

Following the Coldcard Hack, One of the Largest Bitcoin Wallet Hacks Recently, a New Wave of Losses Begins! Losses Are Mounting

Following a major hack targeting Coldcard hardware wallets, losses have surged to approximately 1,367 BTC ($88.6 million) across 4,585 addresses. The third wave of attacks stole an additional 207.7 BTC, exhibiting different patterns from the first two. While initial attacks used shared deposit addresses and targeted P2WPKH wallets, the latest wave employed unique recipient addresses per victim and focused on P2WSH addresses. Analysis by Galaxy Research cannot definitively link all three waves to the same attacker, raising the possibility of a second actor exploiting the known vulnerability. The stolen funds, predominantly from wallets holding under 1 BTC, remain unspent. The vulnerable Coldcard firmware was released in March 2021, and all stolen coins originate from after that date.

cryptonews.ru45m ago

Following the Coldcard Hack, One of the Largest Bitcoin Wallet Hacks Recently, a New Wave of Losses Begins! Losses Are Mounting

cryptonews.ru45m ago

Trump Media sells another 2,628 BTC, holdings fall to 4,261 BTC

Trump Media & Technology Group has sold an additional 2,628 Bitcoin (worth approximately $165 million), continuing a series of sales over the past seven months. According to blockchain data from Arkham cited by Lookonchain, these latest transfers to Crypto.com bring the company's total reported Bitcoin sales to 7,281 BTC (worth about $545 million), reducing its holdings by 63%. The company's remaining Bitcoin holdings now stand at 4,261 BTC, valued at $269.8 million. Trump Media initially purchased 11,542 BTC at an average price of $118,522 before beginning the sales. This activity occurs amid broader scrutiny of crypto ventures linked to former President Donald Trump, as lawmakers debate the CLARITY Act, which focuses on ethics rules, digital asset ownership, and potential conflicts of interest for public officials.

cointelegraph1h ago

Trump Media sells another 2,628 BTC, holdings fall to 4,261 BTC

cointelegraph1h ago

In Jinjiang, Fujian, a Storage Super Unicorn Lies Quiet

In Fujian's Jinjiang, a city known for sportswear, lies a quiet semiconductor giant: Fujian Jinhua Integrated Circuit Co. (JHICC). Once a promising domestic DRAM manufacturer alongside Yangtze Memory and ChangXin Memory Technologies (CXMT), its journey was derailed in 2018 when the U.S. placed it on an Entity List and filed criminal charges for alleged trade secret theft. This halted production for years. A turning point came in February 2024 when a U.S. federal court found JHICC not guilty. However, it had lost crucial time. While CXMT soared to become a top-valued A-share company in 2024, JHICC, with an estimated valuation of 80 billion RMB, was just restarting. Its current output is primarily customized DDR4 chips, not the advanced DDR5/HBM demanded for AI, but it still benefits from the broader memory chip upcycle. JHICC's story is tied to Chen Zhengkun, a veteran engineer who left Micron to lead the venture. Founded in 2016 with state-backed funding, JHICC partnered with Taiwan's UMC to develop DRAM technology. Rapid progress was cut short by the U.S. actions, which Micron initiated, partly due to its heavy reliance on the Chinese market. Post-sanctions, Chen's team worked to rebuild the production line with reduced reliance on U.S. technology. According to its records, JHICC achieved small-scale production and revenue growth under immense pressure. It now focuses on the stable "niche" DRAM market (e.g., TVs, routers) with a monthly capacity of ~40,000 wafers, aiming for 60,000 by 2026. It holds over 1,000 patents but remains on the Entity List. For Jinjiang, investing in JHICC was a bold industrial leap. The local government provided unwavering financial and logistical support during the crisis, helping the company survive. JHICC has become the anchor for a growing local semiconductor cluster. Though its scale lags behind domestic peers, JHICC's persistence symbolizes a hard-won foothold in a global market long dominated by Samsung, SK Hynix, and Micron. Having missed one boom, it seeks a place in the new AI-driven memory supercycle.

marsbit1h ago

In Jinjiang, Fujian, a Storage Super Unicorn Lies Quiet

marsbit1h ago

Amid a 38GW Power Shortage, Why Have Bitcoin Mining Facilities Suddenly Become the New Gateway to AI Compute?

Under the impending threat of a 38GW power deficit projected for US data centers between 2026-2028, decommissioned Bitcoin mining sites are emerging as critical infrastructure for AI compute. A growing number of listed North American miners—including TeraWulf, Hut 8, and Cipher Mining—are pivoting to become "Powered Shell Providers." They are repurposing their existing assets: pre-approved grid connections, land, and substations to offer ready-to-use data center shells to cloud providers and AI companies. This shift capitalizes on the severe bottleneck in securing new, large-scale power access, which can take 5-7 years. Analysts at Morgan Stanley estimate retrofitted mining facilities could supply 10-19GW of the needed capacity. The core value proposition is no longer cryptocurrency mining, but selling a scarce "time to power" advantage in the race for AI infrastructure.

华尔街日报1h ago

Amid a 38GW Power Shortage, Why Have Bitcoin Mining Facilities Suddenly Become the New Gateway to AI Compute?

华尔街日报1h ago

Michael Saylor: 'We Never Said We Would Never Sell Bitcoin'

Michael Saylor stated that his company never made a commitment to never sell its bitcoin holdings, though it expects to remain a net buyer of bitcoin long-term. His comments came following reports that the company had received new authorization to sell up to $5 billion in bitcoin. Saylor clarified that this authorization is not new and was announced on June 29th as part of the company's capital management strategy. He emphasized that the authorization permits but does not obligate sales for specific purposes and that no new approval has been announced. Saylor also noted the company never officially adopted a "bitcoin will never be sold" policy.

cryptonews.ru3h ago

Michael Saylor: 'We Never Said We Would Never Sell Bitcoin'

cryptonews.ru3h ago

Trading

Spot

Hot Articles

The Cornerstone of the Autonomous AI Economy: How Talus is Reshaping On-Chain Intelligent Agents

Talus is a decentralized AI Agent framework built on the Sui, designed to solve the structural problems of current AI systems: centralization, opacity, and a lack of native economic identity.

43.4k Total ViewsPublished 2026.03.18Updated 2026.03.18

The Cornerstone of the Autonomous AI Economy: How Talus is Reshaping On-Chain Intelligent Agents

In-depth Analysis of AI and Crypto: The Era of Symbiosis between Algorithms and Ledgers

By 2026, the integration of artificial intelligence and cryptocurrency has advanced from proof-of-concept to a new stage of "system-level integration".

3.0k Total ViewsPublished 2026.03.26Updated 2026.03.26

In-depth Analysis of AI and Crypto: The Era of Symbiosis between Algorithms and Ledgers

U.S. Equity TradFi Assets: Traditional Finance as a Steady Anchor Amid the AI IPO Boom

In 2026, the U.S. IPO market has regained momentum.

36.9k Total ViewsPublished 2026.07.08Updated 2026.07.08

U.S. Equity TradFi Assets: Traditional Finance as a Steady Anchor Amid the AI IPO Boom

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of AI (AI) are presented below.