An In-Depth Analysis of the Reasons Behind the Recent Surge in the Memory Storage Industry

marsbitPubblicato 2026-05-11Pubblicato ultima volta 2026-05-11

Introduzione

"Understanding the Recent Boom in the Memory Industry" summarizes the key structural shift in the memory sector driven by AI. Traditionally, the memory industry (DRAM, NAND) has been highly cyclical, with prices driven by inventory cycles and consumer electronics demand, which can be deferred. However, AI has fundamentally changed the demand structure. AI workloads create continuous, non-deferrable demand for high-bandwidth memory (HBM) due to GPU memory bottlenecks. This high-margin HBM production is prioritized by major manufacturers like Samsung, SK hynix, and Micron, structurally reducing the supply of traditional DRAM and NAND. Consequently, the market is transitioning from a spot-based model to a contract-based allocation system. HBM supply is reportedly booked through 2026, leading to spot shortages and sharp price increases for other memory types. Delivery lead times have also extended dramatically. Pricing power is shifting from the open market to those with long-term contracts. Primary beneficiaries are HBM-leading suppliers and large hyperscalers (Microsoft, AWS, Google) that can lock in future supply. Key risks to this "AI supercycle" include a potential slowdown in AI capital expenditure, disruptive new memory architectures, or a return to aggressive industry capacity expansion, which could reintroduce cyclicality.

Author: hoidya | 0xU

1/ What Exactly is the Memory Storage Industry?

The memory storage industry primarily consists of three core products: DRAM, NAND, and HBM. Together, they form the data memory system for all digital devices. Whether it's smartphones, computers, or data centers, they all rely on this layer of infrastructure to handle temporary data processing and long-term storage.

Functionally, DRAM is used for temporary storage of data during operation, meeting the high-speed read/write demands of the computation process. NAND is used for long-term data storage, akin to the device's persistent memory layer. HBM, on the other hand, is a new form evolved for high-performance computing environments, designed to address the bandwidth bottleneck between GPUs and computing units.

From a system architecture perspective, the storage industry is not an independent component separate from computing systems; rather, it is a fundamental dependency layer for all computing systems. Any computing task must first "read data," then "compute," and finally "write back the results." Therefore, storage is one of the foundational constraints in the computing process, not an optional module.

Over the past two decades, the demand in this industry has mainly come from three sources: consumer electronics (phones and PCs), enterprise servers, and internet infrastructure. The common characteristics of these demands are strong decentralization, updatable and delayable cycles, and limited scale per demand point. Consequently, the market has long classified it as a typical cyclical semiconductor industry.

2/ Why Has Storage Long Been Viewed as a Cyclical Industry?

The fundamental reason why the storage industry has long exhibited strong cyclicality lies in the asymmetry of its supply-demand structure. Demand typically correlates with consumer electronics cycles and enterprise IT spending cycles, while supply is driven by wafer fab investments, which have a significant time lag.

When demand rises, prices increase rapidly, prompting manufacturers to expand production. However, due to the typical 12- to 24-month lead time for new capacity construction, new supply often floods the market after the demand peak has passed, leading to a rapid price decline. This mechanism creates a typical boom-bust cycle.

This cyclical structure was particularly evident from 2010 to 2022. For instance, the DRAM industry experienced cycles of rapid decline from high-profit margins to losses, followed by rebounds when new demand recovered. This volatility has led the market to long regard the storage industry as a "high-volatility, low-predictability" cyclical asset class.

During this phase, the industry's pricing mechanism was essentially inventory-driven. Prices rose when inventory fell and dropped when inventory accumulated, with demand itself acting more as a triggering variable rather than a structural one.

3/ What Was the Demand Structure Like Before AI?

Before the advent of Artificial Intelligence, storage demand was primarily driven by consumer electronics and traditional internet infrastructure. Consumer electronics are characterized by long replacement cycles and relatively predictable demand, such as the typical 2-3 year smartphone upgrade cycle. Server and enterprise storage demand relied more on the rhythm of IT capital expenditures, also exhibiting strong cyclicality.

In this structure, storage as a standardized product was priced mainly by supply-demand dynamics, not by long-term, locked-in demand from any single large customer. Thus, the market had a highly spot nature, with price signals quickly reflecting inventory changes and capacity adjustments.

In other words, before AI, the demand structure of the storage industry was fragmented and lacked long-term rigid constraints. This was also the core foundation for its cyclical characteristics.

4/ Why Has AI Completely Transformed the Storage Demand Structure? (From Cyclical Commodity to Infrastructure)

Historically, storage demand was driven by consumer electronics (phones, PCs), which is essentially "deferrable consumption." But AI brings a completely different demand function: it is a persistent computing system, and memory usage grows linearly or even super-linearly with model size.

Taking AI data centers as an example, during training and inference, the GPU is not the computational bottleneck; the memory bandwidth is. This directly pushes HBM into becoming a rigid demand. Industry data shows that the demand for high-bandwidth memory from AI servers is growing at a rate far exceeding that of traditional DRAM, leading to long-term lock-ups of HBM capacity, with some reports even indicating pre-sales through 2026.

More critically, the supply side is changing: because HBM offers significantly higher profit margins than traditional DRAM, manufacturers are actively reallocating capacity, shifting wafers from DDR4/DDR5 to HBM production. This structural crowding-out effect is causing "non-demand-driven shortages" in traditional DRAM and NAND.

Extreme market signals are already appearing: spot prices for some DRAM and NAND products have risen 15–20% within a quarter, and "intra-day price adjustments" have emerged.

5/ How Was Storage Priced in the Past?

Between 2010 and 2022, the pricing mechanism in the storage industry was highly typical, a standard semiconductor cycle model:

Prices were driven by inventory cycles, not by demand structure.

When inventory decreased → prices rose → manufacturers expanded production → oversupply emerged → prices collapsed.

The core constraints of this mechanism were the "lag in capacity construction (1–2 years) + deferrable nature of demand."

For example, in the previous cycle, the DRAM industry frequently experienced substantial profit volatility on a quarterly basis, even swinging from high margins to losses and back rapidly.

However, this mechanism has been disrupted in the AI era because two variables have changed simultaneously:

  • First, demand has shifted from fragmented consumption to centralized procurement.
  • Second, supply has shifted from "free-market capacity expansion" to "profit-prioritized allocation (HBM first)."

The result is: cyclical fluctuations still exist, but price elasticity has been structurally compressed.

6/ What Structural Changes Are Happening Now?

The core change in the current (2024–2026) memory market is not just price increases, but a market structure shift from a "spot market" to a "contract allocation system."

First is the crowding-out effect of HBM. Because HBM yields significantly higher profit per wafer than DDR4/DDR5, Samsung, SK hynix, and Micron are all prioritizing capacity allocation towards HBM production. Industry data shows HBM is rapidly rising from a low single-digit share to a structural level of 40%+ of DRAM revenue.

This structural adjustment leads to two outcomes:

  • First, contraction in traditional DRAM supply.
  • Second, NAND enters a state of passive tightness.

Simultaneously, the market is entering an extreme state of supply-demand imbalance: DRAM industry revenue grew 17.1% year-over-year in Q2 2025, but the source of growth was not a demand explosion; it was jointly driven by price increases and supply constraints.

More extreme signals come from the delivery side: industry lead times have extended from the normal 8–12 weeks to 39–52 weeks, with some automotive-grade memory even exceeding 70 weeks.

This signifies a key structural change: memory is no longer an "immediately tradable commodity" but has become a "rationed resource."

This creates a positive feedback loop:

Price increases → manufacturers reduce spot supply → buyers lock in orders early → further reduces spot liquidity → prices continue to rise.

7/ Who Benefits in This Structure?

The profit structure within the storage industry is undergoing a clear migration.

Tier One: Supply Side (Samsung / SK hynix / Micron)

These companies are transitioning from "cyclical manufacturers" to "AI infrastructure suppliers." Among them, SK hynix's leading position in HBM is gradually making it a holder of structural pricing power, with its DRAM market share reportedly rising to around 38%.

Tier Two: Demand Side (Microsoft / AWS / Google)

These companies are locking in future supply through long-term contracts, essentially engaging in "time arbitrage": using current capital expenditure to lock in future AI computing power and memory costs.

Tier Three: AI Model Companies (OpenAI, etc.)

They are caught between cash flow pressure and compute demand, forming a closed loop through financing → capex → locking in supply.

The key change is that: pricing power is shifting from the "market" to "contract structures."

8/ Risks and Falsification Conditions

This round of the "AI memory supercycle" has at least three clear falsification conditions:

First, if AI capex enters a contraction cycle (hyperscalers reduce investment intensity), the current demand structure would quickly distort, as memory demand is highly dependent on AI compute expansion.

Second, if the HBM technology path is superseded (e.g., by new memory architectures or compute-memory fusion), the current HBM price premium would be compressed, causing capacity to flow back to DRAM/NAND.

Third, if the capacity expansion cycle re-accelerates (Samsung / SK hynix re-enter aggressive expansion), the current supply constraints could reverse into an oversupply cycle within 1–2 years.

In other words, the premise for this structure's validity is:

AI demand growth rate > capacity expansion rate + technology substitution rate

Domande pertinenti

QWhat are the three core products that constitute the storage industry according to the article?

AAccording to the article, the three core products are DRAM, NAND, and HBM.

QWhy has the storage industry long been viewed as a cyclical sector?

AIt has been viewed as cyclical due to asymmetric supply and demand structures. Demand correlates with consumer electronics and IT spending cycles, while supply, driven by fab investments, has a significant lag time (12-24 months). This leads to boom-bust cycles where new supply arrives after demand peaks, causing prices to fall.

QHow has the rise of AI fundamentally changed the demand structure for memory?

AAI has shifted demand from 'deferrable consumption' (like phones/PCs) to a continuous compute system where memory usage grows linearly or super-linearly with model scale. This creates rigid, long-term demand (e.g., for HBM) and has led to a shift from a spot market to a contract allocation system.

QWhat is the 'HBM displacement effect' mentioned in the article and what are its consequences?

AThe HBM displacement effect refers to memory manufacturers prioritizing production of HBM (which offers significantly higher profit margins) over traditional DRAM like DDR4/DDR5. This causes a structural supply squeeze for traditional DRAM and NAND, leading to non-demand-driven shortages and price increases.

QWhat are the three key conditions that could disprove or end the current 'AI memory supercycle'?

AThe three key conditions are: 1) A contraction in AI capex spending by hyperscalers, 2) A technological shift that replaces HBM (e.g., new memory architectures), and 3) An accelerated cycle of aggressive capacity expansion by major manufacturers like Samsung/SK Hynix, leading to oversupply within 1-2 years.

Letture associate

GitHub, Transfixed by AI

On the night of February 9th, GitHub suffered a major outage caused by a simple configuration change—reducing a cache refresh interval from 12 to 2 hours—that triggered a cascade of failures. This was not an isolated event, but part of a broader pattern. In early 2026, GitHub experienced at least 8 major incidents, failing to meet its promised 99.9% availability. These outages stemmed from structural issues: explosive growth in load, tight service coupling, and insufficient protection against abnormal traffic. This unprecedented load is driven by AI Agents. In 2025, GitHub handled ~1 billion commits. By 2026, weekly commits reached 275 million, projecting to ~14 billion for the year—a 14x increase. AI tools like Claude Code now contribute 4.5% of all public repository commits, with weekly submissions surging 25x in just three months. AI-generated pull requests jumped from 4 million to 17 million per month in half a year. Unlike human developers, AI Agents work continuously, generating commits at a scale that overwhelms infrastructure designed for human rhythms. The surge also shattered GitHub's business model. Copilot's flat-rate pricing, based on assisting human developers, became unsustainable as Agentic AI sessions consumed resources worth hundreds of dollars for a few dollars in fees. In response, GitHub imposed usage limits and, by June 1st, shifted to a pay-per-use "AI Credits" system. Facing this new reality, GitHub realized a 10x scaling plan was insufficient. It announced a need to *redesign* its architecture for 30x current scale—decoupling services, adding fault isolation, and improving change management to prevent cascading failures. Other platforms like Stripe and AWS are facing similar challenges with AI Agents. Fundamentally, GitHub is transitioning from a human collaboration platform to an "exhaust pipe" for automated AI workflows. Its detailed post-mortem reports aim to maintain trust during this turbulent rebuild. The February outage was not just a technical glitch, but a signal of the software industry's entry into a new, AI-driven era.

marsbit15 min fa

GitHub, Transfixed by AI

marsbit15 min fa

Both Suffer Massive Losses Exceeding $90 Billion, Which Is in Greater Peril: Strategy or Bitmine?

Facing massive paper losses exceeding $90 billion each amidst a sharp market downturn, "Digital Asset Treasury" (DAT) giants Strategy and Bitmine find themselves in a precarious position, but with different underlying risks. Strategy, heavily invested in Bitcoin (BTC), faces significant financial strain. Its strategy relies heavily on debt, including convertible notes and preferred stock (STRC) requiring substantial dividend payments. With its cash reserves dwindling and BTC offering no staking yield for cash flow, Strategy's high leverage makes it vulnerable. A continued price decline could force asset sales to meet obligations, potentially creating a negative feedback loop. Its market value has already fallen sharply. In contrast, Bitmine, an Ethereum (ETH) holder, appears on firmer financial ground. It primarily funds its purchases through equity offerings (like ATM programs), avoiding debt pressure. It also generates income by staking a large portion of its ETH holdings. While not immune to market drops and shareholder dilution concerns, Bitmine maintains more flexibility, recently announcing a new preferred share offering to raise further capital. The core divergence lies in their financing: Bitmine uses equity (investor money), while Strategy uses debt (borrowed money). Consequently, Bitmine currently faces less immediate liquidity pressure than Strategy, which must navigate the dual challenge of servicing debt/dividends and a declining core asset (BTC) price.

marsbit22 min fa

Both Suffer Massive Losses Exceeding $90 Billion, Which Is in Greater Peril: Strategy or Bitmine?

marsbit22 min fa

Where the AI Bubble Really Is: Which Layer of Players Are Naked

AI Bubble: Where It Really Is and Who's Swimming Naked This analysis dissects the AI industry not as a single entity but as a five-layer pyramid, arguing that bubbles are concentrated in specific tiers, not uniformly distributed. **Key Distinction from the 2000 Dot-com Bubble:** Unlike 2000, where companies had stock prices before revenue, today's leading AI players have massive, contract-backed revenue driving their valuations. Core infrastructure demand is real, with every GPU running at full capacity for paying customers. **The Five-Layer Pyramid & Bubble Assessment:** * **L0 (Fab/Manufacturing) & Top L4 (Leading AI Apps): NO BUBBLE.** Companies like TSMC, NVIDIA, major cloud providers (Microsoft, Google, Meta, Amazon), and top AI labs have real revenues and orders. Supply is tightly constrained by TSMC's disciplined capacity control and physical limits like power/land for data centers, preventing a supply glut. * **L1 (Memory): BATTLEGROUND.** Sky-high HBM margins could signal a new structural cycle or a classic "boom before bust." The oligopoly of three major players may enforce supply discipline, making this a high-stakes bet. * **L2 (Interconnect/Optical Modules): BUBBLE TERRITORY.** Companies like Lumentum and AAOI have seen stock surges (4-10x) far outpacing revenue growth. This hardware segment has lower physical barriers to expansion than fabs, allowing speculation. It mirrors the 2000 bubble's epicenter—optics. * **L3 (Infrastructure/"GPU Landlords"): VULNERABLE.** GPU leasing companies profit from the current compute shortage but own no long-term moat. Their business model relies on a temporary bottleneck that will ease as big tech expands and new tech (e.g., potential space-based data centers) emerges. * **L4 Long Tail (VC-backed Startups): STRONG BUBBLE SIGNALS.** VC funding concentration in AI is twice that of the 1999 peak. Many startups with little revenue use the valuation logic of successful giants to justify their own, creating high risk of a "valuation crunch" when funding dries up. **Critical Risks to Monitor:** 1. **GPU Depreciation & Accounting:** Companies extending the assumed useful life of GPUs artificially boost profits. The true economic life depends on future generational leaps from NVIDIA. 2. **"GPU Credit" & Off-Balance-Sheet Leverage:** Emerging structures where shell companies borrow to buy GPUs and lease them out (with chipmakers sometimes investing) move debt off major balance sheets. This echoes the "vendor financing" of 2000 and the securitization risks of 2008, though currently small-scale. 3. **TSMC Abandoning Caution:** If the primary supply bottleneck (TSMC's conservative capacity planning) breaks, runaway supply could trigger a bust. 4. **Algorithmic Efficiency Breakthrough:** A major leap in software efficiency could drastically reduce the need for raw compute hardware, undermining the investment thesis. **Conclusion:** The AI boom is expensive and has frothy areas, but its core is underpinned by real demand and physical supply constraints. The bubble risk is layered: most present in optical components, GPU leasing, and the long-tail startup ecosystem, while the foundational chip manufacturing and leading application layers remain relatively solid—for now.

marsbit35 min fa

Where the AI Bubble Really Is: Which Layer of Players Are Naked

marsbit35 min fa

Trading

Spot
Futures
活动图片