An In-Depth Analysis of the Reasons Behind the Recent Surge in the Memory Storage Industry

marsbitPublished on 2026-05-11Last updated on 2026-05-11

Abstract

"Understanding the Recent Boom in the Memory Industry" summarizes the key structural shift in the memory sector driven by AI. Traditionally, the memory industry (DRAM, NAND) has been highly cyclical, with prices driven by inventory cycles and consumer electronics demand, which can be deferred. However, AI has fundamentally changed the demand structure. AI workloads create continuous, non-deferrable demand for high-bandwidth memory (HBM) due to GPU memory bottlenecks. This high-margin HBM production is prioritized by major manufacturers like Samsung, SK hynix, and Micron, structurally reducing the supply of traditional DRAM and NAND. Consequently, the market is transitioning from a spot-based model to a contract-based allocation system. HBM supply is reportedly booked through 2026, leading to spot shortages and sharp price increases for other memory types. Delivery lead times have also extended dramatically. Pricing power is shifting from the open market to those with long-term contracts. Primary beneficiaries are HBM-leading suppliers and large hyperscalers (Microsoft, AWS, Google) that can lock in future supply. Key risks to this "AI supercycle" include a potential slowdown in AI capital expenditure, disruptive new memory architectures, or a return to aggressive industry capacity expansion, which could reintroduce cyclicality.

Author: hoidya | 0xU

1/ What Exactly is the Memory Storage Industry?

The memory storage industry primarily consists of three core products: DRAM, NAND, and HBM. Together, they form the data memory system for all digital devices. Whether it's smartphones, computers, or data centers, they all rely on this layer of infrastructure to handle temporary data processing and long-term storage.

Functionally, DRAM is used for temporary storage of data during operation, meeting the high-speed read/write demands of the computation process. NAND is used for long-term data storage, akin to the device's persistent memory layer. HBM, on the other hand, is a new form evolved for high-performance computing environments, designed to address the bandwidth bottleneck between GPUs and computing units.

From a system architecture perspective, the storage industry is not an independent component separate from computing systems; rather, it is a fundamental dependency layer for all computing systems. Any computing task must first "read data," then "compute," and finally "write back the results." Therefore, storage is one of the foundational constraints in the computing process, not an optional module.

Over the past two decades, the demand in this industry has mainly come from three sources: consumer electronics (phones and PCs), enterprise servers, and internet infrastructure. The common characteristics of these demands are strong decentralization, updatable and delayable cycles, and limited scale per demand point. Consequently, the market has long classified it as a typical cyclical semiconductor industry.

2/ Why Has Storage Long Been Viewed as a Cyclical Industry?

The fundamental reason why the storage industry has long exhibited strong cyclicality lies in the asymmetry of its supply-demand structure. Demand typically correlates with consumer electronics cycles and enterprise IT spending cycles, while supply is driven by wafer fab investments, which have a significant time lag.

When demand rises, prices increase rapidly, prompting manufacturers to expand production. However, due to the typical 12- to 24-month lead time for new capacity construction, new supply often floods the market after the demand peak has passed, leading to a rapid price decline. This mechanism creates a typical boom-bust cycle.

This cyclical structure was particularly evident from 2010 to 2022. For instance, the DRAM industry experienced cycles of rapid decline from high-profit margins to losses, followed by rebounds when new demand recovered. This volatility has led the market to long regard the storage industry as a "high-volatility, low-predictability" cyclical asset class.

During this phase, the industry's pricing mechanism was essentially inventory-driven. Prices rose when inventory fell and dropped when inventory accumulated, with demand itself acting more as a triggering variable rather than a structural one.

3/ What Was the Demand Structure Like Before AI?

Before the advent of Artificial Intelligence, storage demand was primarily driven by consumer electronics and traditional internet infrastructure. Consumer electronics are characterized by long replacement cycles and relatively predictable demand, such as the typical 2-3 year smartphone upgrade cycle. Server and enterprise storage demand relied more on the rhythm of IT capital expenditures, also exhibiting strong cyclicality.

In this structure, storage as a standardized product was priced mainly by supply-demand dynamics, not by long-term, locked-in demand from any single large customer. Thus, the market had a highly spot nature, with price signals quickly reflecting inventory changes and capacity adjustments.

In other words, before AI, the demand structure of the storage industry was fragmented and lacked long-term rigid constraints. This was also the core foundation for its cyclical characteristics.

4/ Why Has AI Completely Transformed the Storage Demand Structure? (From Cyclical Commodity to Infrastructure)

Historically, storage demand was driven by consumer electronics (phones, PCs), which is essentially "deferrable consumption." But AI brings a completely different demand function: it is a persistent computing system, and memory usage grows linearly or even super-linearly with model size.

Taking AI data centers as an example, during training and inference, the GPU is not the computational bottleneck; the memory bandwidth is. This directly pushes HBM into becoming a rigid demand. Industry data shows that the demand for high-bandwidth memory from AI servers is growing at a rate far exceeding that of traditional DRAM, leading to long-term lock-ups of HBM capacity, with some reports even indicating pre-sales through 2026.

More critically, the supply side is changing: because HBM offers significantly higher profit margins than traditional DRAM, manufacturers are actively reallocating capacity, shifting wafers from DDR4/DDR5 to HBM production. This structural crowding-out effect is causing "non-demand-driven shortages" in traditional DRAM and NAND.

Extreme market signals are already appearing: spot prices for some DRAM and NAND products have risen 15–20% within a quarter, and "intra-day price adjustments" have emerged.

5/ How Was Storage Priced in the Past?

Between 2010 and 2022, the pricing mechanism in the storage industry was highly typical, a standard semiconductor cycle model:

Prices were driven by inventory cycles, not by demand structure.

When inventory decreased → prices rose → manufacturers expanded production → oversupply emerged → prices collapsed.

The core constraints of this mechanism were the "lag in capacity construction (1–2 years) + deferrable nature of demand."

For example, in the previous cycle, the DRAM industry frequently experienced substantial profit volatility on a quarterly basis, even swinging from high margins to losses and back rapidly.

However, this mechanism has been disrupted in the AI era because two variables have changed simultaneously:

  • First, demand has shifted from fragmented consumption to centralized procurement.
  • Second, supply has shifted from "free-market capacity expansion" to "profit-prioritized allocation (HBM first)."

The result is: cyclical fluctuations still exist, but price elasticity has been structurally compressed.

6/ What Structural Changes Are Happening Now?

The core change in the current (2024–2026) memory market is not just price increases, but a market structure shift from a "spot market" to a "contract allocation system."

First is the crowding-out effect of HBM. Because HBM yields significantly higher profit per wafer than DDR4/DDR5, Samsung, SK hynix, and Micron are all prioritizing capacity allocation towards HBM production. Industry data shows HBM is rapidly rising from a low single-digit share to a structural level of 40%+ of DRAM revenue.

This structural adjustment leads to two outcomes:

  • First, contraction in traditional DRAM supply.
  • Second, NAND enters a state of passive tightness.

Simultaneously, the market is entering an extreme state of supply-demand imbalance: DRAM industry revenue grew 17.1% year-over-year in Q2 2025, but the source of growth was not a demand explosion; it was jointly driven by price increases and supply constraints.

More extreme signals come from the delivery side: industry lead times have extended from the normal 8–12 weeks to 39–52 weeks, with some automotive-grade memory even exceeding 70 weeks.

This signifies a key structural change: memory is no longer an "immediately tradable commodity" but has become a "rationed resource."

This creates a positive feedback loop:

Price increases → manufacturers reduce spot supply → buyers lock in orders early → further reduces spot liquidity → prices continue to rise.

7/ Who Benefits in This Structure?

The profit structure within the storage industry is undergoing a clear migration.

Tier One: Supply Side (Samsung / SK hynix / Micron)

These companies are transitioning from "cyclical manufacturers" to "AI infrastructure suppliers." Among them, SK hynix's leading position in HBM is gradually making it a holder of structural pricing power, with its DRAM market share reportedly rising to around 38%.

Tier Two: Demand Side (Microsoft / AWS / Google)

These companies are locking in future supply through long-term contracts, essentially engaging in "time arbitrage": using current capital expenditure to lock in future AI computing power and memory costs.

Tier Three: AI Model Companies (OpenAI, etc.)

They are caught between cash flow pressure and compute demand, forming a closed loop through financing → capex → locking in supply.

The key change is that: pricing power is shifting from the "market" to "contract structures."

8/ Risks and Falsification Conditions

This round of the "AI memory supercycle" has at least three clear falsification conditions:

First, if AI capex enters a contraction cycle (hyperscalers reduce investment intensity), the current demand structure would quickly distort, as memory demand is highly dependent on AI compute expansion.

Second, if the HBM technology path is superseded (e.g., by new memory architectures or compute-memory fusion), the current HBM price premium would be compressed, causing capacity to flow back to DRAM/NAND.

Third, if the capacity expansion cycle re-accelerates (Samsung / SK hynix re-enter aggressive expansion), the current supply constraints could reverse into an oversupply cycle within 1–2 years.

In other words, the premise for this structure's validity is:

AI demand growth rate > capacity expansion rate + technology substitution rate

Related Questions

QWhat are the three core products that constitute the storage industry according to the article?

AAccording to the article, the three core products are DRAM, NAND, and HBM.

QWhy has the storage industry long been viewed as a cyclical sector?

AIt has been viewed as cyclical due to asymmetric supply and demand structures. Demand correlates with consumer electronics and IT spending cycles, while supply, driven by fab investments, has a significant lag time (12-24 months). This leads to boom-bust cycles where new supply arrives after demand peaks, causing prices to fall.

QHow has the rise of AI fundamentally changed the demand structure for memory?

AAI has shifted demand from 'deferrable consumption' (like phones/PCs) to a continuous compute system where memory usage grows linearly or super-linearly with model scale. This creates rigid, long-term demand (e.g., for HBM) and has led to a shift from a spot market to a contract allocation system.

QWhat is the 'HBM displacement effect' mentioned in the article and what are its consequences?

AThe HBM displacement effect refers to memory manufacturers prioritizing production of HBM (which offers significantly higher profit margins) over traditional DRAM like DDR4/DDR5. This causes a structural supply squeeze for traditional DRAM and NAND, leading to non-demand-driven shortages and price increases.

QWhat are the three key conditions that could disprove or end the current 'AI memory supercycle'?

AThe three key conditions are: 1) A contraction in AI capex spending by hyperscalers, 2) A technological shift that replaces HBM (e.g., new memory architectures), and 3) An accelerated cycle of aggressive capacity expansion by major manufacturers like Samsung/SK Hynix, leading to oversupply within 1-2 years.

Related Reads

Fei-Fei Li's Team Clarifies the Concept of 'World Models', Sora Merely a Renderer

"World Models" has become a widely used yet confusing term in AI. To address this, a team led by Fei-Fei Li and World Labs proposed a functional taxonomy based on the Partially Observable Markov Decision Process framework. This taxonomy categorizes systems called "world models" into three distinct projections: Renderers, Simulators, and Planners. Renderers, like OpenAI's Sora and other video generation models, focus on producing photorealistic visual outputs for human perception. They prioritize visual fidelity over physical accuracy. Simulators, such as NVIDIA Omniverse, aim to compute precise future environmental states for computational tasks like engineering analysis or digital twins. Planners, like Vision-Language-Action models, take in observations and goals to output executable actions for robots or agents. The article clarifies that most current "world models," including Sora, are primarily Renderers. They generate convincing visuals but lack the core ability to simulate state transitions based on actions, a key requirement for a true world model in classic reinforcement learning definitions. This conceptual confusion has practical implications, leading to potential misalignment in technology selection, investment, and public understanding of AI capabilities. Clear categorization is crucial. It helps enterprises avoid costly mistakes (e.g., using a renderer for robot training), allows investors to accurately assess markets, and enables researchers to build comparable benchmarks. While future systems may integrate these functions, recognizing current boundaries is essential for honest assessment and progress.

marsbit7m ago

Fei-Fei Li's Team Clarifies the Concept of 'World Models', Sora Merely a Renderer

marsbit7m ago

Bloomberg Uncovered: How Do China's Wealthy Circumvent the Annual $50,000 Limit to Transfer Assets?

**Summary: How Wealthy Chinese Circumvent $50,000 Annual Foreign Exchange Limits** Despite China's strict capital controls, including an annual $50,000 per person foreign exchange quota, an estimated $150 billion in funds still leaves the country annually via various gray and underground channels. This report outlines the evolution of China's "capital wall" and the methods used to bypass it. **The Evolving Capital Controls:** * **Foundation (1994):** The system of "current account convertibility with strict capital account controls" was established. * **Quota Set (2007):** The $50,000 individual annual forex purchase limit was formalized. * **Crackdown Begins (2015-2017):** Following market volatility, enforcement tightened. Banks were required to scrutinize transactions, and channels like using UnionPay cards for Hong Kong insurance premiums or buying overseas property were blocked. * **Digital & Legal Upgrades (2024-2026):** Enhanced algorithms now flag suspicious patterns (e.g., "smurfing"). The Common Reporting Standard (CRS) provides Chinese tax authorities with data on citizens' offshore accounts. Unlicensed cross-border brokers have been targeted. **Five Primary Methods for Moving Capital:** 1. **Underground Banking / "Hawala" (Duiqiao):** The largest-scale method. No money crosses borders. Clients pay RMB to a domestic account; an overseas associate deposits equivalent foreign currency into the client's offshore account. Risks include high fees, account freezes, and legal penalties. 2. **"Smurfing" or "Ant Moving":** Using multiple individuals' $50,000 quotas to pool funds for one offshore recipient. Increasingly detected by anti-money laundering algorithms. 3. **Trade Invoice Manipulation:** Businesses over-invoice imports or under-invoice exports via offshore shell companies, creating a pretext to transfer excess funds abroad under the guise of trade. 4. **Channel Migration:** After a crackdown on internet brokers, funds flow toward more compliant but costly channels like major banks' cross-border wealth management services or Qualified Domestic Institutional Investor (QDII) quotas. 5. **Structural Arrangements:** High-net-worth individuals use complex, high-cost legal structures involving offshore trusts, insurance, and investment migration programs to transfer asset ownership. **Regulatory Response: Focusing on People, Not Just Money** The current strategy extends oversight from enterprises to **individual residents**. Tools like CRS allow retroactive visibility into offshore assets. Cryptocurrencies, once seen as a potential loophole, are now actively monitored and prosecuted as an illegal channel. The underlying driver remains: with significant wealth concentrated among millions of affluent households seeking diversification amid domestic economic shifts, the incentive to move assets offshore persists despite regulatory barriers.

marsbit27m ago

Bloomberg Uncovered: How Do China's Wealthy Circumvent the Annual $50,000 Limit to Transfer Assets?

marsbit27m ago

Ethereum's Ballmer Moment: As Everyone Is Bearish, the Circulating Supply Is Disappearing

"Ethereum's Ballmer Moment: Circulation Shrinks Amid Bearish Sentiment" Amid widespread bearish sentiment, with prominent figures like Bankless founder David Hoffman selling ETH and young developers flocking to Solana, some argue Ethereum is entering its "Ballmer era"—akin to Microsoft's perceived stagnation under Steve Ballmer. While surface-level criticisms about slow protocol development, cautious leadership, and competitive pressure are valid, underlying fundamentals tell a different story. Approximately 30% of ETH is staked, major holders like BitMine are accumulating, and spot ETFs continue to absorb supply. Regulatory clarity, including the SEC/CFTC's March ruling on staking rewards and the potential passage of the CLARITY Act, is transforming crypto from a regulatory threat into a legitimized framework. This institutionalization, alongside a shrinking circulating supply (with net issuance around 0.23% annually), creates significant buy-side pressure independent of fee-based value capture. The broader crypto total addressable market is expanding through regulated stablecoins, tokenized assets, and institutional adoption. While public chains face competition from permissioned alternatives, the winning model appears to be permissioned assets settling on public chains like Ethereum and Solana. The author advocates a non-maximalist, barbell strategy: holding ETH for its institutional role and supply squeeze, SOL for consumer/throughput trends, BTC as a macro hedge, and a basket of next-gen L1s. Key bullish drivers for ETH include rapid circulation shrinkage, potential Q2 staked ETF approvals, regulatory tailwinds solidifying its role as a default settlement layer, and the optionality of an eventual "Satya moment" leadership shift. Despite bearish consensus, the current setup—where crypto is "not hot" and regulatory groundwork is being laid—presents a compelling investment opportunity. The crypto cycle's focus may have shifted to AI, but blockchain infrastructure is gaining a legal and institutional foothold precisely while attention is elsewhere.

marsbit27m ago

Ethereum's Ballmer Moment: As Everyone Is Bearish, the Circulating Supply Is Disappearing

marsbit27m ago

Trading

Spot
Futures
活动图片