An In-Depth Analysis of the Reasons Behind the Recent Surge in the Memory Storage Industry

marsbit | Published 2026-05-11 | Last updated 2026-05-11

Abstract

"Understanding the Recent Boom in the Memory Industry" summarizes the key structural shift in the memory sector driven by AI. Traditionally, the memory industry (DRAM, NAND) has been highly cyclical, with prices driven by inventory cycles and consumer electronics demand, which can be deferred. However, AI has fundamentally changed the demand structure. AI workloads create continuous, non-deferrable demand for high-bandwidth memory (HBM) due to GPU memory bottlenecks. This high-margin HBM production is prioritized by major manufacturers like Samsung, SK hynix, and Micron, structurally reducing the supply of traditional DRAM and NAND. Consequently, the market is transitioning from a spot-based model to a contract-based allocation system. HBM supply is reportedly booked through 2026, leading to spot shortages and sharp price increases for other memory types. Delivery lead times have also extended dramatically. Pricing power is shifting from the open market to those with long-term contracts. Primary beneficiaries are HBM-leading suppliers and large hyperscalers (Microsoft, AWS, Google) that can lock in future supply. Key risks to this "AI supercycle" include a potential slowdown in AI capital expenditure, disruptive new memory architectures, or a return to aggressive industry capacity expansion, which could reintroduce cyclicality.

Author: hoidya | 0xU

1/ What Exactly is the Memory Storage Industry?

The memory storage industry primarily consists of three core products: DRAM, NAND, and HBM. Together, they form the memory and storage layer of all digital devices. Smartphones, computers, and data centers all rely on this layer of infrastructure for temporary data processing and long-term storage.

Functionally, DRAM provides temporary storage for data in use, meeting the high-speed read/write demands of computation. NAND provides long-term data storage, acting as the device's persistent storage layer. HBM is a newer form of stacked DRAM developed for high-performance computing, designed to relieve the bandwidth bottleneck between memory and compute units such as GPUs.

From a system architecture perspective, the storage industry is not an independent component separate from computing systems; rather, it is a fundamental dependency layer for all computing systems. Any computing task must first "read data," then "compute," and finally "write back the results." Therefore, storage is one of the foundational constraints in the computing process, not an optional module.

Over the past two decades, demand in this industry has mainly come from three sources: consumer electronics (phones and PCs), enterprise servers, and internet infrastructure. These sources share three traits: demand is highly fragmented, upgrade cycles can be postponed, and each individual buyer accounts for limited volume. Consequently, the market has long classified memory as a typical cyclical semiconductor industry.

2/ Why Has Storage Long Been Viewed as a Cyclical Industry?

The fundamental reason why the storage industry has long exhibited strong cyclicality lies in the asymmetry of its supply-demand structure. Demand typically correlates with consumer electronics cycles and enterprise IT spending cycles, while supply is driven by wafer fab investments, which have a significant time lag.

When demand rises, prices increase rapidly, prompting manufacturers to expand production. However, due to the typical 12- to 24-month lead time for new capacity construction, new supply often floods the market after the demand peak has passed, leading to a rapid price decline. This mechanism creates a typical boom-bust cycle.
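The lag mechanism described above can be sketched as a toy simulation. This is a minimal illustration, not a calibrated industry model: the function name, the linear response to shortages, and every numeric parameter are assumptions chosen for illustration. Only the idea that capacity ordered today arrives well after the shortage that triggered it comes from the text.

```python
def simulate_capacity_lag(periods=40, lag=6, demand=100.0,
                          start_capacity=80.0, response=0.2):
    """Toy boom-bust model; every parameter here is hypothetical.

    One period can be read as a quarter, so lag=6 is about 18 months,
    inside the 12- to 24-month construction lead time cited in the text.
    """
    capacity = start_capacity
    pipeline = [0.0] * lag          # capacity ordered but not yet online
    capacities, prices = [], []
    for _ in range(periods):
        capacities.append(capacity)
        prices.append(demand / capacity)   # price index: 1.0 means balance
        # Makers react to today's shortage and ignore what is already
        # in the pipeline; a negative order stands for idled capacity.
        pipeline.append(response * (demand - capacity))
        capacity += pipeline.pop(0)        # an earlier order comes online
    return capacities, prices


capacities, prices = simulate_capacity_lag()
print(f"peak price index: {max(prices):.2f}, trough: {min(prices):.2f}")
print(f"peak capacity: {max(capacities):.1f} vs demand 100.0")
```

Because orders keep being placed while earlier ones are still under construction, capacity in this sketch overshoots demand after the initial shortage, and the price index swings from above 1.0 to below it.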

This cyclical structure was particularly evident from 2010 to 2022. For instance, the DRAM industry experienced cycles of rapid decline from high-profit margins to losses, followed by rebounds when new demand recovered. This volatility has led the market to long regard the storage industry as a "high-volatility, low-predictability" cyclical asset class.

During this phase, the industry's pricing mechanism was essentially inventory-driven. Prices rose when inventory fell and dropped when inventory accumulated, with demand acting more as a trigger than as a structural variable.

3/ What Was the Demand Structure Like Before AI?

Before the advent of Artificial Intelligence, storage demand was primarily driven by consumer electronics and traditional internet infrastructure. Consumer electronics are characterized by long replacement cycles and relatively predictable demand, such as the typical 2-3 year smartphone upgrade cycle. Server and enterprise storage demand relied more on the rhythm of IT capital expenditures, also exhibiting strong cyclicality.

In this structure, storage was a standardized product priced mainly by supply-demand dynamics, not by long-term, locked-in demand from any single large customer. The market was therefore largely spot-driven, with price signals quickly reflecting inventory changes and capacity adjustments.

In other words, before AI, the demand structure of the storage industry was fragmented and lacked long-term rigid constraints. This was also the core foundation for its cyclical characteristics.

4/ Why Has AI Completely Transformed the Storage Demand Structure? (From Cyclical Commodity to Infrastructure)

Historically, storage demand was driven by consumer electronics (phones, PCs), which is essentially "deferrable consumption." But AI brings a completely different demand function: it is a persistent computing system, and memory usage grows linearly or even super-linearly with model size.

Take AI data centers as an example: during training and inference, the bottleneck is often not the GPU's raw compute but memory bandwidth. This directly turns HBM into a rigid demand. Industry data shows that demand for high-bandwidth memory from AI servers is growing far faster than demand for traditional DRAM, leading to long-term lock-ups of HBM capacity, with some reports even indicating capacity pre-sold through 2026.
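A back-of-the-envelope calculation shows why bandwidth rather than compute can be the binding constraint. The sketch below assumes single-stream text generation in which every model weight is read from memory once per generated token and costs about two floating-point operations. It ignores batching, caching, and interconnect effects. The model size, bandwidth, and compute figures are hypothetical and not taken from the article.

```python
def decode_limits(params, bytes_per_param, mem_bandwidth, peak_flops):
    """Upper bounds on single-stream decode speed, in tokens per second."""
    weight_bytes = params * bytes_per_param
    memory_bound = mem_bandwidth / weight_bytes   # every weight read once per token
    compute_bound = peak_flops / (2 * params)     # ~2 FLOPs per weight per token
    return memory_bound, compute_bound


# Hypothetical accelerator and model, not taken from the article.
memory_bound, compute_bound = decode_limits(
    params=70e9, bytes_per_param=2, mem_bandwidth=3e12, peak_flops=1e15)
bottleneck = "memory" if memory_bound < compute_bound else "compute"
print(f"memory-bound limit:  {memory_bound:.1f} tokens/s")
print(f"compute-bound limit: {compute_bound:.0f} tokens/s")
print(f"binding constraint:  {bottleneck}")
```

Under these assumed numbers the memory-bound limit is far below the compute-bound limit, so adding compute without adding bandwidth would not speed up generation.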

More critically, the supply side is changing: because HBM offers significantly higher profit margins than traditional DRAM, manufacturers are actively reallocating capacity, shifting wafers from DDR4/DDR5 to HBM production. This structural crowding-out effect is causing "non-demand-driven shortages" in traditional DRAM and NAND.
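The crowding-out effect is simple arithmetic once total wafer capacity is treated as fixed. The sketch below assumes a fixed number of DRAM wafer starts and unchanged output per wafer. The 10% and 30% HBM shares are hypothetical values chosen for illustration, not figures from the article.

```python
def conventional_supply_change(hbm_share_before, hbm_share_after):
    """Fractional change in wafers left for conventional DRAM.

    Assumes a fixed total wafer count and unchanged output per wafer.
    """
    before = 1.0 - hbm_share_before
    after = 1.0 - hbm_share_after
    return after / before - 1.0


# Hypothetical shift: HBM goes from 10% to 30% of DRAM wafer starts.
change = conventional_supply_change(0.10, 0.30)
print(f"conventional DRAM wafer supply change: {change:.1%}")
```

Conventional DRAM supply shrinks here even though end demand for it has not moved, which is what "non-demand-driven shortage" means.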

Extreme market signals are already appearing: spot prices for some DRAM and NAND products have risen 15–20% within a quarter, and "intra-day price adjustments" have emerged.

5/ How Was Storage Priced in the Past?

Between 2010 and 2022, pricing in the storage industry followed a standard semiconductor cycle model:

Prices were driven by inventory cycles, not by demand structure.

When inventory decreased → prices rose → manufacturers expanded production → oversupply emerged → prices collapsed.

The core constraints of this mechanism were the "lag in capacity construction (1–2 years) + deferrable nature of demand."

For example, in the previous cycle, the DRAM industry frequently experienced substantial profit volatility on a quarterly basis, even swinging from high margins to losses and back rapidly.

However, this mechanism has been disrupted in the AI era because two variables have changed simultaneously:

  • First, demand has shifted from fragmented consumption to centralized procurement.
  • Second, supply has shifted from "free-market capacity expansion" to "profit-prioritized allocation (HBM first)."

The result is: cyclical fluctuations still exist, but price elasticity has been structurally compressed.

6/ What Structural Changes Are Happening Now?

The core change in the current (2024–2026) memory market is not just price increases, but a market structure shift from a "spot market" to a "contract allocation system."

First is the crowding-out effect of HBM. Because HBM yields significantly higher profit per wafer than DDR4/DDR5, Samsung, SK hynix, and Micron are all prioritizing capacity allocation towards HBM production. Industry data shows HBM is rapidly rising from a low single-digit share to a structural level of 40%+ of DRAM revenue.

This structural adjustment leads to two outcomes:

  • First, contraction in traditional DRAM supply.
  • Second, NAND enters a state of passive tightness.

Simultaneously, the market is entering an extreme state of supply-demand imbalance: DRAM industry revenue grew 17.1% year-over-year in Q2 2025, but the source of growth was not a demand explosion; it was jointly driven by price increases and supply constraints.
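The claim that growth came from price rather than volume can be checked with a standard decomposition: revenue growth equals price growth compounded with shipment growth. The sketch below takes the 17.1% revenue figure from the text. The candidate price increases are hypothetical, since the article does not give an average price change.

```python
def implied_volume_growth(revenue_growth, price_growth):
    """Solve (1 + revenue) = (1 + price) * (1 + volume) for volume growth."""
    return (1.0 + revenue_growth) / (1.0 + price_growth) - 1.0


REVENUE_GROWTH = 0.171  # DRAM revenue growth cited in the text

# Hypothetical average price increases, for illustration only.
for price_growth in (0.05, 0.10, 0.15):
    volume = implied_volume_growth(REVENUE_GROWTH, price_growth)
    print(f"price +{price_growth:.0%} -> implied shipment growth {volume:+.1%}")
```

The higher the assumed price increase, the less of the revenue growth is left to be explained by shipments.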

More extreme signals come from the delivery side: industry lead times have extended from the normal 8–12 weeks to 39–52 weeks, with some automotive-grade memory even exceeding 70 weeks.

This signifies a key structural change: memory is no longer an "immediately tradable commodity" but has become a "rationed resource."

This creates a positive feedback loop:

Price increases → manufacturers reduce spot supply → buyers lock in orders early → further reduces spot liquidity → prices continue to rise.
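The loop above can be illustrated with a toy simulation. This is a sketch under stated assumptions, not a market model: the function name, the update rule, and all parameters (total supply, spot demand, starting contract share, and lock-in rate) are hypothetical. The only element taken from the text is the direction of the feedback, in which higher spot prices push more volume into contracts and shrink the spot pool.

```python
def spot_squeeze(rounds=5, total_supply=100.0, spot_demand=50.0,
                 contract_share=0.55, lock_rate=0.5):
    """Toy feedback loop; all parameters are hypothetical."""
    prices = []
    for _ in range(rounds):
        spot_supply = total_supply * (1.0 - contract_share)
        price = spot_demand / spot_supply     # price index: 1.0 means balance
        prices.append(price)
        if price > 1.0:
            # Buyers respond to tight spot prices by locking in contracts,
            # which removes volume from the spot pool in the next round.
            shift = lock_rate * (price - 1.0) * (1.0 - contract_share)
            contract_share = min(contract_share + shift, 0.95)
    return prices


for round_number, price in enumerate(spot_squeeze(), start=1):
    print(f"round {round_number}: spot price index {price:.2f}")
```

Starting from a small initial tightness, each round of lock-ins leaves less spot supply for the same spot demand, so the price index rises every round in this sketch.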

7/ Who Benefits in This Structure?

The profit structure within the storage industry is undergoing a clear migration.

Tier One: Supply Side (Samsung / SK hynix / Micron)

These companies are transitioning from "cyclical manufacturers" to "AI infrastructure suppliers." Among them, SK hynix's leading position in HBM is gradually making it a holder of structural pricing power, with its DRAM market share reportedly rising to around 38%.

Tier Two: Demand Side (Microsoft / AWS / Google)

These companies are locking in future supply through long-term contracts, essentially engaging in "time arbitrage": using current capital expenditure to lock in future AI computing power and memory costs.

Tier Three: AI Model Companies (OpenAI, etc.)

They are caught between cash flow pressure and compute demand, forming a closed loop through financing → capex → locking in supply.

The key change is that pricing power is shifting from the "market" to "contract structures."

8/ Risks and Falsification Conditions

This round of the "AI memory supercycle" has at least three clear falsification conditions:

First, if AI capex enters a contraction cycle (hyperscalers reduce investment intensity), the current demand structure would quickly distort, as memory demand is highly dependent on AI compute expansion.

Second, if the HBM technology path is superseded (e.g., by new memory architectures or compute-memory fusion), the current HBM price premium would be compressed, causing capacity to flow back to DRAM/NAND.

Third, if the capacity expansion cycle re-accelerates (Samsung / SK hynix re-enter aggressive expansion), the current supply constraints could reverse into an oversupply cycle within 1–2 years.

In other words, the premise for this structure's validity is:

AI demand growth rate > capacity expansion rate + technology substitution rate
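The inequality above can be written as a small check. This is only a restatement of the condition in code. The function names and the growth rates passed in are hypothetical, and the article does not say how the three rates should be measured.

```python
def structure_margin(ai_demand_growth, capacity_growth, substitution_rate):
    """Headroom in the validity condition; positive means it holds."""
    return ai_demand_growth - (capacity_growth + substitution_rate)


def structure_holds(ai_demand_growth, capacity_growth, substitution_rate):
    return structure_margin(ai_demand_growth, capacity_growth,
                            substitution_rate) > 0


# Hypothetical annual rates, for illustration only.
print(structure_holds(0.40, 0.20, 0.05))   # demand outpaces supply and substitution
print(structure_holds(0.10, 0.20, 0.05))   # expansion outruns demand
```

Each of the three falsification conditions listed earlier corresponds to one term of the inequality moving far enough to turn the margin negative.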

Related Questions

Q: What are the three core products that constitute the storage industry according to the article?

A: According to the article, the three core products are DRAM, NAND, and HBM.

Q: Why has the storage industry long been viewed as a cyclical sector?

A: It has been viewed as cyclical because of the asymmetry between supply and demand. Demand correlates with consumer electronics and IT spending cycles, while supply, driven by fab investments, has a significant lag (12-24 months). This leads to boom-bust cycles in which new supply arrives after demand peaks, causing prices to fall.

Q: How has the rise of AI fundamentally changed the demand structure for memory?

A: AI has shifted demand from "deferrable consumption" (like phones and PCs) to a persistent computing system in which memory usage grows linearly or super-linearly with model size. This creates rigid, long-term demand (e.g., for HBM) and has led to a shift from a spot market to a contract allocation system.

Q: What is the "HBM crowding-out effect" mentioned in the article and what are its consequences?

A: The HBM crowding-out effect refers to memory manufacturers prioritizing production of HBM (which offers significantly higher profit margins) over traditional DRAM like DDR4/DDR5. This causes a structural supply squeeze for traditional DRAM and NAND, leading to non-demand-driven shortages and price increases.

Q: What are the three key conditions that could disprove or end the current "AI memory supercycle"?

A: The three key conditions are: 1) a contraction in AI capex spending by hyperscalers, 2) a technological shift that replaces HBM (e.g., new memory architectures), and 3) a renewed cycle of aggressive capacity expansion by major manufacturers like Samsung and SK hynix, leading to oversupply within 1-2 years.
