55TB to 28TB? The Rumor and Panic Behind Rubin's Memory Being Halved

marsbitPublicado a 2026-06-05Actualizado a 2026-06-05

Resumen

Title: 55TB to 28TB? The Rumor and Panic Behind the Potential Halving of Rubin's Memory. On June 4th, a report from SemiAnalysis suggested NVIDIA's next-gen Vera Rubin NVL72 AI rack may ship with roughly 28TB of SOCAMM DRAM per rack instead of the anticipated 55TB, primarily using 96GB modules. This sparked a market panic, causing Micron's stock to drop over 10% on fears of halved memory demand. However, the article argues this panic is misguided for several key reasons. First, SOCAMM modules are socketed and upgradeable, not soldered. Lower initial configuration doesn't mean permanent demand loss. Second, the primary driver is a severe 2026 LPDDR5X supply shortage, not diminished need. NVIDIA is likely prioritizing rack shipments with available components. Third, with fixed total LPDDR5X supply, using less per rack could allow NVIDIA to ship *more* racks, not necessarily reducing overall memory orders. Micron's sharp drop was also attributed to a broader semiconductor sell-off triggered by Broadcom's earnings, with the SemiAnalysis report providing a convenient narrative for profit-taking after Micron's massive rally. In summary: the report on lower default configurations is likely accurate, but interpreting it as a demand collapse is wrong. The real risk for Micron lies in its reportedly minimal HBM4 share for Rubin, not in potentially flexible SOCAMM demand. The sell-off appears more like a correction amplified by coinciding negative catalysts.

Author: Chaoxiang Research

At 4 AM on June 4, SemiAnalysis, the most influential independent research institution in the semiconductor industry, published a morning report.

The core information was just one sentence: The SOCAMM DRAM capacity per rack for NVIDIA's Vera Rubin NVL72 might drop from the previously expected ~55TB to ~28TB. Most Rubin systems will use 96GB SOCAMM modules, not the 192GB modules widely anticipated by the market.

After the news spread, the market's reaction was simple and brutal: Memory demand is halved, bearish for Micron. MU plummeted over 10% intraday, sharply falling from the all-time high of $1,089 set just the day before to $971, wiping out over a hundred billion dollars in market cap in a single day.

The panic was real, but the question is, was the panic directed correctly?

First, Let's Do the Math

The Vera Rubin NVL72 is NVIDIA's next-generation flagship AI supercomputing rack. Each rack packs 72 Rubin GPUs and 36 Vera CPUs. The GPUs use HBM4, 288GB per GPU, totaling about 20.7TB per rack; this part remains unchanged. The change is on the CPU side.

Each Vera CPU has 8 SOCAMM slots, which can accommodate modules of different capacities. The official specification NVIDIA announced at CES 2026 is "up to 1.5TB LPDDR5X per Vera CPU," corresponding to a configuration with all 8 slots filled with 192GB modules. With 36 CPUs, that's 54TB.

What this SemiAnalysis report says is: Actual shipping configurations will most likely not be fully populated. Most systems will use 96GB modules, 8×96GB=768GB per CPU, and 36 CPUs equals about 28TB.

From 55TB to 28TB, a reduction of nearly half—headline writers could easily call it "memory demand slashed in half."

But the market miscalculated a key variable in this equation.

The Logical Flaws in the Panic

First, SOCAMM is a socketed, pluggable design, not soldered on.

This is the most easily overlooked technical detail in the whole story. Unlike the LPDDR soldered onto the motherboard in the GB300 Blackwell Ultra, the Vera Rubin platform adopts the JEDEC-standardized SOCAMM2 module—pluggable, hot-swappable, and upgradeable later. You can plug in 96GB today; tomorrow, if the customer needs more, pull them out and replace them with 192GB or even 256GB, as simple as swapping RAM sticks.

NVIDIA specifically emphasized this design at CES 2026: The entire compute tray assembly time was compressed from 2 hours to 5 minutes. Modularity, maintainability, and upgradability are among the biggest architectural evolutions of Vera Rubin compared to Blackwell.

Lowering the initial shipping configuration does not equate to permanent demand disappearance. It's more like a "board the train first, pay the fare later" strategy.

Second, the reason for the capacity reduction is not "not needed," but "not enough available."

SemiAnalysis founder Dylan Patel made a meaningful remark on Twitter: "I love how most people who amplify our reports leave out most of the content of the report. This happens all the time."

Reader comments on Digg regarding this news are also telling: 77.8% of the comments considered the secondary amplification to be sensationalized and out-of-context headlines.

What was left out? The context.

Global LPDDR5X supply is extremely tight in 2026. Micron explicitly stated at the Wolfe conference in late May that memory demand significantly exceeds supply capacity, a situation expected to persist beyond 2026. Micron's full-year HBM capacity for fiscal year 2026 is already sold out, DRAM average selling prices are up over 110% year-over-year, and gross margins have soared to 74%. Samsung and SK Hynix are similarly running at full capacity with strong sales.

Against this backdrop, NVIDIA's problem is not that customers don't want more memory, but that "I can't get enough LPDDR5X chips to fill every slot."

Reducing the default SOCAMM configuration per rack is essentially supply chain management at the engineering level: Rather than delaying the delivery of entire racks due to memory shortages, it's better to ship with a lower configuration first, getting the computing power online faster.

This is not a signal of contracting demand; on the contrary, it's a signal of demand overwhelming supply.

Third, less memory per rack ≠ fewer racks overall.

The market did a simple multiplication: Memory per rack halved → total demand halved. But there's another variable in this equation: shipment volume.

If SOCAMM per rack drops from 55TB to 28TB, NVIDIA, under the same LPDDR5X supply constraints, could actually assemble more racks. The same batch of memory that could only build 100 racks before might now build close to 200.

The total consumption of LPDDR5X hasn't decreased; it's just distributed across more racks. For NVIDIA, this is a pragmatic choice to get Rubin to market faster; for memory manufacturers, total order volume may not decline.

Furthermore, the demand for CPU-side memory in inference scenarios is highly elastic. Not all workloads require 1.5TB of LPDDR5X. Large model training indeed consumes memory, but for many inference tasks, especially agentic AI and long-context reasoning, KV cache can be flexibly scheduled between HBM and LPDDR via NVLink-C2C. For many customers, 768GB of CPU-side memory is already sufficient.

Then Why Did Micron Still Drop 10%?

Because SemiAnalysis was just the second straw that broke the camel's back.

The first straw was Broadcom. Before the US market opened on June 4, Broadcom released its Q2 earnings. The numbers themselves weren't bad: revenue of $22.19 billion, up 48% year-over-year, Non-GAAP EPS of $2.44 beating expectations. But CEO Hock Tan did not raise the full-year AI chip revenue guidance of $100 billion, which the market deemed "not enough." Broadcom's stock plunged 15%, dragging down the entire semiconductor sector.

There was no company-specific negative news from Micron that day. Multiple media outlets including TipRanks, Motley Fool, and 24/7 Wall St. explicitly pointed out that this was a "collateral damage" co-movement decline. As a core AI memory chain stock, Micron is highly tied to AI capital expenditure sentiment; Broadcom's guidance led the market to re-evaluate the expected growth rate of the entire AI chip supply chain.

The dissemination of SemiAnalysis's report on the same day gave traders already looking for selling reasons a perfect narrative: not only is overall AI sentiment weakening, but even the specific numbers for memory demand are shrinking.

A trillion-dollar market cap stock, up 900% over the past year, having just hit an all-time high the previous day. At this level, any negative headline is a catalyst for profit-taking. Panic doesn't need to be correct; it just needs an excuse.

Chaoxiang Interpretation

Three judgments.

First, the SemiAnalysis report itself is accurate, but the market's interpretation of it is wrong. The default SOCAMM configuration for Rubin NVL72 will most likely indeed be lower than the theoretical maximum, determined jointly by supply chain realities and customer demand elasticity. However, there's a gap between "default configuration reduction" and "memory demand contraction"—a gap filled by a pluggable, upgradeable modular architecture and an industry reality where demand far outstrips supply.

Second, Micron's core risk currently lies not in SOCAMM, but in HBM4. SemiAnalysis reported back in February that Micron's share in NVIDIA's Rubin platform HBM4 orders was zero, with SK Hynix taking 70% and Samsung 30%. Although Micron announced HBM4 mass production shipments in March, its market share is projected to be only around 18%. Conversely, Micron's position in the SOCAMM field is very solid: it was the first to launch the 256GB SOCAMM2 module and has been a core partner for NVIDIA's SOCAMM solution for five years. The actual impact of SOCAMM configuration reduction on Micron is far less than the marginalization of its HBM4 share.

Third, the nature of this decline is profit-taking in a trillion-dollar stock after hitting an all-time high, amplified by two independent catalysts. Broadcom provided the sentiment shock; SemiAnalysis provided the narrative ammunition. Combined, they triggered a 10% pullback in a stock that had risen 9-fold over the past 12 months. From a trading perspective, this isn't called "panic"; it's called "normal."

Dylan Patel's tweet was correct: Most people who amplified his report indeed left out the most important part.

The most dangerous thing in semiconductor investing isn't getting the direction wrong; it's reading the headline correctly but calculating the formula incorrectly.

Preguntas relacionadas

QWhat is the core change in configuration reported by SemiAnalysis for Nvidia's Vera Rubin NVL72 system?

AAccording to the SemiAnalysis report, the configuration for Nvidia's Vera Rubin NVL72 system is likely to change from using 192GB SOCAMM memory modules to 96GB modules. This reduces the SOCAMM DRAM capacity per rack from an expected 55TB to approximately 28TB.

QWhy did Micron's stock price drop significantly following the SemiAnalysis report?

AMicron's stock price drop was due to a combination of factors. The primary catalyst was Broadcom's earnings report, which did not raise its full-year AI chip revenue guidance, causing a negative sentiment shift across the semiconductor sector. The simultaneous circulation of SemiAnalysis's report provided a specific, headline-friendly narrative of reduced memory demand, which acted as a second catalyst. This triggered profit-taking in Micron, a stock that had already surged significantly over the past year.

QWhat is a key technical detail that mitigates the impact of the reported SOCAMM capacity reduction on future demand?

AA key mitigating technical detail is that the SOCAMM memory modules in the Vera Rubin platform are socketed, not soldered onto the motherboard. This means the modules are modular, hot-swappable, and can be upgraded later. A system shipped with 96GB modules can have its capacity increased by swapping in higher-capacity modules like 192GB or 256GB, analogous to upgrading RAM in a computer.

QWhat fundamental supply-demand dynamic does the article suggest is the real reason behind the potential reduction in per-rack SOCAMM memory?

AThe article suggests the reduction is not due to a lack of demand but a significant shortage of LPDDR5X supply in 2026. Micron and other memory makers have indicated that demand far outstrips supply. Nvidia is reportedly opting for a lower default configuration per rack as a supply chain management strategy. This allows them to ship more total racks with the same constrained memory supply, getting more computational power to market faster, rather than delaying entire shipments due to component shortages.

QAccording to the article's analysis, what is a greater long-term risk for Micron than the potential SOCAMM configuration change?

AAccording to the article's 'Chao Xiang' analysis, a greater long-term risk for Micron lies in HBM4, not SOCAMM. The article cites prior SemiAnalysis reports stating that Micron holds zero share in Nvidia's initial HBM4 orders for the Rubin platform, with SK Hynix and Samsung taking the bulk. While Micron is in a strong position in the SOCAMM market, its relative marginalization in the next-generation HBM4 market poses a more significant strategic challenge.

Lecturas Relacionadas

Earning 1000 Trillion in Half a Year, 'Pocketing' 20 Million per Capita: This Round of Wealth Creation in the Korean Stock Market is Unprecedented in Scale

The South Korean stock market is experiencing an unprecedented wealth surge in 2026, with household equity and fund asset values soaring by over 1,000 trillion KRW (~$730bn) year-to-date. This translates to an average per capita wealth increase of roughly 20 million KRW, fueled by a historic 109% rally in the KOSPI index. The boom is driven by three converging forces: an AI-driven semiconductor supercycle boosting giants like Samsung and SK Hynix; the government's "Value-Up" market reforms addressing long-standing corporate governance issues; and aggressive real estate regulations that have locked capital within financial markets, preventing profits from flowing back into property. This has triggered a wealth effect, boosting high-end consumption significantly. However, the gains are highly concentrated. The two semiconductor behemoths account for over half the index's value, but retail investors own relatively low stakes in them, systematically missing the biggest rallies. Wealth and consumption benefits are skewed towards luxury goods and imported cars, bypassing mainstream retail. Further risks stem from excessive leverage, with high trading volume in leveraged ETFs, and a market sentiment heavily reliant on the AI sector's fortunes and speculative rumors. While this cycle marks a potential shift from real estate to equities as a primary wealth generator for Koreans, its sustainability, amid structural imbalances and leverage, remains a critical test.

marsbitHace 6 min(s)

Earning 1000 Trillion in Half a Year, 'Pocketing' 20 Million per Capita: This Round of Wealth Creation in the Korean Stock Market is Unprecedented in Scale

marsbitHace 6 min(s)

Behind ZEC's Over 30% Plunge: An 'Unlimited Minting' Vulnerability with No Way to Prove if It Was Ever Exploited

A critical vulnerability was discovered in Zcash's Orchard privacy pool, allowing for the theoretical creation of undetectable counterfeit ZEC. Researcher Taylor Hornby found the flaw on May 29th, 2024, within the Orchard circuit's cryptographic constraints, which could let an attacker bypass asset conservation rules. Although a rapid emergency fix was deployed within days via a coordinated soft and hard fork, a core uncertainty remains: due to Orchard's privacy features, it is impossible to cryptographically prove whether this "unlimited mint" flaw was exploited in the nearly four years since the pool's 2022 launch. This uncertainty, rather than the patched flaw itself, triggered a market panic, causing ZEC's price to drop over 30%. While the Zcash Foundation stated no evidence of exploitation was found, independent entity Shielded Labs emphasized the impossibility of definitively proving no counterfeit ZEC was ever created. The incident highlights the unique trust challenge in privacy systems. To address this, developers are proposing a new network upgrade with enhanced auditing to allow verifiable proof of supply integrity. Notably, the researcher utilized the newly released AI model Claude Opus 4.8 as a tool during the security review, signaling the growing role of advanced AI in uncovering complex cryptographic vulnerabilities.

marsbitHace 8 min(s)

Behind ZEC's Over 30% Plunge: An 'Unlimited Minting' Vulnerability with No Way to Prove if It Was Ever Exploited

marsbitHace 8 min(s)

imToken's 10th Anniversary Unveils Strategic Direction for the Next Decade: Evolving from a Trusted Main Wallet to a Personal Digital Hub

On its tenth anniversary, decentralized wallet imToken announced its strategic vision for the next decade: evolving from a "trusted main wallet" into a "personal control interface." This new direction aims to help users manage not only digital assets but also identity, permissions, and AI agent actions in an increasingly open and intelligent internet. imToken outlined that while the past decade focused on Store, Send, and Stake—securing assets, enabling transfers, and facilitating network participation—the future introduces a fourth core proposition: Sign. This expanded concept goes beyond transaction signing to encompass expressing intent, granting permissions, setting rules, delegating actions, and revoking authorizations. As AI agents gain autonomy, imToken emphasizes the need for clear, verifiable, and revocable user control over their actions. CEO Ben He stated that imToken's mission is shifting from enabling ownership of digital assets to ensuring user sovereignty over their entire digital world in the AI era. The company's core principle has been upgraded from "Digital Assets, Under Your Control" to "Your Digital World, Under Your Control." Future development will focus on three areas: upholding self-custody principles, extending security from transactions to authorizations and automated actions, and building product capabilities for managing permissions, delegations, policies, and revocations. imToken views the wallet's role as expanding into a trusted control interface for human-AI collaboration, where managing keys, signatures, and permissions forms the infrastructure for personal digital sovereignty. Founded in 2016, imToken serves millions of users across 150+ countries, providing non-custodial wallet services supporting over 50 blockchain networks.

marsbitHace 9 min(s)

imToken's 10th Anniversary Unveils Strategic Direction for the Next Decade: Evolving from a Trusted Main Wallet to a Personal Digital Hub

marsbitHace 9 min(s)

With 300 Million Financing to Accumulate ETH, the Hidden Concerns Behind BitMine's High-Yield Preferred Shares

BitMine, led by Thomas Lee, plans to raise up to $300 million through an initial public offering of 3 million shares of perpetual Series A preferred stock on the NYSE (ticker: BMNP). The stock offers a fixed 9.5% annual dividend. The funds are intended to further the company's accumulation of Ethereum, expand its staking node operations, and for general corporate purposes. This move comes as BitMine faces significant challenges. Its massive Ethereum holdings, over 5.3 million ETH (roughly 4.5% of circulating supply), are currently at an unrealized loss exceeding $8.5 billion due to the crypto market downturn. The company's core business model relies on staking these ETH holdings to generate yield, which it presents as the primary means to cover the new, substantial annual dividend obligation of approximately $28.5 million if the offering is fully subscribed. While the model is similar to MicroStrategy's bitcoin-focused strategy of using capital markets to fund crypto acquisitions, BitMine's product differs with its fixed, non-adjustable dividend rate. The company acknowledges risks, stating dividend payments could also come from cash reserves, asset sales, or future financing, and warns that staking yields may underperform or be illiquid during market stress. The 9.5% fixed rate reflects the higher risk premium demanded from investors for a company heavily exposed to Ethereum's volatility.

Foresight NewsHace 12 min(s)

With 300 Million Financing to Accumulate ETH, the Hidden Concerns Behind BitMine's High-Yield Preferred Shares

Foresight NewsHace 12 min(s)

A Year of Observing Agent Payments: The Cold Reality Behind the Hot Narrative

A Year in Agent Payments: The Cold Reality Behind a Hot Narrative This article examines the current state of "Agent payments," a year after it became a major trend at the intersection of AI, payments, and crypto. Despite significant investments from major players like Stripe, Visa, and Google, the author—having built products and spoken with merchants and developers—finds genuine, large-scale demand still lacking. Key findings across several hyped scenarios reveal structural challenges: * **Agent-to-Merchant Commerce:** For most product categories (e.g., clothing, electronics), AI shopping via chat is inferior to traditional visual e-commerce. Merchant interest is largely defensive, focused on future-proofing rather than current consumer demand. True potential exists only in specific, high-frequency/low-decision scenarios (like food orders) or for simplifying broken checkout experiences, but these require massive consumer distribution, favoring incumbents. * **Agent-to-API/Machine Commerce:** While stablecoin micropayments are touted for API calls, developers already solve small-value payments via prepaid credits and subscriptions. Large SaaS providers prefer enterprise contracts over fragmented micro-pricing. The market exists for long-tail services outside the top providers but is inherently smaller than the hype suggests. * **Agent-to-Agent Payments:** This remains a theoretical long-term vision with negligible real transaction volume. The core challenges—discovery, trust, negotiation, dispute resolution—are unsolved. While the potential for a new, high-speed settlement layer is real, it is not the current market. * **Agent Finance:** This is the sole area with existing, paying customers (fund managers, DeFi users). AI enhances real-time monitoring and autonomous rebalancing, offering real capability gains. However, competition favors established, regulated institutions with existing licenses and client relationships. The author concludes that the core deficiency in the Agent economy is not merely a payment layer, but a more complex **coordination** capability—figuring out how Agents and humans work together, verify task completion, and settle outcomes. Payment is just one component of settlement, which is itself part of coordination. For large companies, investing now is a defensive, long-term bet with minimal cost. For startups, however, the imperative is to find markets that exist today, not wait for a future wave that remains on the horizon.

marsbitHace 14 min(s)

A Year of Observing Agent Payments: The Cold Reality Behind the Hot Narrative

marsbitHace 14 min(s)

Trading

Spot

Futuros

Artículos destacados

Cómo comprar 4

¡Bienvenido a HTX.com! Hemos hecho que comprar 4 (4) sea simple y conveniente. Sigue nuestra guía paso a paso para iniciar tu viaje de criptos.Paso 1: crea tu cuenta HTXUtiliza tu correo electrónico o número de teléfono para registrarte y obtener una cuenta gratuita en HTX. Experimenta un proceso de registro sin complicaciones y desbloquea todas las funciones.Obtener mi cuentaPaso 2: ve a Comprar cripto y elige tu método de pagoTarjeta de crédito/débito: usa tu Visa o Mastercard para comprar 4 (4) al instante.Saldo: utiliza fondos del saldo de tu cuenta HTX para tradear sin problemas.Terceros: hemos agregado métodos de pago populares como Google Pay y Apple Pay para mejorar la comodidad.P2P: tradear directamente con otros usuarios en HTX.Over-the-Counter (OTC): ofrecemos servicios personalizados y tipos de cambio competitivos para los traders.Paso 3: guarda tu 4 (4)Después de comprar tu 4 (4), guárdalo en tu cuenta HTX. Alternativamente, puedes enviarlo a otro lugar mediante transferencia blockchain o utilizarlo para tradear otras criptomonedas.Paso 4: tradear 4 (4)Tradear fácilmente con 4 (4) en HTX's mercado spot. Simplemente accede a tu cuenta, selecciona tu par de trading, ejecuta tus trades y monitorea en tiempo real. Ofrecemos una experiencia fácil de usar tanto para principiantes como para traders experimentados.

742 Vistas totalesPublicado en 2025.10.20Actualizado en 2026.06.02

Discusiones

Bienvenido a la comunidad de HTX. Aquí puedes mantenerte informado sobre los últimos desarrollos de la plataforma y acceder a análisis profesionales del mercado. A continuación se presentan las opiniones de los usuarios sobre el precio de 4 (4).