55TB to 28TB? The Rumor and Panic Behind Rubin's Memory Being Halved

marsbitXuất bản vào 2026-06-05Cập nhật gần nhất vào 2026-06-05

Tóm tắt

Title: 55TB to 28TB? The Rumor and Panic Behind the Potential Halving of Rubin's Memory. On June 4th, a report from SemiAnalysis suggested NVIDIA's next-gen Vera Rubin NVL72 AI rack may ship with roughly 28TB of SOCAMM DRAM per rack instead of the anticipated 55TB, primarily using 96GB modules. This sparked a market panic, causing Micron's stock to drop over 10% on fears of halved memory demand. However, the article argues this panic is misguided for several key reasons. First, SOCAMM modules are socketed and upgradeable, not soldered. Lower initial configuration doesn't mean permanent demand loss. Second, the primary driver is a severe 2026 LPDDR5X supply shortage, not diminished need. NVIDIA is likely prioritizing rack shipments with available components. Third, with fixed total LPDDR5X supply, using less per rack could allow NVIDIA to ship *more* racks, not necessarily reducing overall memory orders. Micron's sharp drop was also attributed to a broader semiconductor sell-off triggered by Broadcom's earnings, with the SemiAnalysis report providing a convenient narrative for profit-taking after Micron's massive rally. In summary: the report on lower default configurations is likely accurate, but interpreting it as a demand collapse is wrong. The real risk for Micron lies in its reportedly minimal HBM4 share for Rubin, not in potentially flexible SOCAMM demand. The sell-off appears more like a correction amplified by coinciding negative catalysts.

Author: Chaoxiang Research

At 4 AM on June 4, SemiAnalysis, the most influential independent research institution in the semiconductor industry, published a morning report.

The core information was just one sentence: The SOCAMM DRAM capacity per rack for NVIDIA's Vera Rubin NVL72 might drop from the previously expected ~55TB to ~28TB. Most Rubin systems will use 96GB SOCAMM modules, not the 192GB modules widely anticipated by the market.

After the news spread, the market's reaction was simple and brutal: Memory demand is halved, bearish for Micron. MU plummeted over 10% intraday, sharply falling from the all-time high of $1,089 set just the day before to $971, wiping out over a hundred billion dollars in market cap in a single day.

The panic was real, but the question is, was the panic directed correctly?

First, Let's Do the Math

The Vera Rubin NVL72 is NVIDIA's next-generation flagship AI supercomputing rack. Each rack packs 72 Rubin GPUs and 36 Vera CPUs. The GPUs use HBM4, 288GB per GPU, totaling about 20.7TB per rack; this part remains unchanged. The change is on the CPU side.

Each Vera CPU has 8 SOCAMM slots, which can accommodate modules of different capacities. The official specification NVIDIA announced at CES 2026 is "up to 1.5TB LPDDR5X per Vera CPU," corresponding to a configuration with all 8 slots filled with 192GB modules. With 36 CPUs, that's 54TB.

What this SemiAnalysis report says is: Actual shipping configurations will most likely not be fully populated. Most systems will use 96GB modules, 8×96GB=768GB per CPU, and 36 CPUs equals about 28TB.

From 55TB to 28TB, a reduction of nearly half—headline writers could easily call it "memory demand slashed in half."

But the market miscalculated a key variable in this equation.

The Logical Flaws in the Panic

First, SOCAMM is a socketed, pluggable design, not soldered on.

This is the most easily overlooked technical detail in the whole story. Unlike the LPDDR soldered onto the motherboard in the GB300 Blackwell Ultra, the Vera Rubin platform adopts the JEDEC-standardized SOCAMM2 module—pluggable, hot-swappable, and upgradeable later. You can plug in 96GB today; tomorrow, if the customer needs more, pull them out and replace them with 192GB or even 256GB, as simple as swapping RAM sticks.

NVIDIA specifically emphasized this design at CES 2026: The entire compute tray assembly time was compressed from 2 hours to 5 minutes. Modularity, maintainability, and upgradability are among the biggest architectural evolutions of Vera Rubin compared to Blackwell.

Lowering the initial shipping configuration does not equate to permanent demand disappearance. It's more like a "board the train first, pay the fare later" strategy.

Second, the reason for the capacity reduction is not "not needed," but "not enough available."

SemiAnalysis founder Dylan Patel made a meaningful remark on Twitter: "I love how most people who amplify our reports leave out most of the content of the report. This happens all the time."

Reader comments on Digg regarding this news are also telling: 77.8% of the comments considered the secondary amplification to be sensationalized and out-of-context headlines.

What was left out? The context.

Global LPDDR5X supply is extremely tight in 2026. Micron explicitly stated at the Wolfe conference in late May that memory demand significantly exceeds supply capacity, a situation expected to persist beyond 2026. Micron's full-year HBM capacity for fiscal year 2026 is already sold out, DRAM average selling prices are up over 110% year-over-year, and gross margins have soared to 74%. Samsung and SK Hynix are similarly running at full capacity with strong sales.

Against this backdrop, NVIDIA's problem is not that customers don't want more memory, but that "I can't get enough LPDDR5X chips to fill every slot."

Reducing the default SOCAMM configuration per rack is essentially supply chain management at the engineering level: Rather than delaying the delivery of entire racks due to memory shortages, it's better to ship with a lower configuration first, getting the computing power online faster.

This is not a signal of contracting demand; on the contrary, it's a signal of demand overwhelming supply.

Third, less memory per rack ≠ fewer racks overall.

The market did a simple multiplication: Memory per rack halved → total demand halved. But there's another variable in this equation: shipment volume.

If SOCAMM per rack drops from 55TB to 28TB, NVIDIA, under the same LPDDR5X supply constraints, could actually assemble more racks. The same batch of memory that could only build 100 racks before might now build close to 200.

The total consumption of LPDDR5X hasn't decreased; it's just distributed across more racks. For NVIDIA, this is a pragmatic choice to get Rubin to market faster; for memory manufacturers, total order volume may not decline.

Furthermore, the demand for CPU-side memory in inference scenarios is highly elastic. Not all workloads require 1.5TB of LPDDR5X. Large model training indeed consumes memory, but for many inference tasks, especially agentic AI and long-context reasoning, KV cache can be flexibly scheduled between HBM and LPDDR via NVLink-C2C. For many customers, 768GB of CPU-side memory is already sufficient.

Then Why Did Micron Still Drop 10%?

Because SemiAnalysis was just the second straw that broke the camel's back.

The first straw was Broadcom. Before the US market opened on June 4, Broadcom released its Q2 earnings. The numbers themselves weren't bad: revenue of $22.19 billion, up 48% year-over-year, Non-GAAP EPS of $2.44 beating expectations. But CEO Hock Tan did not raise the full-year AI chip revenue guidance of $100 billion, which the market deemed "not enough." Broadcom's stock plunged 15%, dragging down the entire semiconductor sector.

There was no company-specific negative news from Micron that day. Multiple media outlets including TipRanks, Motley Fool, and 24/7 Wall St. explicitly pointed out that this was a "collateral damage" co-movement decline. As a core AI memory chain stock, Micron is highly tied to AI capital expenditure sentiment; Broadcom's guidance led the market to re-evaluate the expected growth rate of the entire AI chip supply chain.

The dissemination of SemiAnalysis's report on the same day gave traders already looking for selling reasons a perfect narrative: not only is overall AI sentiment weakening, but even the specific numbers for memory demand are shrinking.

A trillion-dollar market cap stock, up 900% over the past year, having just hit an all-time high the previous day. At this level, any negative headline is a catalyst for profit-taking. Panic doesn't need to be correct; it just needs an excuse.

Chaoxiang Interpretation

Three judgments.

First, the SemiAnalysis report itself is accurate, but the market's interpretation of it is wrong. The default SOCAMM configuration for Rubin NVL72 will most likely indeed be lower than the theoretical maximum, determined jointly by supply chain realities and customer demand elasticity. However, there's a gap between "default configuration reduction" and "memory demand contraction"—a gap filled by a pluggable, upgradeable modular architecture and an industry reality where demand far outstrips supply.

Second, Micron's core risk currently lies not in SOCAMM, but in HBM4. SemiAnalysis reported back in February that Micron's share in NVIDIA's Rubin platform HBM4 orders was zero, with SK Hynix taking 70% and Samsung 30%. Although Micron announced HBM4 mass production shipments in March, its market share is projected to be only around 18%. Conversely, Micron's position in the SOCAMM field is very solid: it was the first to launch the 256GB SOCAMM2 module and has been a core partner for NVIDIA's SOCAMM solution for five years. The actual impact of SOCAMM configuration reduction on Micron is far less than the marginalization of its HBM4 share.

Third, the nature of this decline is profit-taking in a trillion-dollar stock after hitting an all-time high, amplified by two independent catalysts. Broadcom provided the sentiment shock; SemiAnalysis provided the narrative ammunition. Combined, they triggered a 10% pullback in a stock that had risen 9-fold over the past 12 months. From a trading perspective, this isn't called "panic"; it's called "normal."

Dylan Patel's tweet was correct: Most people who amplified his report indeed left out the most important part.

The most dangerous thing in semiconductor investing isn't getting the direction wrong; it's reading the headline correctly but calculating the formula incorrectly.

Câu hỏi Liên quan

QWhat is the core change in configuration reported by SemiAnalysis for Nvidia's Vera Rubin NVL72 system?

AAccording to the SemiAnalysis report, the configuration for Nvidia's Vera Rubin NVL72 system is likely to change from using 192GB SOCAMM memory modules to 96GB modules. This reduces the SOCAMM DRAM capacity per rack from an expected 55TB to approximately 28TB.

QWhy did Micron's stock price drop significantly following the SemiAnalysis report?

AMicron's stock price drop was due to a combination of factors. The primary catalyst was Broadcom's earnings report, which did not raise its full-year AI chip revenue guidance, causing a negative sentiment shift across the semiconductor sector. The simultaneous circulation of SemiAnalysis's report provided a specific, headline-friendly narrative of reduced memory demand, which acted as a second catalyst. This triggered profit-taking in Micron, a stock that had already surged significantly over the past year.

QWhat is a key technical detail that mitigates the impact of the reported SOCAMM capacity reduction on future demand?

AA key mitigating technical detail is that the SOCAMM memory modules in the Vera Rubin platform are socketed, not soldered onto the motherboard. This means the modules are modular, hot-swappable, and can be upgraded later. A system shipped with 96GB modules can have its capacity increased by swapping in higher-capacity modules like 192GB or 256GB, analogous to upgrading RAM in a computer.

QWhat fundamental supply-demand dynamic does the article suggest is the real reason behind the potential reduction in per-rack SOCAMM memory?

AThe article suggests the reduction is not due to a lack of demand but a significant shortage of LPDDR5X supply in 2026. Micron and other memory makers have indicated that demand far outstrips supply. Nvidia is reportedly opting for a lower default configuration per rack as a supply chain management strategy. This allows them to ship more total racks with the same constrained memory supply, getting more computational power to market faster, rather than delaying entire shipments due to component shortages.

QAccording to the article's analysis, what is a greater long-term risk for Micron than the potential SOCAMM configuration change?

AAccording to the article's 'Chao Xiang' analysis, a greater long-term risk for Micron lies in HBM4, not SOCAMM. The article cites prior SemiAnalysis reports stating that Micron holds zero share in Nvidia's initial HBM4 orders for the Rubin platform, with SK Hynix and Samsung taking the bulk. While Micron is in a strong position in the SOCAMM market, its relative marginalization in the next-generation HBM4 market poses a more significant strategic challenge.

Nội dung Liên quan

Sự phân bổ giá trị của stablecoin

Tác giả phân tích sự phân phối giá trị trong ngành stablecoin, chia thành 4 tầng: 1. **Lớp phát hành** (như Tether, Circle): Đúc stablecoin, nắm tài sản dự trữ, hưởng chênh lệch lãi suất. 2. **Lớp cơ sở hạ tầng** (như Bridge, BVNK): Kết nối stablecoin với hệ thống tài chính thực - xử lý nạp/rút tiền pháp định, đối ngân hàng, tuân thủ, quản lý tài sản. Đây là công việc phức tạp, tạo ra rào cản cạnh tranh. 3. **Lớp thu nhận/phân phối** (như Stripe, Coinbase): Tích hợp stablecoin vào hệ thống doanh nghiệp, quản lý thanh toán. 4. **Lớp ứng dụng**: Người dùng và doanh nghiệp cuối cùng. Lớp phát hành thu lợi nhuận lớn nhất. Tuy nhiên, tác giả nhấn mạnh tầm quan trọng tiềm năng của **lớp cơ sở hạ tầng**. Khó khăn thực sự không nằm ở chuyển tiền trên blockchain, mà ở việc kết nối nó với thế giới thực: đối ngân hàng, xử lý quy định, tích hợp vào quy trình nghiệp vụ doanh nghiệp. Các công ty như Stripe (mua lại Bridge) và Mastercard (mua lại BVNK) đang tranh giành để trở thành cổng kết nối mặc định này. Lớp cơ sở hạ tầng hiện ở giai đoạn đầu, vất vả, cần vốn, và bị kẹt giữa các tầng khác. Nhưng một khi stablecoin trở thành phương thức tài chính phổ biến, những công ty đã xây dựng được mạng lưới kết nối rộng khắp và sâu vào hệ thống doanh nghiệp sẽ có vị thế mạnh và quyền định giá.

marsbit5 giờ trước

marsbit5 giờ trước

Tại sao NVIDIA không thiếu tiền nhưng vẫn muốn vay 200 tỷ USD?

Tóm tắt: Mặc dù có dòng tiền tự do mạnh mẽ (khoảng 486 tỷ USD trong quý gần nhất) và không thiếu tiền mặt, NVIDIA vẫn có kế hoạch phát hành ít nhất 20 tỷ USD trái phiếu (hối phiếu cao cấp) với kỳ hạn lên đến 30 năm. Động thái này không phải vì thiếu vốn, mà là một chiến lược quản lý vốn chủ động, tận dụng xếp hạng tín dụng AA vừa được nâng cấp để huy động vốn dài hạn với chi phí thấp. Mục đích của việc vay nợ bao gồm tái cấp vốn, đầu tư vào cơ sở hạ tầng và trung tâm dữ liệu AI, nghiên cứu & phát triển, thanh toán trước cho chuỗi cung ứng và các khoản đầu tư chiến lược. So với việc phát hành thêm cổ phiếu (làm loãng lợi ích của cổ đông hiện tại), việc phát hành trái phiếu được coi là có lợi hơn cho cổ đông khi NVIDIA có thể vừa duy trì các chương trình mua lại cổ phiếu và cổ tức lớn, vừa tài trợ cho sự mở rộng dài hạn. Hành động của NVIDIA phản ánh một xu hướng rộng hơn trong ngành công nghệ, nơi các gã khổng lồ như Alphabet, Meta và Amazon cũng đang sử dụng vốn vay để tài trợ cho chu kỳ chi tiêu vốn nặng về AI. Điều này cho thấy câu chuyện đầu tư AI đang chuyển sang một giai đoạn mới, đòi hỏi nguồn vốn dài hạn và quy mô lớn cho cơ sở hạ tầng. Thách thức chính trong tương lai là khả năng sinh lời từ các khoản đầu tư AI này phải đủ cao và đúng hạn để biện minh cho kỳ vọng và cấu trúc vốn mới.

marsbit6 giờ trước

Tại sao NVIDIA không thiếu tiền nhưng vẫn muốn vay 200 tỷ USD?

marsbit6 giờ trước

Liberland Sa Thải Bộ Trưởng Công Nghệ Sau Vụ Cố Gắng Chiếm Quyền Blockchain Và Trang Web

Theo một nghị quyết chính thức được công bố, Quốc hội Liberland - một dự án vi quốc gia - đã bỏ phiếu bãi nhiệm Bộ trưởng Công nghệ Dorian Stern Vukotić. Ông bị cáo buộc thực hiện một loạt hành động nhằm giành quyền kiểm soát cơ sở hạ tầng kỹ thuật của dự án, bao gồm việc gỡ bỏ bảo vệ đa chữ ký (multisig) trên tài khoản quản trị Sudo, cố gắng chiếm quyền điều khiển tên miền Liberland.org, chặn quyền bỏ phiếu của Tổng thống Vít Jedlička và phát hành token trái phép. Sự việc này nêu bật những rủi ro quản trị thực tế trong không gian crypto, vượt ra ngoài các lỗ hổng hợp đồng thông minh đơn thuần. Nó liên quan đến quyền kiểm soát các điểm then chốt ngoài chuỗi như tài khoản quản trị, tên miền website, quyền truy cập biểu quyết và thiết kế đa chữ ký. Tranh chấp tại Liberland trở thành một nghiên cứu điển hình về cách thức các dự án phụ thuộc vào sự pha trộn giữa quản trị on-chain và các cơ chế kiểm soát off-chain. Bài học chính cho người dùng crypto là những tuyên bố về phi tập trung cần được kiểm chứng với thực tế vận hành. Nếu chỉ một số ít người có thể kiểm soát các chức năng quản trị hoặc quyền truy cập then chốt, thì hệ thống quản trị vẫn có thể dễ bị tổn thương. Sự kiện này phản ánh xu hướng rộng hơn trong thị trường crypto, nơi cơ sở hạ tầng, bảo mật và quản trị ngày càng trở nên quan trọng ngang bằng với biến động giá.

bitcoinist7 giờ trước

Liberland Sa Thải Bộ Trưởng Công Nghệ Sau Vụ Cố Gắng Chiếm Quyền Blockchain Và Trang Web

bitcoinist7 giờ trước

Làm Thế Nào Để Nghiên Cứu Tốt: Rèn Luyện Những Khả Năng Thực Sự Có Thể 'Luyện Tập Chủ Động'

Chưa ai thực sự dạy bạn cách làm nghiên cứu. Hầu hết mọi người chỉ học cách "trông giống" một nhà nghiên cứu, trong khi năng lực thực sự là sự tích lũy các kỹ năng có thể rèn luyện có chủ ý. **Chọn vấn đề của riêng bạn:** Đừng chỉ tiếp nhận vấn đề từ người khác. Hãy bắt đầu từ một kết quả bạn thực sự muốn đạt được và suy ngược để thiết kế thí nghiệm. Điều này tạo ra tính độc đáo. "Khiếu thẩm mỹ" trong nghiên cứu giống như một cơ bắp, có thể phát triển thông qua việc dự đoán kết quả thí nghiệm và kiểm tra lại các dự đoán đó. **Nâng cấp đầu vào:** Đừng chỉ đọc các xu hướng mới nhất. Tài liệu cũ (như bài phát biểu của Claude Shannon năm 1952) và các lĩnh vực rộng (thần kinh học, thống kê, kiến trúc máy tính) thường mang lại những hiểu biết sâu sắc có giá trị. Hãy đọc chính bài báo gốc, đặc biệt là phần phụ lục và hạn chế. **Viết mọi thứ xuống:** Viết lách là cơ chế phòng thủ rẻ nhất chống lại sự tự lừa dối. Nó phơi bày những lỗ hổng trong suy nghĩ. Giữ một cuốn nhật ký ghi lại giả thuyết, cài đặt, dự đoán, kết quả và bài học. Xem lại nó sẽ dạy bạn sự khiêm tốn. Tóm lại, nghiên cứu hiệu quả dựa trên việc chủ động chọn vấn đề, mở rộng nguồn tri thức và ghi chép trung thực để rèn giũa tư duy.

marsbit8 giờ trước

Làm Thế Nào Để Nghiên Cứu Tốt: Rèn Luyện Những Khả Năng Thực Sự Có Thể 'Luyện Tập Chủ Động'

marsbit8 giờ trước

Tăng Trưởng Vượt 150% Trong Nửa Tháng, Đằng Sau Sự Bùng Nổ Của Backpack

Backpack (BP), token của sàn giao dịch cùng tên, đã tăng hơn 150% trong nửa tháng, đạt mức cao 0,48 USD. Động lực chính đến từ việc ra mắt nền tảng Backpack Securities vào đầu tháng 6, cung cấp dịch vụ môi giới cổ phiếu Mỹ được quy định và hỗ trợ token hóa tài sản truyền thống. Sự kiện đáng chú ý là việc phát hành token SPCX, được neo 1:1 với cổ phiếu SpaceX thật, trên Solana vào ngày 12/6. Sản phẩm này cho phép giao dịch trên chuỗi 24/7 và có thể đổi ngược về tài khoản chứng khoán truyền thống, tạo cầu nối giữa thị trường vốn truyền thống và DeFi. BP có tổng cung 10 tỷ token với cơ chế phân phối độc đáo: 25% đầu tiên được airdrop cho cộng đồng, 37.5% tiếp theo sẽ mở khóa theo các cột mốc phát triển sản phẩm, và 37.5% cuối cùng sẽ được khóa cho đến sau IPO. Đội ngũ phát triển không nhận token ngay từ đầu. Người dùng có thể stake BP để nhận quyền chuyển đổi token thành cổ phần công ty khi công ty IPO, cùng các đặc quyền khác. Hiện khoảng 66% lượng token lưu hành đang được stake. Mặc dù từng có tranh cãi về cơ chế airdrop, sự chú ý của thị trường hiện đang dồn vào sự mở rộng thực tế của nền tảng và tiềm năng lâu dài của token BP trong hệ sinh thái mới kết hợp chứng khoán và tài sản token hóa.

marsbit8 giờ trước

Tăng Trưởng Vượt 150% Trong Nửa Tháng, Đằng Sau Sự Bùng Nổ Của Backpack

marsbit8 giờ trước

Giao dịch

Giao ngay

Hợp đồng Tương lai

Bài viết Nổi bật

Làm thế nào để Mua 4

Chào mừng bạn đến với HTX.com! Chúng tôi đã làm cho mua 4 (4) trở nên đơn giản và thuận tiện. Làm theo hướng dẫn từng bước của chúng tôi để bắt đầu hành trình tiền kỹ thuật số của bạn.Bước 1: Tạo Tài khoản HTX của BạnSử dụng email hoặc số điện thoại của bạn để đăng ký tài khoản miễn phí trên HTX. Trải nghiệm hành trình đăng ký không rắc rối và mở khóa tất cả tính năng. Nhận Tài khoản của tôiBước 2: Truy cập Mua Crypto và Chọn Phương thức Thanh toán của BạnThẻ Tín dụng/Ghi nợ: Sử dụng Visa hoặc Mastercard của bạn để mua 4 (4) ngay lập tức.Số dư: Sử dụng tiền từ số dư tài khoản HTX của bạn để giao dịch liền mạch.Bên thứ ba: Chúng tôi đã thêm những phương thức thanh toán phổ biến như Google Pay và Apple Pay để nâng cao sự tiện lợi.P2P: Giao dịch trực tiếp với người dùng khác trên HTX.Thị trường mua bán phi tập trung (OTC): Chúng tôi cung cấp những dịch vụ được thiết kế riêng và tỷ giá hối đoái cạnh tranh cho nhà giao dịch.Bước 3: Lưu trữ 4 (4) của BạnSau khi mua 4 (4), lưu trữ trong tài khoản HTX của bạn. Ngoài ra, bạn có thể gửi đi nơi khác qua chuyển khoản blockchain hoặc sử dụng để giao dịch những tiền kỹ thuật số khác.Bước 4: Giao dịch 4 (4)Giao dịch 4 (4) dễ dàng trên thị trường giao ngay của HTX. Chỉ cần truy cập vào tài khoản của bạn, chọn cặp giao dịch, thực hiện giao dịch và theo dõi trong thời gian thực. Chúng tôi cung cấp trải nghiệm thân thiện với người dùng cho cả người mới bắt đầu và người giao dịch dày dạn kinh nghiệm.

Tổng lượt xem 550Xuất bản vào 2025.10.20Cập nhật vào 2026.06.02

Thảo luận

Chào mừng đến với Cộng đồng HTX. Tại đây, bạn có thể được thông báo về những phát triển nền tảng mới nhất và có quyền truy cập vào thông tin chuyên sâu về thị trường. Ý kiến của người dùng về giá của 4 (4) được trình bày dưới đây.