From Token Explosion to Physical Bottlenecks: The Storage Bull Market Driven by Agentic AI

marsbit | Published 2026-05-22 | Last updated 2026-05-22

Summary

**From Token Explosion to Physical Bottlenecks: The Agentic AI-Driven Storage Bull Market** The AI semiconductor narrative is shifting from training to inference, which now accounts for 66% of AI compute. In the inference "Decode" phase (autoregressive token generation), GPU performance is bottlenecked by memory bandwidth and capacity, not raw compute (FLOPS). The key constraints are **HBM (High Bandwidth Memory) bandwidth** (determining token generation speed) and **HBM capacity** (determining how many requests/models can be served simultaneously). This creates a core economics equation: Token cost is proportional to (GPU + power cost) divided by Tokens/sec, which is fundamentally limited by HBM specs. This drives unprecedented demand for advanced storage. **HBM**, a 3D-stacked DRAM, is critical for AI accelerators. Its complex production consumes 3-4x more wafer capacity than standard DRAM, squeezing supply for traditional memory (DDR) and causing severe shortages. **HBF (High Bandwidth Flash)**, an emerging high-bandwidth NAND, aims to bridge the gap between HBM speed and SSD capacity for AI model weights. The market is experiencing a historic, structurally driven super-cycle. Demand is fueled by a triple engine: 1) AI training (parameter arms race), 2) AI inference explosion (especially Agentic AI with long contexts), and 3) general data center expansion. Supply is constrained by the HBM capacity-squeeze effect and the 2-3 year lead time for new fab capacity. Analysts project a DRA...

Author: XinGPT

To understand storage, one must first understand what has happened across the entire semiconductor industry chain.

The Starting Point of the Story: Inference Has Surpassed Training

For the past two years, the semiconductor story has been "NVIDIA + Training Clusters." Starting in 2026, however, demand for AI Agents exploded, most notably with the rapid growth of Claude Code: Anthropic's ARR surged from $90 billion to $300 billion in just 4 months.

Inference now accounts for 66% of AI computational load, whereas just two or three years ago, this number was only 33%. That is to say, in three years, the positions of inference and training have completely reversed.

And what does this mean?

Let's take the NVIDIA Blackwell B200 GPU as an example. Structurally, the B200 is composed of two Blackwell dies connected via the NV-HBI high-bandwidth interface, with die-to-die bandwidth reaching 10TB/s. The entire GPU has approximately 208 billion transistors, 148 SMs, 20,480 CUDA Cores, and 1,024 Tensor Cores, primarily responsible for matrix operations in large models.

Inside the GPU, data flows in layers based on their distance from the computing cores:

The first layer is Registers. They are closest to the computing cores, fastest in speed, and used to hold data currently being computed.

The second layer is L1 / Shared Memory. It is located inside each SM, used to cache short-term, frequently accessed data, reducing access to more distant memory.

The third layer is TMEM, or Tensor Memory. This is an important design introduced by Blackwell for Tensor Cores, intended to bring key data in matrix calculations closer to the computing units, improving Tensor Core utilization.

The fourth layer is L2 Cache. The B200 has approximately 126MB of shared L2 Cache, responsible for caching and reusing data among multiple SMs, especially suitable for repeatedly accessing model weights in inference scenarios.

The fifth layer is HBM3e. The B200 is equipped with 192GB of HBM3e, with a bandwidth of about 8TB/s, serving as the primary storage area for model weights, KV Cache, activation values, and input data. For large model inference, especially during the Decode phase, HBM bandwidth often directly determines token generation speed.

HBM stores large-scale model data, L2 / L1 / TMEM are responsible for delivering high-frequency data to the computing cores step by step, and Tensor Core / CUDA Core perform matrix operations. The calculation results are then written back to the cache or HBM and work collaboratively with other GPUs via NVLink / NV-HBI.

When a GPU performs AI inference, processing a prompt is divided into two phases: Prefill and Decode—which can be roughly understood as "Reading the Question" and "Answering the Question".

Phase One: Prefill (Context Processing, i.e., "Reading the Question")

You input a prompt, say 4096 tokens. The model's first task is to read all 4096 tokens simultaneously, figuring out the relationship between each token and every other token. This is called the Attention Mechanism, which requires calculating an S × S matrix, where S is the sequence length, here S = 4096.

There is a dedicated hardware unit in the GPU for matrix multiplication called the Tensor Core, which can be thought of as a "Matrix Calculator." In this phase, Tensor Core utilization is as high as 90-95%, working at full capacity.

A key concept here is arithmetic intensity, meaning "how many operations can be performed for every 1 byte of data read from memory." In the Prefill phase, this number is 200-400 ops/byte, which is very high, indicating that data read once can be reused many times.

Therefore, the bottleneck in the Prefill phase is computational power (FLOPS, floating-point operations per second), not memory bandwidth; HBM is under little pressure.

Phase Two: Decode (Token Generation, i.e., "Answering the Question")

After the Prefill phase ends, the model enters the Decode phase, which is the token-by-token answer generation stage. Models typically adopt an autoregressive generation mode.

Autoregressive means the model does not generate a complete answer at once, but outputs one token at a time. Each new token depends on all previously generated tokens, so the longer the conversation, the heavier the burden of each subsequent generation step becomes.

For the GPU to generate each token, it needs to read two types of critical data from HBM high-bandwidth memory: The first is model weights, i.e., the parameters of the model itself.

The second is KV Cache. KV Cache is the intermediate result cache left by the attention mechanism, where K stands for Key and V for Value. Its purpose is to avoid the model recalculating the attention results for all past tokens every time, thereby accelerating long-text generation. However, the cost is that as the context grows longer, the KV Cache keeps increasing in size, occupying more and more HBM capacity and bandwidth.

For example, a 70B-parameter model stored in FP16 precision needs about 140GB for its weights. That is to say, to generate 1 token and complete one forward pass, the GPU needs to read approximately 140GB of weight data from HBM.

The problem is that although the Decode phase reads a lot of data, the actual computation needed is minimal. At this point, arithmetic intensity becomes very low.

In the Decode phase, arithmetic intensity might be only 1-2 ops/byte, meaning only 1-2 operations are performed for every byte read. The result is that Tensor Cores finish computing quickly, but subsequent time is spent waiting for the next batch of data to be delivered from HBM.

This is also why GPU utilization in large model inference is often only 20%-40%. High-end GPUs costing tens of thousands of dollars are often not operating at full computational load but are waiting for memory.

Therefore, the performance bottleneck in the Decode phase has shifted from FLOPS, i.e., theoretical computational power, to HBM bandwidth.
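The contrast between the two phases can be sketched with a simple roofline-style bound. This is a minimal illustration, not a measurement: the 3.35TB/s bandwidth is the H100 figure used in this article, while `PEAK_FLOPS` is an assumed round number for dense FP16 throughput, and the intensities are picked from the 200-400 and 1-2 ops/byte ranges quoted above.

```python
# Roofline sketch: attainable throughput = min(peak compute, intensity * bandwidth).
# PEAK_FLOPS is an assumed figure for illustration, not taken from the article.
PEAK_FLOPS = 989e12        # assumed peak dense FP16 throughput, FLOP/s
HBM_BANDWIDTH = 3.35e12    # H100 HBM bandwidth cited in the article, bytes/s


def attainable_flops(intensity_ops_per_byte: float) -> float:
    """Upper bound on sustained FLOP/s at a given arithmetic intensity."""
    return min(PEAK_FLOPS, intensity_ops_per_byte * HBM_BANDWIDTH)


def is_memory_bound(intensity_ops_per_byte: float) -> bool:
    """True when bandwidth, not compute, caps throughput."""
    return intensity_ops_per_byte * HBM_BANDWIDTH < PEAK_FLOPS


# Intensity at which the bandwidth limit and the compute limit meet.
ridge_point = PEAK_FLOPS / HBM_BANDWIDTH

for phase, intensity in [("prefill", 300.0), ("decode", 2.0)]:
    share = attainable_flops(intensity) / PEAK_FLOPS
    bound = "memory-bound" if is_memory_bound(intensity) else "compute-bound"
    print(f"{phase}: {intensity:.0f} ops/byte -> {bound}, "
          f"at most {share:.1%} of peak compute")
```

Under these assumed figures the ridge point sits near 295 ops/byte, so a 300 ops/byte Prefill workload is compute-bound, while a 2 ops/byte Decode workload can use under 1% of peak compute.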

The most intuitive metric is TPOT, Time Per Output Token, meaning how much time is needed to generate each output token.

Taking an H100 running a 70B model as an example, the theoretical lower limit in the Decode phase is about 42 milliseconds. This number is determined primarily by the H100's HBM bandwidth of about 3.35TB/s: if generating one token requires reading about 140GB of data, then the theoretical lower time limit is 140GB ÷ 3.35TB/s ≈ 42ms.

Switching to the Blackwell B200, with HBM bandwidth increased to about 8TB/s, reading the same 140GB gives a theoretical lower limit of 140GB ÷ 8TB/s ≈ 17.5ms.

That is to say, Decode speed increases by about 2.4 times, primarily due to HBM bandwidth improvement.
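The arithmetic behind these figures can be reproduced in a few lines. This is a sketch of the bandwidth-only floor described above, assuming batch size 1 and counting weight reads only; it ignores KV Cache traffic and whether the weights fit on a single card. The function names are illustrative.

```python
def weight_bytes(num_params: float, bytes_per_param: float = 2.0) -> float:
    """Weight footprint in bytes; FP16 uses 2 bytes per parameter."""
    return num_params * bytes_per_param


def tpot_lower_bound_ms(bytes_read_per_token: float,
                        bandwidth_bytes_per_s: float) -> float:
    """Bandwidth-only floor on time per output token, in milliseconds."""
    return bytes_read_per_token / bandwidth_bytes_per_s * 1e3


weights = weight_bytes(70e9)                      # 70B params in FP16
h100_ms = tpot_lower_bound_ms(weights, 3.35e12)   # H100: about 3.35 TB/s
b200_ms = tpot_lower_bound_ms(weights, 8e12)      # B200: about 8 TB/s

print(f"weights: {weights / 1e9:.0f} GB")
print(f"H100 floor: {h100_ms:.1f} ms, B200 floor: {b200_ms:.1f} ms")
print(f"speedup: {h100_ms / b200_ms:.2f}x")
```

Because the floor is bytes divided by bandwidth, the speedup is simply the bandwidth ratio, 8 ÷ 3.35, which is where the roughly 2.4 times figure comes from.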

So, are there ways to alleviate this memory bottleneck?

The most important method is to increase batch size, i.e., have the GPU handle more user requests simultaneously. Because model weights can be shared among multiple requests, the GPU reads the weights once and can serve multiple users at the same time. This way, the memory read cost per user, per token is reduced.

This is also why modern AI inference serving systems all pursue high concurrency and large batch sizes. The larger the batch, the more likely the GPU's computing units are to be fully utilized, and the lower the unit token cost.

But batch size cannot be increased indefinitely because more concurrent requests lead to larger KV Cache. KV Cache needs to reside in HBM for a long time, so HBM capacity also becomes very critical. Bandwidth determines data read speed; capacity determines how many requests can be served simultaneously.
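The two sides of this trade-off can be put into a rough sketch: batching spreads each weight read over more requests, while HBM capacity caps how many KV Caches fit next to the weights. The 192GB and 140GB figures come from this article; `KV_PER_REQUEST` is a hypothetical placeholder, and the model ignores activations and other runtime overhead.

```python
import math


def max_batch_size(hbm_bytes: float, weight_bytes: float,
                   kv_bytes_per_request: float) -> int:
    """Largest batch whose KV Cache fits in the HBM left over after weights."""
    free_bytes = hbm_bytes - weight_bytes
    if free_bytes <= 0:
        return 0  # the weights alone do not fit on this card
    return math.floor(free_bytes / kv_bytes_per_request)


def weight_bytes_read_per_request(weight_bytes: float, batch_size: int) -> float:
    """Weights are read once per decode step and shared by the whole batch."""
    return weight_bytes / batch_size


HBM = 192e9            # B200 capacity cited in the article
WEIGHTS = 140e9        # 70B model in FP16, as in the article
KV_PER_REQUEST = 2e9   # hypothetical per-request KV Cache size

limit = max_batch_size(HBM, WEIGHTS, KV_PER_REQUEST)
for batch in (1, 8, limit):
    per_req = weight_bytes_read_per_request(WEIGHTS, batch)
    print(f"batch={batch}: {per_req / 1e9:.1f} GB of weight reads "
          f"per request per token")
```

With these placeholder inputs the capacity limit works out to 26 concurrent requests; a longer context raises the per-request KV Cache and lowers that ceiling.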

Therefore, the core bottleneck of AI inference can be summarized as: the Prefill phase is more computationally demanding because it processes a large number of input tokens in parallel. The Decode phase is more HBM-dependent because it repeatedly reads model weights and a growing KV Cache.

In practical inference services, most of the time is spent in the Decode phase. Therefore, the core bottleneck determining inference efficiency is the bandwidth and capacity of HBM.

Here is a summary of AI inference economics:

AI inference can be simplified to a core equation: Token Price ∝ (GPU Cost + Power Cost) / (Tokens Generated Per Second)

And Tokens Generated Per Second (Throughput) is directly limited by two physical constraints:

HBM Capacity: Determines how large a model a single card can hold, how long a context it can support, and the potential batch size.
HBM Bandwidth: Determines the speed of reading model weights and KV Cache from HBM for each token generated.
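The proportionality above can be turned into a small calculator to see how throughput feeds into unit cost. All inputs below are hypothetical placeholders chosen only to show the scaling, and the function name is illustrative; the article gives the relationship, not these numbers.

```python
def cost_per_million_tokens(gpu_cost_per_hour: float,
                            power_cost_per_hour: float,
                            tokens_per_second: float) -> float:
    """Hourly cost divided by hourly token output, scaled to 1M tokens."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    tokens_per_hour = tokens_per_second * 3600
    return (gpu_cost_per_hour + power_cost_per_hour) / tokens_per_hour * 1e6


# Hypothetical placeholder inputs: $3.00/h GPU, $0.50/h power.
base = cost_per_million_tokens(3.0, 0.5, 1000.0)
doubled = cost_per_million_tokens(3.0, 0.5, 2000.0)
print(f"baseline: ${base:.3f} per 1M tokens")
print(f"2x throughput: ${doubled:.3f} per 1M tokens")
```

Holding cost fixed, doubling tokens per second halves the cost per token, which is why the HBM limits on throughput translate so directly into token pricing.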

During the Decode phase (token-by-token output generation), GPU computational utilization is typically extremely low (<10%), because most of the time is spent reading data from HBM. This is a classic Memory-Bound scenario. At this point, HBM bandwidth directly determines token generation speed—doubling bandwidth nearly doubles throughput and nearly halves unit token cost.

When the model parameter count exceeds the HBM capacity of a single card, the system must employ Model Parallelism, distributing weights across multiple cards. Generating each token requires cross-card communication, introducing huge network latency and additional computational overhead. A core reason why OpenAI's early GPT-4 service had slow response times and high costs was that the model was too large, forcing distribution across many GPUs, and communication overhead consumed efficiency.

Therefore, every leap in HBM capacity and bandwidth translates directly into a step-change decline in token inference cost:

H100 → H200: Capacity +76%, Bandwidth +43% → Supports larger batches and longer contexts
B300 (Blackwell Ultra) → Vera Rubin: Capacity unchanged (288GB), Bandwidth 2.75 times → Significant throughput improvement in Decode phase
Rubin Ultra: Capacity 4 times (1TB+) → Can fit trillion-parameter MoE models within a single card/chassis, eliminating sharding overhead
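As a quick check on the first line of this list, the percentages can be recomputed from per-card specs. The H100 values (80 GB, 3.35TB/s) appear in this article; the H200 values (141 GB, 4.8TB/s) are assumed here from commonly quoted specifications and are not stated in the text.

```python
def pct_gain(old: float, new: float) -> float:
    """Percentage increase from old to new."""
    return (new - old) / old * 100


# H100 figures are from the article; H200 figures are assumed specs.
H100 = {"capacity_gb": 80, "bandwidth_tbs": 3.35}
H200 = {"capacity_gb": 141, "bandwidth_tbs": 4.8}

cap_gain = pct_gain(H100["capacity_gb"], H200["capacity_gb"])
bw_gain = pct_gain(H100["bandwidth_tbs"], H200["bandwidth_tbs"])
print(f"H100 -> H200: capacity +{cap_gain:.0f}%, bandwidth +{bw_gain:.0f}%")
```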

NVIDIA claims Rubin can achieve a 10x reduction in per-Token Total Cost of Ownership (TCO) compared to Blackwell, essentially derived from the triple effect of HBM4 bandwidth leap (reducing Memory-Bound time) + FP4 precision (reducing data transfer volume) + larger capacity (reducing cross-machine communication).

Based on the above semiconductor shortage logic, we evaluate the seven segments of the AI industry chain, resulting in an industrial map of semiconductor sector tightness. We will release detailed research on other sectors subsequently.

Storage Sector Panorama: Classification, Technology Generations, Player List

From DRAM, NAND, NOR to HBM, HBF

Now, let's return to the storage industry. Storage chips can broadly be divided into two categories: one is "runtime memory," responsible for allowing CPUs and GPUs to quickly read and process data; the other is "long-term storage," responsible for preserving data after power-off.

From a product classification perspective, the three core traditional categories are DRAM, NAND Flash, and NOR Flash. Entering the AI era, HBM has become the most important high-end branch within DRAM, while HBF represents a new attempt by NAND Flash to extend into high-bandwidth, high-capacity AI inference scenarios.

Traditional Three Storage Categories: DRAM, NAND, NOR

DRAM is the most typical volatile memory, meaning data disappears after power-off. It primarily serves the role of "runtime data temporary storage" and is the main memory most relied upon by CPUs, GPUs, and other computing chips during operation.

DRAM's advantages are high speed and low latency, suitable for real-time computation; its disadvantages are higher cost and inability to retain data without power. Its technology generations primarily evolve around bandwidth, power consumption, and capacity. Current mainstream products include DDR4, DDR5, as well as GDDR and HBM for graphics and AI acceleration scenarios. DDR5 is gradually replacing DDR4, becoming the mainstream memory standard for servers and high-end PCs.

NAND Flash is the most widely used non-volatile memory, meaning data can be preserved after power-off. It is primarily used in SSDs, smartphone storage, USB drives, memory cards, and data center storage. Compared to DRAM, NAND is much slower but offers larger capacity and lower unit cost, making it more suitable as a "large-capacity data warehouse."

NAND technology evolution mainly follows two paths: one is moving from SLC, MLC, TLC to QLC, continuously increasing how many bits each memory cell can store; the other is moving from 2D NAND to 3D NAND, increasing capacity through vertical stacking. Currently, 200-layer+ and 300-layer+ 3D NAND have become the industry's competitive focus. The core contradiction for NAND is the balance between capacity, cost, performance, and endurance. TLC and QLC have lower cost and larger capacity, but weaker write endurance and performance compared to SLC and MLC.
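The first path can be made concrete with a short sketch. Storing n bits in one cell means the cell must distinguish 2^n charge levels, which is the usual explanation for why denser cell types give up endurance and speed. The dictionary and function names are illustrative.

```python
# Bits stored per cell for each NAND cell type.
BITS_PER_CELL = {"SLC": 1, "MLC": 2, "TLC": 3, "QLC": 4}


def voltage_levels(bits: int) -> int:
    """Distinct charge levels a cell must resolve to store `bits` bits."""
    return 2 ** bits


for name, bits in BITS_PER_CELL.items():
    gain = bits / BITS_PER_CELL["SLC"]
    print(f"{name}: {bits} bit(s)/cell, {voltage_levels(bits)} levels, "
          f"{gain:.0f}x SLC capacity per cell")
```

Capacity per cell grows linearly with bits while the number of levels grows exponentially, so each step from SLC toward QLC buys less capacity for more sensing difficulty.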

NOR Flash is also non-volatile memory, but its positioning is completely different from NAND. NOR's characteristics are fast read speed, support for random access, and the ability to execute code directly, i.e., XIP (Execute In Place). Therefore, NOR is mainly used to store boot code, firmware, and critical system parameters in mobile phones, automotive electronics, IoT devices, PC motherboards, and embedded systems.

NOR capacity is typically not large, write and erase speeds are not fast, and cost is higher than NAND, so it's not suitable for large-capacity storage. However, in scenarios like device boot-up, security authentication, automotive electronics, and industrial control, NOR's reliability and fast read capability are crucial. With the increase in automotive electronics and IoT devices, high-density NOR still has structural growth potential.

In short, DRAM is responsible for "runtime computation," NAND is responsible for "large-capacity storage," and NOR is responsible for "boot-up and firmware."

The Most Important Storage Branch in the AI Era: HBM

If traditional DRAM is the main memory for servers and PCs, then HBM is the high-end dedicated memory for AI GPUs. HBM stands for High Bandwidth Memory. It essentially still belongs to DRAM, but its structure and packaging are very different from traditional DDR and GDDR.

Traditional memory typically connects to the processor via PCB traces, and bandwidth improvements are limited by power consumption, distance, and signal integrity. HBM, on the other hand, adopts a 3D stacking structure, vertically stacking multiple layers of DRAM dies and connecting them vertically via TSVs (Through-Silicon Vias). Subsequently, the HBM stack is placed in the same package as the GPU via an Interposer, enabling very short-distance, ultra-wide interface data transmission.

This brings two direct results: first, HBM has extremely high bandwidth; second, HBM sits very close to the GPU, allowing data to be delivered to the computing cores faster. This is crucial for large model training and inference: although AI GPU Tensor Cores compute very fast, if HBM cannot provide model weights, KV Cache, and activation data in time, the GPU stalls in "memory wait."

Therefore, HBM has become one of the core bottlenecks in AI computing systems. Especially during the Decode phase of large model inference, performance is often no longer determined by the GPU's theoretical FLOPS, but by HBM's bandwidth and capacity. Bandwidth determines the speed at which data is fed to the GPU; capacity determines how many model weights and KV Cache a single card can hold, and how large a batch it can serve simultaneously.

From a technology generation perspective, HBM iteration is rapid. HBM2E was mainly used in early AI acceleration and HPC; HBM3 significantly improved bandwidth; HBM3E is the mainstream configuration for current AI GPUs; HBM4 will further increase I/O width, single-stack bandwidth, and capacity, and introduce more complex base die designs. Future HBM4E, HBM5 will continue evolving towards higher bandwidth and larger capacity.

NAND's New Direction: HBF

HBF (High Bandwidth Flash) is a new type of storage born for AI inference. It can be understood as a "middle-tier high-speed storage" between HBM and ordinary SSDs. AI models are getting larger, with weight files often hundreds of GB, even TB level. HBM is fast enough but capacity limited and expensive; SSDs have sufficient capacity, but speed and latency cannot keep up with high-performance inference demands. The emergence of HBF aims to solve this contradiction.

It borrows HBM's 3D stacking and high-speed interconnect technology, vertically stacking multiple layers of NAND Flash dies and connecting them via high-speed channels, thus possessing much higher bandwidth than ordinary SSDs while maintaining large capacity characteristics. It is not meant to replace HBM, but to serve as a "near-line high-speed storage layer," specifically storing large model weights, so the GPU doesn't frequently wait for data during inference.

By 2026, HBF has entered the engineering sample phase. Manufacturers plan to begin delivering samples in the second half of the year and see the first commercial reference designs in 2027. Mass production is not expected until after 2028. Its significance lies in that AI inference servers can achieve bandwidth and capacity close to HBM at a lower cost than an all-HBM solution, enabling faster large model loading and higher throughput.

Player List: Who Holds Key Positions in the Storage Industry Chain

From a global perspective, the storage industry is highly concentrated, but competitive landscapes differ significantly across product lines, with each player having its core strengths and strategic focus.

DRAM and HBM Field: Samsung, SK Hynix, and Micron are the absolute core players. DRAM remains the main memory for servers, PCs, and mobile devices, while HBM is the critical high-speed memory for AI GPUs. Samsung excels in overall process and packaging integration, especially with advantages in 3D stacking and TSV reliability; SK Hynix, leveraging deep ties with NVIDIA and first-mover advantages in mass production, holds over half of the HBM market share; Micron is rapidly catching up on HBM3E and HBM4 engineering samples, showing notable progress especially in high-capacity stacking and reliability optimization.

For HBM, competition is not only about DRAM manufacturing processes but also TSV stacking, packaging yield, testing processes, customer qualification, and long-term supply capability. Securing large AI chip orders from NVIDIA, AMD, Google, etc., means holding pricing power and guaranteed capacity in the current AI storage supercycle.

NAND Flash Field: Core players include Samsung, Kioxia/Western Digital, SanDisk, SK Hynix, and Micron. In the past, the NAND market was primarily driven by consumer electronics, SSD, and data center demand. With the growth of AI inference demand, the development of high-bandwidth NAND or HBF technology provides new growth opportunities for manufacturers. Samsung still leads in high-density 3D NAND and enterprise SSDs; Kioxia/Western Digital have prominent advantages in storage scale and reliability; SanDisk focuses on HBF and high-bandwidth flash prototypes, rapidly targeting the AI inference market. SK Hynix and Micron are also actively following up on high-bandwidth NAND solutions, competing for AI data center market share.

NOR Flash Field: Major players include Macronix, GigaDevice, Winbond, and Infineon/Cypress. The NOR market size is smaller than DRAM or NAND, but applications are diverse and require high reliability. It's mainly used in automotive electronics, industrial control, IoT, and embedded systems, responsible for storing boot code, firmware, and critical parameters. Chinese manufacturers show relatively strong competitiveness in the NOR field. GigaDevice is continuously breaking through in high-density NOR and automotive-grade applications, steadily growing its market share.

In summary, the storage sector can be divided into three layers according to the AI semiconductor chain:

High-End AI Memory Players: Core HBM suppliers like SK Hynix, Samsung, Micron, targeting GPU high-bandwidth memory and large model training demand.
Large-Capacity Storage Players: NAND and SSD manufacturers like Samsung, Kioxia/Western Digital, SanDisk, SK Hynix, Micron. The future adoption of HBF will open new growth points in the AI inference market for these players.
Embedded and Control-Class Storage Players: NOR Flash suppliers like Macronix, GigaDevice, Winbond, Infineon/Cypress, focusing on scenarios with high reliability requirements like industrial, automotive, and IoT.

This landscape shows the layered trend of the storage industry chain driven by AI: HBM represents high-end computing power support, NAND/HBF represents the combination of large capacity and high bandwidth, and NOR ensures stability for embedded and industrial control. With the advancement of new technologies like HBF, the landscape of NAND and AI inference storage markets may undergo new changes.

We will cover detailed financial analysis and business research for each company in subsequent articles.

In-depth Supply-Demand Analysis: The Most Severe Structural Shortage in 15 Years

The current global memory chip market is experiencing a historic boom driven by multiple overlapping demand engines, with AI's explosion at its core. The three forces of AI training, AI inference, and the large-scale global data center expansion intertwine and reinforce each other, collectively forming a "bottomless pit" demand for memory chips. The transmission logic of these three engines differs, but their direction is completely aligned.

Demand Side

Engine One: AI Training—Parameter Arms Race Drives HBM Capacity Expansion

Large language model parameter scales are accelerating from hundreds of billions towards trillions. Growth between generations is not linear: GPT-4 reportedly has about 1.8 trillion parameters (MoE architecture), and the target for next-gen models is 10 trillion+. Training such models requires parallel processing on thousands, even tens of thousands of GPUs for months.

But the real bottleneck in training is the bandwidth between the GPU and HBM. During training, each GPU needs to repeatedly read and write gradients, optimizer states, activation values in HBM. Insufficient HBM capacity → cannot hold the complete model state → must communicate across GPUs → communication overhead consumes computational gains.

Thus, for each generation of NVIDIA's GPU, HBM capacity jumps:

From H100 to Rubin Ultra, single-card HBM capacity surges from 80 GB to ~1 TB, increasing 12.5 times in four years.

And it's not just NVIDIA. Google TPU v8AX and Amazon Trainium3 are also upgrading from HBM3E 8-Hi to 12-Hi, and from AMD MI350 to MI400, HBM capacity jumps from 288 GB to 432 GB. The entire AI accelerator industry is racing in the same direction.

The training phase demands storage across the entire stack:

HBM: The GPU's "workbench," where all intermediate states of the training process reside. Without enough HBM → cannot train large models.
NVMe SSD: Training datasets can be tens of TB (e.g., Llama 3 used 15T tokens). They must be fed to HBM at high speed. Bottlenecks in data loading directly lengthen training time—if SSD bandwidth cannot keep up with GPU consumption, tens of thousands of GPUs idle waiting for data.
HDD: Cold data storage. Post-training model weights, historical checkpoints, raw corpora need long-term preservation.

Engine Two: AI Inference—Scale Effect Makes Storage Demand "Long-Tailed"

The single-shot computational complexity of inference is indeed much lower than training, but its demand characteristics are completely different: Training is a bursty, concentrated consumption; Inference is a continuously expanding, long-tail devouring force.

Several key numbers overturn conventional assumptions:

Inference is no longer the "small share." In 2026, inference accounts for 66% of total AI computing load; two years ago this number was only 33%. The positions of inference and training have completely reversed. With the explosion of Agentic AI (AI capable of autonomously calling tools, multi-step reasoning, multi-agent collaboration), inference's share will only continue to rise.

Single inference memory demand is severely underestimated. A cutting-edge LLM's model weights occupy 100-200 GB (FP16 precision), and KV Cache consumes an equal or even greater amount of memory. There's a simple, brutal rule for KV Cache: every time the context token count doubles, the memory occupied by KV Cache also doubles. Agentic workloads typically have long contexts (multi-turn conversations, codebase context, document retrieval), making KV Cache pressure far exceed that of simple chatbots.
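The doubling rule follows from the standard KV Cache size formula, which is linear in context length. The sketch below assumes FP16 values and a hypothetical model shape (80 layers, 8 KV heads, head dimension 128) chosen only for illustration; it is not the configuration of any model named in this article.

```python
def kv_cache_bytes(num_layers: int, num_kv_heads: int, head_dim: int,
                   context_tokens: int, batch_size: int = 1,
                   bytes_per_value: int = 2) -> int:
    """KV Cache size: one K and one V vector per layer, per KV head, per token."""
    return (2 * num_layers * num_kv_heads * head_dim
            * bytes_per_value * context_tokens * batch_size)


# Hypothetical model shape, for illustration only.
SHAPE = dict(num_layers=80, num_kv_heads=8, head_dim=128)

for tokens in (4096, 8192, 16384):
    size = kv_cache_bytes(context_tokens=tokens, **SHAPE)
    print(f"{tokens} tokens -> {size / 2**30:.2f} GiB per request")
```

For this assumed shape, one request grows from 1.25 GiB at 4096 tokens to 5 GiB at 16384 tokens, and the total scales again with the number of concurrent requests.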

GPU's role division during inference:

Agentic AI spends most of its time in Decode. This is why HBM bandwidth determines inference speed, and HBM capacity determines how many users can be served simultaneously.

Scale effect is the real story. A training cluster might need thousands of GPUs, but inference deployment might require hundreds of thousands, even millions. Anthropic's ARR grew from $90 billion to $300 billion in just 4 months.

Anthropic itself stated: with more GPUs, ARR could be even higher. They are constrained by HBM supply on GPUs.

Inference's full-stack storage demand:

HBM: Core bottleneck for GPU inference (bandwidth + capacity). Long context + KV Cache in Agentic inference continually worsens this bottleneck.
Enterprise SSD: Model weight persistence, offloading KV Cache to SSD (long context strategy), user data storage. Enterprise SSD prices increased 140% month-over-month.
HBF (High Bandwidth Flash): Next-gen product from the SanDisk and Kioxia collaboration, single stack 512 GB, module 4 TB, designed specifically for inference weight storage. Mass production starts early 2027. If successful, the memory hierarchy for inference will be rewritten—HBM for speed, HBF for capacity.

Engine Three: Data Center Expansion—The "Amplifier" of Storage Value

Hyperscalers' CapEx surge isn't just for AI. Traditional cloud computing, big data, and streaming are also growing. But AI has changed the storage architecture of data centers.

Several specific data points:

In 2026, data centers consume ~70% of global DRAM capacity (SemiAnalysis), with server DRAM ~40%, HBM ~10%+ and rising rapidly
Seagate's FY2026 Q3 data center shipments accounted for 88%; nearline HDD capacity for 2026-2027 is completely sold out, with orders for 2028 already being discussed
Kioxia's 2026 NAND capacity has been 100% booked by customers
NVIDIA Vera Rubin NVL72 system memory demand jumps significantly from Blackwell NVL72 levels (SOCAMM LPDDR5-based memory becomes a new growth area)

HDD's unexpected resurgence is a particularly interesting side effect: enterprise SSDs are too expensive (month-over-month +140%), pushing a lot of cold data from SSDs back to HDDs. In the AI inference era, the economics of storage-compute separation give HDDs new survival logic—nearline HDD demand explodes; Seagate's net profit increased 248% year-over-year.

Cross-Validation of the Three Engines

In summary: Training defines the upper limit of HBM capacity, Inference defines the breadth of HBM demand, and Data center expansion provides the base for total demand volume. None of these three forces are slowing down, while the supply side (detailed in the next section) is an "iron cage." This is the underlying narrative of this storage cycle.

Supply Side: HBM Capacity Squeeze and 2-3 Year Capacity Build Cycle

Facing the explosive growth on the demand side, the supply side of memory chips is under unprecedented structural constraints, unable to respond quickly to market changes. This is the fundamental cause of the current severe global memory chip shortage and soaring prices. The core supply-side problems can be summarized in two points: first, the massive squeezing effect of HBM production on traditional DRAM and NAND capacity; second, the inherent 2-3 year long capacity build cycle for memory chips, resulting in extremely low supply elasticity.

According to SemiAnalysis estimates, the proportion of HBM wafer capacity to total DRAM wafer capacity for Samsung, SK Hynix, and Micron was less than 5% in 2022, but by the end of 2025, it had risen to about 20%. By the end of 2027, this proportion might reach about 35%.

First, HBM, as the current "golden goose" of the storage market, has an exceptionally complex and "resource-hungry" production process. As mentioned, HBM uses 3D stacking and TSV (Through-Silicon Via) technology to vertically package multiple DRAM dies together. This complex packaging process not only has high technical barriers and slow yield ramp-up but also consumes 3 to 4 times more wafer area than traditional DRAM products of equivalent value. This means producing a $100 HBM chip consumes the wafer area that could have produced 3-4 standard DRAM chips.
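The squeeze can be sketched as a simple mix calculation. This assumes the 3 to 4 times figure is read as wafer area per unit of output, which is an interpretation rather than something the article states precisely, and it treats the "less than 5%" share for 2022 as 5%. The function name is illustrative.

```python
def relative_bit_output(hbm_wafer_share: float, wafer_intensity: float) -> float:
    """DRAM output relative to an all-standard-DRAM baseline of 1.0.

    hbm_wafer_share: fraction of DRAM wafers allocated to HBM (0..1).
    wafer_intensity: wafer area HBM needs per unit of output, as a
        multiple of standard DRAM (assumed interpretation of "3 to 4 times").
    """
    standard_output = 1.0 - hbm_wafer_share
    hbm_output = hbm_wafer_share / wafer_intensity
    return standard_output + hbm_output


# Wafer shares are the SemiAnalysis estimates cited in the article.
for label, share in [("2022", 0.05), ("end of 2025", 0.20), ("end of 2027", 0.35)]:
    low = relative_bit_output(share, 4.0)
    high = relative_bit_output(share, 3.0)
    print(f"{label}: HBM share {share:.0%} -> total output "
          f"{low:.1%} to {high:.1%} of baseline")
```

Under this reading, moving 20% of wafers to HBM leaves total output at roughly 85% to 87% of what the same wafers would yield as standard DRAM, before any change in demand.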

This structural shift in capacity causes severe knock-on effects. Statistics show that to secure HBM output, Samsung shifted about 30% of its consumer-grade DRAM capacity to HBM; SK Hynix's shift ratio is even higher at 40%; and Micron shifted about 25%. These numbers are staggering; they directly lead to a significant reduction in available global standard DRAM capacity.

Simultaneously, because HBM packaging requires advanced packaging lines (like CoWoS), this capacity is also heavily occupied, further limiting other advanced packaging businesses. This "suction effect" triggered by HBM causes markets for DDR4, DDR5, and NAND Flash, which were relatively balanced in supply and demand, to fall into severe shortage within a short period. For example, as DDR4 capacity was drastically reduced, its price experienced a shocking increase in the second half of 2025, rising up to 1800%, fully illustrating the severity of the capacity squeeze.

Thus, this cycle is unique in that the product shift driving demand growth, HBM, itself has higher manufacturing intensity, effectively introducing a reverse scaling effect. Past computing platform inflection points, such as PCs, mobile phones, and cloud computing, mainly expanded demand without significantly constraining supply. This time, the AI-driven shift to HBM increases demand on one hand and, because HBM is harder to manufacture than traditional DRAM, tightens supply on the other.

Second, the semiconductor capacity build cycle is another major hard constraint on the supply side.

From deciding to invest in building a new wafer fab to finally producing qualified chips typically takes 2 to 3 years. This process includes plant construction, cleanroom setup, procurement, installation, and debugging of multi-million-dollar equipment (like EUV lithography machines), followed by lengthy process development and yield ramp-up.

In the current market environment, even if the three major manufacturers decided tomorrow to aggressively invest in expanding DRAM capacity, the new capacity would only begin to reach the market by the end of 2027 or early 2028. More importantly, facing the structural opportunity brought by AI, manufacturers' capital expenditure strategies have become more cautious and focused. They prefer to invest precious capex in areas with higher margins and more certain growth prospects, such as HBM and advanced DRAM technology, rather than massively expanding relatively mature, thinner-margin traditional NAND Flash capacity.

For example, SK Hynix plans to build a new advanced packaging plant in Indiana, USA, focusing on AI memory (including HBM), and Micron plans to invest tens of billions in new fabs in Idaho and New York, USA. These investments are aimed at future HBM and advanced DRAM capacity, not at solving current traditional storage shortages. Therefore, at least until the end of 2027, the market is unlikely to see any new supply that could significantly alleviate the current supply-demand tension. This supply rigidity, set against demand elasticity, pushes the storage industry into a "long-haul" supercycle.

Table 3: Major Manufacturers' Capacity Shift and Expansion Plans

According to estimates, from the end of Q4 2025 to Q4 2027, the combined 1b and 1c capacity of the three major manufacturers will increase by about 80%; by the end of 2026, nearly 30% of Samsung and SK Hynix's DRAM wafer production will shift to the 1c node, and Micron will also shift about 30% of its DRAM production to its corresponding 1γ node.
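To put the roughly 80% capacity increase in per-quarter terms, the short Python sketch below computes the implied compound quarterly growth rate. It assumes the period spans eight quarters (end of Q4 2025 to end of Q4 2027) and that growth is evenly compounded, neither of which the estimate states; the function name is illustrative.

```python
def implied_quarterly_growth(total_growth: float, quarters: int) -> float:
    """Compound growth rate per quarter implied by a total increase.

    total_growth: total fractional increase over the period (0.80 for 80%).
    quarters: number of quarters in the period (assumed to be 8 here).
    """
    if quarters <= 0:
        raise ValueError("quarters must be positive")
    if total_growth <= -1.0:
        raise ValueError("total_growth must be greater than -100%")
    return (1.0 + total_growth) ** (1.0 / quarters) - 1.0


rate = implied_quarterly_growth(0.80, 8)
print(f"Implied growth per quarter: {rate:.2%}")
```

Under these assumptions the 80% figure corresponds to roughly 7.6% growth per quarter in combined 1b and 1c capacity.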

In 2026, actual new wafer output is limited and concentrated mostly on HBM; NAND sees almost no actual wafer increment.

According to SemiAnalysis data, it's expected that in 2026, almost all new wafer capacity will be concentrated in three fabs: Samsung's P4 (mainly Phase 1 and 3, Phase 4 has limited contribution by end-26), SK Hynix's M15X, and Micron's A3.

Quantifying the Supply-Demand Gap: Goldman Sachs Predicts a 4.9% DRAM Supply Shortage in 2026

The current imbalance in the storage market has reached an extremely severe level. Quantitative analyses by multiple top investment banks provide solid evidence for the strength of this "supercycle." Goldman Sachs released an in-depth research report in mid-2025, clearly stating that the current memory chip market is experiencing the most severe supply-demand imbalance in the past 15 years. Through detailed modeling of global DRAM and NAND Flash supply and demand, the report predicts staggering shortage figures. For the DRAM market, Goldman Sachs predicts supply shortages of 4.9% and 2.5% in 2026 and 2027, respectively. This means total global DRAM demand will exceed total supply by nearly 5%. In a market measured in millions of wafers, such a large-scale shortage is extremely rare, sufficient to trigger sustained, substantial price increases. The situation for NAND Flash is similarly concerning, with the report predicting it will also be in shortage, albeit slightly less severe than DRAM, but given its massive market size, its impact on the entire industry is equally profound.

This quantitative analysis reveals the structural reasons behind the imbalance. Goldman Sachs' model fully considers the squeezing effect of HBM capacity expansion on traditional DRAM capacity and the lag in manufacturers' capital expenditure shifts. The report believes that although manufacturers are aware of the tightness and are beginning to increase investment, the 2-3 year capacity build cycle means these investments cannot translate into effective supply in the short term. Therefore, the supply-demand gap will persist, and possibly even widen, over the next one to two years.

According to calculations from SemiAnalysis's Memory Industry Model, the DRAM market supply-demand imbalance is worsening. In 2026, overall DRAM supply is expected to be about 7% lower than demand.

Within this overall shortage, the HBM supply shortage is expected to widen from about 5% this year to about 6% in 2026, and further to about 9% in 2027.
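The shortage percentages quoted in this section depend on which quantity serves as the denominator, and the reports cited do not spell this out. The Python sketch below shows both conventions side by side; the function names and the supply and demand values are illustrative assumptions, chosen so that supply is 7% lower than demand.

```python
def shortfall_vs_demand(supply: float, demand: float) -> float:
    """Gap as a share of demand: how far supply falls below demand."""
    if demand <= 0:
        raise ValueError("demand must be positive")
    return (demand - supply) / demand


def excess_vs_supply(supply: float, demand: float) -> float:
    """Gap as a share of supply: how far demand exceeds supply."""
    if supply <= 0:
        raise ValueError("supply must be positive")
    return (demand - supply) / supply


# Hypothetical index values: demand = 100, supply 7% lower.
demand, supply = 100.0, 93.0
print(f"Supply below demand:  {shortfall_vs_demand(supply, demand):.2%}")
print(f"Demand above supply:  {excess_vs_supply(supply, demand):.2%}")
```

The two conventions give close but not identical numbers (7.00% versus about 7.53% here), which is worth keeping in mind when comparing estimates from different sources.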

These quantitative figures are not just descriptions of the current market condition but also strong predictions for future price trends and manufacturer profitability. In a severely undersupplied market, pricing power lies entirely with the manufacturers. Customers, especially cloud service providers and AI companies needing stable supply, are willing to accept higher prices and sign long-term supply agreements to ensure sufficient chip supply.

Inventory Status: Inventory Days Drop to Historically Extreme Lows

Inventory level is one of the most sensitive and direct leading indicators for judging where the memory chip industry stands in its cycle. In a normal industry cycle, downstream customers (like PC makers, smartphone manufacturers, server brands) typically maintain inventory of about 12 to 16 weeks (84-112 days) to cope with demand fluctuations and supply chain uncertainty. However, in the current supercycle, inventory is not a simple buffer but a "barometer" reflecting a state of extreme market hunger. Current inventory levels across the entire storage industry chain have fallen to historically extreme lows.

We analyze inventory across four layers:

Layer 1: DRAM/HBM Manufacturers (Samsung / SK Hynix / Micron / Nanya)

Note: Bloomberg's DOI calculation covers total inventory (WIP, raw materials, and finished goods), whereas the figure of about 4 weeks (28 days) that SK Hynix disclosed at the Goldman Sachs virtual investor conference in February 2026 refers only to finished goods inventory for DRAM and NAND. The latest Bloomberg/MacroMicro update (after the DART 5/15 cutoff) shows SK Hynix's total inventory DOI at 124.98 days in Q1 2026, down 18 days QoQ from 143 days in Q4 2025. Finished goods inventory is extremely low, and many wafers are stuck on HBM TSV drilling/stacking production lines. SK Hynix stated at the same conference: "No customer can fully meet their memory demand this year." Micron confirmed on its Q1 FY2026 (Dec 2025) and Q2 FY2026 (Mar 2026) earnings calls, two consecutive quarters, that DRAM inventory days "remain tight and below 120 days", with the Q2 call adding the phrase "especially tight".

HBM's TSV drilling, wafer thinning, 12-layer stacking, and testing processes are extremely lengthy, so many wafers sit on production lines as work in progress, while finished goods ship to customers almost as soon as they are completed, leaving almost no surplus stock. This is a very dangerous signal. This "hand-to-mouth" supply state shows that market demand is far stronger than official order data indicates, as customers are also consuming their own safety stock.
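The gap between total-inventory DOI and finished-goods DOI can be illustrated with the common textbook definition, inventory divided by cost of goods sold for a period and scaled by the days in that period. The Python sketch below uses that definition as an assumption; the article does not give Bloomberg's exact method, and all monetary figures are hypothetical.

```python
def weeks_to_days(weeks: float) -> float:
    """Convert an inventory level stated in weeks into days."""
    return weeks * 7


def days_of_inventory(inventory_value: float, cogs_for_period: float,
                      period_days: int = 90) -> float:
    """Days of inventory (DOI) under the common definition:
    inventory / COGS for the period * days in the period."""
    if cogs_for_period <= 0:
        raise ValueError("cogs_for_period must be positive")
    return inventory_value / cogs_for_period * period_days


# Hypothetical quarter: COGS of 90 units, i.e. 1 unit of cost per day.
quarterly_cogs = 90.0
total_inventory = 125.0   # WIP + raw materials + finished goods (hypothetical)
finished_goods = 28.0     # finished goods only (hypothetical)

print(days_of_inventory(total_inventory, quarterly_cogs))  # total-inventory DOI
print(days_of_inventory(finished_goods, quarterly_cogs))   # finished-goods DOI
print(weeks_to_days(4))                                    # 4 weeks in days
```

With most of the inventory tied up as work in progress, the same company can report a total DOI above 120 days and a finished-goods DOI near 4 weeks at the same time.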

Layer 2: FLASH / HDD Manufacturers (SanDisk / Western Digital / Seagate)

The three companies are trending differently: SanDisk is increasing inventory (actively stocking after signing NBM long-term agreements), Western Digital is decreasing inventory (DOI of 77 days, declining QoQ as strong demand digests stock quickly), and Seagate is in the middle (stable at 87 days).

Layer 3: Taiwan Module Makers and Niche Players

Taiwanese module makers all increased stockpiling in Q1 2026. Phison Electronics' inventory exceeded 50 billion NTD, and it signed LTAs with top CSPs/OEMs. ADATA aims to lift inventory to over 50 billion NTD by the end of June. Team Group announced: "DRAM and NAND demand expected to remain strong, will ensure stable supply through long-term procurement strategies".

Niche players are moving in the opposite direction: inventory and DOI for Winbond, Macronix, and ESMT are all declining QoQ (Winbond DOI 145 days, Macronix 163 days, ESMT 210 days). Niche products (especially eMMC) are in more severe shortage, and manufacturers have no extra capacity to stockpile.

Layer 4: Mainland China Module Makers

The three module makers hold a combined 42.22 billion RMB in inventory, with average monthly profit of about 1 billion RMB, giving them sufficient funds to support stockpiling strategies. The combined inventory of Mainland China niche players (GigaDevice, Puya, Dosilicon, Ingenic, Giantec, HeXin) has increased QoQ for eight consecutive quarters, but its growth rate is far lower than that of the module makers, and DOI is trending downward (improving within a range of 175-453 days).

Reshaping of Inventory Behavior by Long-Term Agreements and Locked Orders

Combining data from all four layers, the inventory structure of the industry chain is clear:

Manufacturers' finished goods inventory at 4-week level, zero buffer on supply side
Midstream module makers are actively building inventory, locking future supply, betting on continued price increases
Downstream customers are signing LTAs and other long-term agreements: SanDisk signed 5 NBMs covering over 1/3 of FY2027 bit output, including over $11 billion in financial guarantees; Micron signed its first five-year SCA
HDD LTAs already extend to 2028-2029

In this extremely low inventory environment, a phenomenon called "Double Ordering" begins to spread. Because customers worry they cannot obtain enough chips, they may place orders exceeding their actual needs with multiple suppliers to ensure they eventually receive the minimum quantity required for production. This behavior artificially amplifies market demand signals in the short term, making orders seen by manufacturers potentially higher than real demand, further pushing up price expectations and manufacturers' expansion willingness.

The risk is equally clear. Once supply is released, these hoarded inventories could instantly flip from the demand side to the supply side. The true scale of Double Ordering is hard to parse; when everyone is scrambling for capacity, the boundary between real demand and panic order-locking is blurred.
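A small Python sketch can show how Double Ordering inflates the demand signal that manufacturers see. The model is deliberately simple and is an assumption, not something the article specifies: one customer splits orders evenly across several suppliers, and each order is a fixed share of its true need. All names and numbers are hypothetical.

```python
def apparent_demand(true_need: float, suppliers: int, order_share: float) -> float:
    """Total units ordered when a customer asks each supplier for
    order_share * true_need, hoping that enough orders get filled."""
    if suppliers < 1:
        raise ValueError("suppliers must be at least 1")
    if order_share < 0:
        raise ValueError("order_share must be non-negative")
    return true_need * suppliers * order_share


def inflation_ratio(true_need: float, suppliers: int, order_share: float) -> float:
    """How many times larger the visible order book is than real demand."""
    if true_need <= 0:
        raise ValueError("true_need must be positive")
    return apparent_demand(true_need, suppliers, order_share) / true_need


# Hypothetical customer: needs 100 units, orders 60 from each of 3 suppliers.
print(apparent_demand(100.0, 3, 0.6))   # units visible in supplier order books
print(inflation_ratio(100.0, 3, 0.6))   # order book relative to real need
```

In this example the order books show 1.8 times the customer's real need. Once supply loosens, the excess orders can be cancelled or the surplus resold, which is the flip from demand side to supply side described above.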

Brokerage reports emphasize that NAND spot prices dropped 30-40% over the past month, mainly because traders under funding pressure cut prices to raise liquidity and buyers were digesting inventory. However, TrendForce still expects Q2 traditional DRAM contract prices to rise 58-63% QoQ and NAND contract prices to rise 70-75% QoQ, so the upward trend has not reversed.

Therefore, the current low inventory state both reflects the intensity of the supercycle and carries the risk of increased future market volatility. For investors, closely tracking changes in inventory days is one of the most important ways to judge when the cycle is nearing its peak. When inventory begins to recover from the bottom and consistently exceeds 120 days (the tightness line for DRAM/HBM) or 150 days (the normal line), it may indicate a subtle shift in supply-demand dynamics.
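The inventory-days check described above can be written as a small monitoring rule. The Python sketch below uses the 120-day and 150-day lines from the text; the zone labels, the reading of "consistently" as two consecutive periods, and the sample history are assumptions made for illustration.

```python
from typing import Sequence

TIGHT_LINE_DAYS = 120    # tightness line for DRAM/HBM cited in the text
NORMAL_LINE_DAYS = 150   # normal line cited in the text


def classify_inventory(days: float) -> str:
    """Place an inventory-days reading in one of three zones.
    The zone names are illustrative labels, not the article's terms."""
    if days < TIGHT_LINE_DAYS:
        return "tight"
    if days <= NORMAL_LINE_DAYS:
        return "recovering"
    return "above normal"


def sustained_above(history: Sequence[float], threshold: float,
                    periods: int = 2) -> bool:
    """True if the last `periods` readings all exceed the threshold.
    Treating 'consistently' as two periods is an assumption."""
    recent = list(history)[-periods:]
    return len(recent) == periods and all(d > threshold for d in recent)


# Hypothetical quarterly inventory-days readings, oldest first.
history = [95.0, 110.0, 125.0, 131.0]
print(classify_inventory(history[-1]))
print(sustained_above(history, TIGHT_LINE_DAYS))
print(sustained_above(history, NORMAL_LINE_DAYS))
```

In this hypothetical series the last two readings sit above the 120-day line but below the 150-day line, the kind of pattern the text suggests watching as a possible early shift.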

Inflection Point Judgment: Four Core Indicators—Capacity Utilization, Spot Prices, Contract Prices, and Inventory Days

In the highly cyclical storage chip industry, accurately judging cycle inflection points (both peaks and troughs) is key to investment success. Although many believe this AI-driven supercycle has a "structural shift" nature, the industry's inherent cyclical patterns haven't completely vanished. Therefore, establishing an effective monitoring system to track changes in core indicators is crucial for catching signals of a cycle turn in time.

When "spot price correction + contract prices continue rising" occur simultaneously, it does not necessarily signal a price peak; it reflects confirmation of structural tightness. The current state (May 2026) is exactly this.

Contracts reflect real supply-demand: contract market participants are large customers like CSPs and OEMs, reflecting real production demand; the spot market is dominated by traders and is easily influenced by sentiment
Long-term agreements lock price differentials: Micron signed a 5-year SCA, SanDisk signed 5 NBMs (covering over 1/3 of FY2027 bit output), and Western Digital/Seagate LTAs extend to 2028-2029; these long-term agreement prices are unaffected by short-term spot fluctuations
Capacity allocation mechanism: manufacturers prioritize high-value contract customers and reduce supply to PC/consumer segments; contract prices that keep rising alongside a spot price correction are precisely what shows structural tightness is intensifying

A confirmed cycle peak signal can be identified only when three conditions are met simultaneously: "Contract price QoQ increase for two consecutive quarters 95% to <90%". Currently, none of these three conditions has been triggered.

Indicator One: Capacity Utilization (Latest Q1 2026 Data)

SemiAnalysis: "In 2026, almost all new wafer capacity will be concentrated in three wafer fabs: Samsung's P4, SK Hynix's M15X, and Micron's A3. Among them, both Micron's A3 and SK Hynix's M15X are expected to be primarily used for HBM production rather than traditional DRAM, which will limit the increment in wafer and bit output. For NAND, there is almost no actual wafer increment contribution in 2026, bit growth mainly relies on technology process upgrades."

This means: Even if capacity utilization rises from 95% to 98%, effective supply of traditional DRAM/NAND sees almost zero growth.

Indicator Two: Spot Price Index (As of May 8, 2026)

Data Source: Wind

Detailed DRAM Spot Prices (Daily, as of May 8):

"Scissors Gap" Phenomenon between DDR4 and DDR5: DDR4 spot correction is due to previous excessive rise (large gap with contract prices), not demand collapse. DDR5 spot is still rising (WoW +2.14%), indicating AI server demand (mainly using DDR5) remains strong.

NAND Spot Prices (Weekly, as of April 27):

Three Reasons for NAND Spot Correction (China Merchants Securities): previous excessive gains (prices doubled in 3 months), traders' funding pressure forcing sales for liquidity, and buyers generally staying on the sidelines. Contract prices, however, are completely unaffected: TrendForce expects Q2 NAND contract prices to still rise 70-75% QoQ.

Indicator Three: Latest Contract Price Forecast (TrendForce May 2026)

Sub-category Contract Price Forecast (Q2 2026):

Bernstein Mark Li (May 7, 2026): "Price increases are expected to decelerate notably into the third quarter. We expect the cycle to peak in mid-2027, with a gentle descent thereafter."

Indicator Four: Inventory Days (Latest Q1 2026)

Four-layer inventory structure:

Manufacturers' finished goods inventory at 4-week level (SK Hynix finished goods only 28 days), zero buffer on supply side
Midstream module makers actively building inventory (Longsys + Silicon Motion + BIWIN combined 42.22 billion RMB, DOI 270-303 days), betting on continued price increases
Downstream customers signing LTAs and long-term agreements (SanDisk 5 NBMs covering over 1/3 of FY2027 bit output, Micron's first 5-year SCA)
HDD LTAs already extend to 2028-2029

Comprehensive Judgment: Where is the Cycle Currently Positioned?

Conclusion: The current (May 2026) storage supercycle is in the mid-phase of the upcycle (about the 60-70% position), not near its end. Core arguments: capacity utilization still has room to improve, contract prices are still rising strongly, manufacturer inventory is at strategic lows, and a long-term agreement lock-in proportion above 50% significantly reduces the risk of "demand collapse".

Related Questions

Q: According to the article, what is the fundamental bottleneck in AI inference performance during the Decode stage, and why?

A: The fundamental bottleneck is HBM (High Bandwidth Memory) bandwidth and capacity. During the Decode stage, the GPU generates tokens one by one. For each token, it must repeatedly read the massive model weights and the ever-growing KV Cache from HBM. The arithmetic intensity (ops per byte) in this stage is very low (1-2 ops/byte), meaning the compute units (Tensor Cores) finish work quickly and spend most of the time waiting for data to be read from HBM. Therefore, the speed of generating tokens (TPOT) is directly limited by how fast data can be read from HBM (bandwidth), while HBM capacity determines how many concurrent requests can be served, as KV Cache for each request must reside in HBM.

Q: How does the production of HBM impact the supply of traditional DRAM and contribute to the current chip shortage?

A: The production of HBM creates a severe 'crowding-out effect' on traditional DRAM supply. Manufacturing HBM is much more wafer-intensive; producing $100 worth of HBM consumes 3-4 times the wafer capacity needed to produce standard DRAM of the same value. To meet HBM demand, manufacturers like Samsung and SK Hynix have shifted 30-40% of their consumer DRAM wafer capacity to HBM production. This massive structural shift in capacity allocation has drastically reduced the available supply of standard DRAM (like DDR4/DDR5), leading to extreme shortages and price spikes in those markets, while also occupying advanced packaging (CoWoS) production lines.

Q: What are the three primary demand engines driving the current 'super cycle' in the memory chip market, as described in the article?

A: The three primary demand engines are: 1) AI Training: The parameter arms race for ever-larger models (from hundreds of billions to trillions of parameters) drives explosive demand for HBM capacity and high-performance storage (NVMe SSDs, HDDs) to handle datasets and model checkpoints. 2) AI Inference: The shift where inference now accounts for 66% of AI compute, driven by Agentic AI. Its long-context, KV-Cache-heavy, and massively scaled deployment nature creates sustained, 'long-tail' demand for HBM (bandwidth/capacity), enterprise SSDs, and emerging technologies like HBF. 3) Data Center Expansion: Broad hyperscaler capital expenditure increases, encompassing traditional cloud, big data, and AI, amplify overall demand. AI is reshaping data center storage hierarchies, leading to a resurgence in demand for high-capacity nearline HDDs as a cost-effective solution for cold data.

Q: What is the key difference between HBM and the emerging HBF (High Bandwidth Flash) technology, and what problem is HBF designed to solve?

A: HBM (High Bandwidth Memory) is a high-end, expensive, 3D-stacked DRAM technology integrated with GPUs, providing extreme bandwidth but limited capacity. HBF (High Bandwidth Flash) is a new type of storage built on 3D-stacked NAND Flash, designed as a high-bandwidth, high-capacity 'intermediate layer' for AI inference. It aims to solve the contradiction where HBM is fast but capacity-limited and costly, while standard SSDs are high-capacity but too slow for high-performance inference. HBF provides bandwidth closer to HBM and capacity closer to SSDs at a lower cost, serving as a 'near-line high-speed storage layer' for storing large model weights to feed GPUs more efficiently during inference.

Q: Based on the four key indicators analyzed (capacity utilization, spot/contract prices, inventory days), what is the article's conclusion about the current phase of the memory super cycle?

A: The article concludes that the memory super cycle, as of May 2026, is in the middle phase (approximately 60-70% progression) of its upward cycle, not near its end. The key supporting evidence includes: capacity utilization still has room to increase; contract prices are still rising strongly (forecasted +58-63% for DRAM, +70-75% for NAND in Q2); memory manufacturers' finished goods inventory is at a strategically low level (e.g., SK Hynix reports only ~4 weeks); and the high proportion of long-term agreements (LTA/SCA/NBM) covering a significant portion of future production has significantly reduced the risk of a sudden demand collapse.

Nội dung Liên quan

Lãi Mở XRP Chạm Mốc 2,6 Tỷ USD Khi Nhu Cầu Phái Sinh Tăng

Dữ liệu từ CoinGlass cho thấy lãi mở (open interest) của hợp đồng tương lai XRP đã tăng hơn 10% trong 24 giờ, đạt mốc 2,6 tỷ USD, đưa XRP trở thành một trong những tài sản tiền điện tử có lãi mở phái sinh lớn nhất. Mức tăng này phản ánh nhu cầu giao dịch phái sinh quanh XRP đang gia tăng đáng kể. Tuy nhiên, lãi mở tăng cho thấy nhiều vị thế đang được mở ra, nhưng không tự động chỉ ra rằng dòng tiền này lạc quan hay bi quan về giá. Nó có thể đến từ các vị thế mua, bán khống, phòng ngừa rủi ro hoặc giao dịch đòn bẩy. Do đó, sự tích tụ này có thể hỗ trợ một đợt biến động mạnh hơn, nhưng cũng làm tăng rủi ro biến động và thanh lý. Các nhà giao dịch cần xem xét thêm các chỉ số khác như phí funding, khối lượng giao dịch spot, hướng giá và dữ liệu thanh lý để có bức tranh đầy đủ. Sự gia tăng lãi mở cho thấy XRP đang thu hút sự chú ý nghiêm túc từ thị trường phái sinh, nhưng sự bền vững của xu hướng sẽ phụ thuộc vào việc liệu hoạt động spot có tăng theo để xác nhận hay không. Kết quả cuối cùng có thể là một xu hướng mạnh hơn hoặc đơn giản là thêm biến động cho một thị trường vốn đã sôi động.

bitcoinist47 phút trước

Lãi Mở XRP Chạm Mốc 2,6 Tỷ USD Khi Nhu Cầu Phái Sinh Tăng

bitcoinist47 phút trước

Dự đoán giá Bitcoin năm 2030: Đây là những điều bạn nên biết về đợt tăng giá tiếp theo

Giá Bitcoin đang trong xu hướng giảm kể từ đợt sụt giảm vào ngày 10 tháng 10 năm 2025. Để xác định đáy của thị trường gấu, một chỉ báo quan trọng cần theo dõi là dòng tiền stablecoin chảy vào các sàn giao dịch, vì dòng chảy mạnh thường báo hiệu sự thay đổi tâm lý và thúc đẩy đà tăng giá. Nhà phân tích Joao Wedson dựa trên phân tích fractal dự đoán rằng đáy chu kỳ này có thể nằm trong khoảng $41,5k-$45k và đạt được vào nửa đầu tháng 10 năm 2026. Tuy nhiên, đây không phải là dự đoán chắc chắn mà chỉ dựa trên mô hình lịch sử. Xem xét về triển vọng giá Bitcoin đến năm 2030, phân tích kỹ thuật sử dụng các mức Fibonacci gợi ý rằng BTC có thể giảm về vùng $39,1k (gần với mức dự báo của Wedson) trước khi tiếp tục xu hướng tăng dài hạn. Nếu lặp lại kịch bản tương tự chu kỳ trước, đà tăng có thể vượt qua mức mở rộng 61.8% ở $152,3k và hướng tới mức cao trong khoảng $200k-$220k vào năm 2030, trước khi bước vào chu kỳ gấu tiếp theo. Cần lưu ý rằng chu kỳ hiện tại có thể kéo dài hơn so với trước đây.

ambcrypto1 giờ trước

Dự đoán giá Bitcoin năm 2030: Đây là những điều bạn nên biết về đợt tăng giá tiếp theo

ambcrypto1 giờ trước

Nhịp Đập Thị Trường BTC: Tuần 30

Bitcoin (BTC) đã hồi phục từ dưới 58.000 USD để thử nghiệm mức 65.000 USD trước khi đi vào giai đoạn củng cố quanh 64.500 USD. Động lượng tăng đã chậm lại và khối lượng giao dịch spot vẫn ở mức thấp. Mặc dù vậy, sự phục hồi được duy trì trong bối cảnh thị trường đang tìm kiếm điểm cân bằng vững chắc hơn. Chênh lệch biến động (volatility spreads) thu hẹp cho thấy thị trường phái sinh không còn định giá phí bảo hiểm rủi ro cao, phản ánh tâm lý phòng thủ giảm bớt. Dù sự tham gia của thị trường giao ngay còn yếu, khẩu vị đầu cơ đang dần quay trở lại. Lãi suất mở (open interest) cho hợp đồng tương lai và quyền chọn tăng lên, dòng tiền của nhà giao dịch vĩnh viễn (perpetual taker flow) chuyển sang mua ròng và nhu cầu bảo vệ trước rủi ro giảm xuống. Hoạt động on-chain cũng đang ổn định, được hỗ trợ bởi sự cải thiện vừa phải về thông lượng kinh tế và sự tham gia của người dùng. Dòng vốn vẫn thận trọng, nhưng dòng tiền ETF spot tại Mỹ đang phục hồi và các nhóm ETF đang trở lại gần mức hòa vốn, cho thấy áp lực bán từ tổ chức đang giảm dần. Nhìn chung, thị trường Bitcoin dường như ngày càng cân bằng hơn, với niềm tin dài hạn tạo đà hỗ trợ trong khi sự tham gia đầu cơ vẫn được kiểm soát. Tuy nhiên, tỷ trọng ngày càng tăng của vốn ngắn hạn nhạy cảm với giá cả làm tăng khả năng biến động mạnh hơn, khiến thị trường vẫn kiên cường nhưng ngày càng nhạy cảm với sự thay đổi trong động lượng và áp lực bán.

insights.glassnode2 giờ trước

insights.glassnode2 giờ trước

Nhu cầu Bitcoin tại thị trường giao ngay suy yếu khi vốn mới do dự bất chấp dòng tiền vào ETF

Mặc dù dòng tiền vào các quỹ ETF Bitcoin đã chuyển sang tích cực kể từ giữa tháng 7, nhưng điều này vẫn chưa đủ để đưa giá Bitcoin vượt lên vùng cung địa phương quanh mốc 65.000 USD. Theo phân tích từ CryptoQuant, nhu cầu mua Bitcoin trên thị trường giao ngay (spot) trong 30 ngày đã suy yếu đáng kể, giảm từ -80.000 BTC xuống -170.000 BTC. Sự ổn định tương đối của giá hiện tại chủ yếu được hỗ trợ bởi các lệnh mua để đóng vị thế bán (short-covering) trên thị trường phái sinh và áp lực bán từ các nhà đầu tư ngắn hạn đã giảm bớt. Chỉ số "Nhà đầu tư mới vào Bitcoin", đo lường tỷ trọng vốn hóa nắm giữ bởi các đồng coin non trẻ (dưới 1 tháng tuổi), vẫn ở gần mức thấp nhất trong năm, cho thấy sự tham gia của dòng vốn mới còn yếu. Tỷ lệ lợi nhuận trên đầu ra đã chi tiêu của nhà đầu tư ngắn hạn (STH SOPR) cũng duy trì dưới mức 1.0, có nghĩa là họ vẫn đang chốt lỗ trung bình. Những chỉ báo này cùng củng cố quan điểm rằng thị trường hiện đang trong giai đoạn ổn định cục bộ, chứ chưa có dấu hiệu đảo chiều tăng mạnh. Các nhà phân tích nhấn mạnh rằng, cần theo dõi việc giá có thể vượt qua đỉnh dao động địa phương 67.300 USD hay không để xác nhận một sự đảo chiều thực sự.

ambcrypto4 giờ trước

Nhu cầu Bitcoin tại thị trường giao ngay suy yếu khi vốn mới do dự bất chấp dòng tiền vào ETF

ambcrypto4 giờ trước

Vì sao giao dịch "cá voi" 32,6 triệu USD của Chainlink có thể định hình đà tăng của LINK về phía 9 USD

Một giao dịch chuyển 3,89 triệu LINK (trị giá 32,58 triệu USD) từ ví Coinbase Institutional sang một ví ẩn danh đã thu hút sự chú ý đến Chainlink. Chuyển động này làm dấy lên suy đoán về việc các tổ chức lớn đang tích lũy thay vì bán ra ngay lập tức. Dòng tiền ròng trên sàn giao dịch đã chuyển sang dương với lượng vào khoảng 620,18 nghìn USD, đánh dấu sự thay đổi sau một thời gian dài dòng tiền ra chiếm ưu thế. Tuy nhiên, tâm lý trên thị trường phái sinh vẫn thận trọng, với chỉ số CVD 90 ngày cho thấy lệnh bán vẫn chiếm ưu thế. Về mặt kỹ thuật, LINK đang giao dịch quanh mức 8,35 USD, thử thách vùng kháng cự gần đó. Chỉ số RSI ở mức 57,71 cho thấy áp lực mua đang tăng lên. Để duy trì đà hồi phục, giá cần vượt qua được ngưỡng 8,35 USD, từ đó mở đường hướng tới mục tiêu 9,00 USD. Ngược lại, nếu bị từ chối tại đây, giá có thể quay lại kiểm tra vùng hỗ trợ 8,18 USD. Tóm lại, hoạt động của ví tổ chức và dòng tiền ròng trên sàn là tín hiệu tích cực, nhưng tâm lý bán trên thị trường phái sinh vẫn tạo ra sự thận trọng, khiến triển vọng ngắn hạn của LINK chưa thực sự rõ ràng.

ambcrypto5 giờ trước

Giao dịch

Giao ngay

Bài viết Nổi bật

GROK AI là gì

Grok AI: Cách mạng hóa Công nghệ Đối thoại trong Kỷ nguyên Web3 Giới thiệu Trong bối cảnh trí tuệ nhân tạo đang phát triển nhanh chóng, Grok AI nổi bật như một dự án đáng chú ý kết nối các lĩnh vực công nghệ tiên tiến và tương tác người dùng. Được phát triển bởi xAI, một công ty do doanh nhân nổi tiếng Elon Musk dẫn dắt, Grok AI nhằm định nghĩa lại cách chúng ta tương tác với trí tuệ nhân tạo. Khi phong trào Web3 tiếp tục phát triển mạnh mẽ, Grok AI hướng tới việc tận dụng sức mạnh của AI đối thoại để trả lời các câu hỏi phức tạp, mang đến cho người dùng một trải nghiệm không chỉ thông tin mà còn giải trí. Grok AI là gì? Grok AI là một chatbot AI đối thoại tinh vi được thiết kế để tương tác với người dùng một cách linh hoạt. Khác với nhiều hệ thống AI truyền thống, Grok AI chấp nhận một loạt các câu hỏi rộng hơn, bao gồm những câu hỏi thường được coi là không phù hợp hoặc ngoài các phản hồi tiêu chuẩn. Các mục tiêu cốt lõi của dự án bao gồm: Lập luận đáng tin cậy: Grok AI nhấn mạnh lập luận theo lẽ thường để cung cấp các câu trả lời hợp lý dựa trên sự hiểu biết về ngữ cảnh. Giám sát có thể mở rộng: Việc tích hợp công cụ hỗ trợ đảm bảo rằng các tương tác của người dùng được theo dõi và tối ưu hóa về chất lượng. Xác minh chính thức: An toàn là điều tối quan trọng; Grok AI tích hợp các phương pháp xác minh chính thức để nâng cao độ tin cậy của các đầu ra của nó. Hiểu biết về ngữ cảnh dài: Mô hình AI xuất sắc trong việc giữ lại và nhớ lại lịch sử cuộc trò chuyện dài, tạo điều kiện cho các cuộc thảo luận có ý nghĩa và nhận thức về ngữ cảnh. Khả năng chống lại các cuộc tấn công: Bằng cách tập trung vào việc cải thiện khả năng phòng thủ chống lại các đầu vào bị thao túng hoặc độc hại, Grok AI nhằm duy trì tính toàn vẹn của các tương tác của người dùng. Nói tóm lại, Grok AI không chỉ là một thiết bị truy xuất thông tin; nó là một đối tác đối thoại hấp dẫn khuyến khích cuộc trò chuyện năng động. 