The Next Earthquake in AI: Why the Real Danger Isn't the SaaS Killer, but the Computing Power Revolution

marsbit · Published on 2026-02-11 · Last updated on 2026-02-11

Abstract

The next seismic shift in AI is not the threat of "SaaS killers" but a fundamental revolution in computing power. While many focus on how AI applications like Claude Cowork are disrupting traditional software, the real transformation is happening beneath the surface, in the infrastructure that powers AI. Two converging technological paths are challenging NVIDIA's GPU dominance:

1. **Algorithmic efficiency**: DeepSeek's Mixture-of-Experts (MoE) architecture allows massive models (e.g., DeepSeek-V2 with 236B parameters) to activate only a small fraction of their "experts" (under 9% of parameters) per computation, achieving GPT-4-level performance at roughly 10% of the computational cost. This decouples AI capability from sheer compute.
2. **Specialized hardware**: Inference-optimized chips from companies like Cerebras and Groq integrate memory directly onto the chip, eliminating data-transfer delays. This "zero-latency" design drastically improves speed and efficiency, prompting even OpenAI to sign a $10B deal with Cerebras.

Together, these advances could trigger a cost collapse: training costs may drop by 90%, and inference costs could fall by an order of magnitude. The total cost of running world-class AI may plummet to 10-15% of current GPU-based solutions. This paradigm shift threatens NVIDIA's valuation, which is built on the assumption of perpetual GPU dominance. If the market realizes that GPUs are no longer the only, or the best, option, the foundation of NVIDIA's trillions in market cap could crumble.

Written by: Bruce

Lately, the entire tech and investment communities have been fixated on the same thing: how AI applications are "killing" traditional SaaS. Since @AnthropicAI's Claude Cowork demonstrated how easily it can help you write emails, create PPTs, and analyze Excel spreadsheets, a panic about "software is dead" has begun to spread. This is indeed frightening, but if your gaze stops here, you might be missing the real seismic shift.

It's like we're all looking up at a drone dogfight in the sky, but no one notices that the entire continental plate beneath our feet is quietly shifting. The real storm is hidden beneath the surface, in a corner most people can't see: the computing power foundation that supports the entire AI world is undergoing a "silent revolution."

And this revolution might end the grand party hosted by AI's shovel seller: NVIDIA @nvidia, much sooner than anyone imagined.

Two Revolutionary Paths Converging

This revolution isn't a single event, but rather the convergence of two seemingly independent technological paths. They are like two armies closing in, forming a pincer movement against NVIDIA's GPU hegemony.

The first path is the algorithm slimming revolution.

Have you ever thought about whether a super brain really needs to mobilize all its cells when thinking about a problem? Obviously not. DeepSeek figured this out with their Mixture of Experts (MoE) architecture.

You can think of it like a company with hundreds of experts in different fields. But every time you need to solve a problem, you only call upon the two or three most relevant experts, rather than having everyone brainstorm together. This is the cleverness of MoE: it allows a massive model to activate only a small portion of its "experts" during each computation, drastically saving computing power.

What's the result? The DeepSeek-V2 model has 236 billion parameters in total, but activates only 21 billion of them for each task, less than 9% of the whole. Yet its performance is comparable to GPT-4, which runs all of its parameters for every computation. What does this mean? AI capability is decoupling from the computing power it consumes!
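The routing idea behind this can be sketched in a few lines. This is a toy illustration with made-up sizes (8 experts, top-2 routing, 16-dimensional tokens), not DeepSeek's actual router:

```python
# Toy sketch of Mixture-of-Experts (MoE) top-k routing. All sizes are
# illustrative assumptions (8 experts, top-2), not DeepSeek's design.
import numpy as np

rng = np.random.default_rng(0)

NUM_EXPERTS = 8  # hypothetical expert count
TOP_K = 2        # experts activated per token
DIM = 16         # token vector dimension

def moe_layer(x, expert_weights, router_weights, top_k=TOP_K):
    """Route token x to its top-k experts and mix their outputs."""
    logits = x @ router_weights          # one router score per expert
    top = np.argsort(logits)[-top_k:]    # indices of the k best experts
    gates = np.exp(logits[top])
    gates /= gates.sum()                 # softmax over selected experts only
    # Only k of the NUM_EXPERTS weight matrices are ever multiplied.
    return sum(g * (x @ expert_weights[i]) for g, i in zip(gates, top))

experts = rng.standard_normal((NUM_EXPERTS, DIM, DIM))
router = rng.standard_normal((DIM, NUM_EXPERTS))
y = moe_layer(rng.standard_normal(DIM), experts, router)

# 2 of 8 experts -> 25% active in this toy; DeepSeek-V2: 21B of 236B (~9%).
print(f"active fraction: {TOP_K / NUM_EXPERTS:.0%}")
```

The compute saving comes from the loop body: only the selected expert matrices are ever touched, so FLOPs per token scale with the active parameters, not the total.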

In the past, we all assumed that the stronger the AI, the more GPUs it would need. Now, DeepSeek shows us that through clever algorithms, the same results can be achieved at one-tenth the cost. That puts a huge question mark over how essential NVIDIA's GPUs really are.

The second path is the hardware "lane change" revolution.

AI work is divided into two phases: training and inference. Training is like going to school, requiring reading countless books (data); here, GPUs with their "brute force" parallel computing are indeed useful. But inference is like our daily use of AI, where response speed is more critical.

GPUs have an inherent weakness in inference: their memory (HBM) sits outside the compute die, and shuttling data back and forth adds latency. It's like a chef whose ingredients are in a fridge in the next room; every time they cook, they have to run over to get them, and no matter how fast they run, it can't be instant. Companies like Cerebras and Groq started from scratch, designing dedicated inference chips that build the memory (SRAM) directly into the chip itself, putting the ingredients right at hand and achieving "zero-latency" access.
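A back-of-envelope calculation shows why memory placement dominates inference speed: autoregressive decoding is largely bandwidth-bound, because every generated token must stream the active weights through the compute units. Both bandwidth figures below are assumptions for illustration, not vendor specifications:

```python
# Back-of-envelope: per-token decode time ~ active weight bytes / memory
# bandwidth, since autoregressive inference is typically bandwidth-bound.
# Both bandwidth numbers are illustrative assumptions, not vendor specs.

ACTIVE_PARAMS = 21e9      # DeepSeek-V2 active parameters per token
BYTES_PER_PARAM = 2       # fp16/bf16 weights

HBM_BW = 3.0e12           # ~3 TB/s: assumed off-chip HBM bandwidth
ON_CHIP_BW = 2.0e15       # ~2 PB/s: assumed aggregate on-chip SRAM bandwidth

def decode_ms(bandwidth):
    """Milliseconds to stream the active weights once at this bandwidth."""
    return ACTIVE_PARAMS * BYTES_PER_PARAM / bandwidth * 1e3

print(f"HBM-bound : {decode_ms(HBM_BW):.1f} ms/token")   # 14.0 ms/token
print(f"SRAM-bound: {decode_ms(ON_CHIP_BW):.3f} ms/token")
print(f"speedup   : {ON_CHIP_BW / HBM_BW:.0f}x")
```

Under these assumed numbers, keeping weights in on-chip SRAM wins by two to three orders of magnitude per token, which is the mechanism behind the chef-and-fridge analogy above.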

The market has voted with real money. OpenAI, while complaining about NVIDIA's GPU inference performance, turned around and signed a $10 billion deal with Cerebras specifically to rent its inference services. NVIDIA itself panicked, spending $20 billion to acquire Groq precisely so as not to fall behind in this new race.

When the Two Paths Converge: A Cost Avalanche

Now, let's put these two things together: run an algorithmically "slimmed-down" DeepSeek model on a hardware platform with "zero latency" like a Cerebras chip.

What happens?

A cost avalanche.

First, the slimmed-down model is small enough to be loaded entirely into the chip's built-in memory at once. Second, without the external memory bottleneck, the AI's response speed becomes astonishingly fast. The final result: training costs drop by 90% due to the MoE architecture, and inference costs drop by another order of magnitude due to specialized hardware and sparse computation. In total, the cost of owning and operating a world-class AI could be just 10%-15% of the traditional GPU solution.
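The arithmetic behind that 10%-15% figure can be made explicit. The two reduction factors come from the article's claims; the 30/70 training/inference spend split is an added assumption for illustration:

```python
# Worked arithmetic for the "cost avalanche". The reduction factors are
# the article's claims; the 30/70 training/inference spend split is an
# assumption added here for illustration.

GPU_BASELINE = 1.0               # normalized cost of a dense-model GPU stack

training = GPU_BASELINE * 0.10   # MoE: training cost drops ~90%
inference = GPU_BASELINE * 0.10  # specialized chips: ~10x cheaper inference

TRAIN_SHARE, INFER_SHARE = 0.3, 0.7   # assumed split of total AI spend
combined = TRAIN_SHARE * training + INFER_SHARE * inference

print(f"combined cost vs GPU baseline: {combined:.0%}")  # 10%
```

Because both reductions land near 10% of baseline, any spend split gives roughly 10% overall; a less aggressive inference reduction (say, to 15%-20% of baseline) yields the article's upper bound of 15%.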

This isn't an improvement; it's a paradigm shift.

The Rug Is Being Pulled Out from Under NVIDIA's Throne

Now you should understand why this is more fatal than the "Cowork panic."

NVIDIA's multi-trillion dollar valuation today is built on a simple story: AI is the future, and the future of AI depends on my GPUs. But now, the foundation of that story is being shaken.

In the training market, even if NVIDIA maintains its monopoly, if customers can do the work with one-tenth the GPUs, the overall size of this market could shrink significantly.

In the inference market, a pie ten times the size of training, NVIDIA not only lacks an absolute advantage but also faces a siege from players like Google and Cerebras. Even its biggest customer, OpenAI, is defecting.

Once Wall Street realizes that NVIDIA's "shovels" are no longer the only, or even the best, option, what will happen to the valuation built on the expectation of "permanent monopoly"? I think we all know.

Therefore, the biggest black swan in the next six months might not be which AI application has killed what, but a seemingly insignificant piece of tech news: for example, a new paper on the efficiency of MoE algorithms, or a report showing a significant increase in market share for dedicated inference chips, quietly announcing a new phase in the computing power war.

When the "shovel seller's" shovels are no longer the only choice, his golden age may well be over.

Related Questions

Q: What is the core argument of the article regarding the next major shift in AI?

A: The article argues that the next major shift in AI is not the threat of AI applications killing traditional SaaS, but rather a "silent revolution" in the computing power that underpins the entire AI world. This revolution, driven by algorithmic efficiency and new hardware, threatens to disrupt the dominance of companies like NVIDIA.

Q: What are the two technological paths converging to challenge NVIDIA's GPU dominance?

A: The two converging paths are: 1) the algorithmic "slimming revolution," exemplified by Mixture-of-Experts (MoE) architectures like DeepSeek's, which drastically reduce the computing power needed for a given level of performance; and 2) the hardware "lane-change revolution," with companies like Cerebras and Groq designing specialized inference chips that eliminate memory bottlenecks, offering vastly faster and more efficient processing than traditional GPUs.

Q: How does the Mixture-of-Experts (MoE) architecture, as used in DeepSeek-V2, achieve its efficiency?

A: The MoE architecture works like a company of experts. Instead of activating the entire model for every task, it activates only the most relevant small subset of "experts" (a fraction of the total parameters). For example, DeepSeek-V2 has 236 billion parameters but activates only 21 billion (less than 9%) for a given task, achieving performance comparable to models that activate all of their parameters, thus decoupling AI capability from compute consumption.

Q: What specific market action is cited as evidence of the shift away from NVIDIA's GPUs for AI inference?

A: The article cites OpenAI's actions as key evidence: while complaining about the inference performance of NVIDIA GPUs, OpenAI signed a $10 billion deal to rent inference services from Cerebras, a company specializing in dedicated inference chips. Additionally, NVIDIA's own response, a $20 billion acquisition of Groq, is presented as a move to avoid falling behind in this new hardware paradigm.

Q: What is the potential financial impact on NVIDIA if the described compute revolution succeeds?

A: The article suggests a potential "cost avalanche" in which the total cost of owning and operating a world-class AI drops to just 10-15% of a traditional GPU-based solution. This would severely challenge NVIDIA's business model, which is built on the premise that AI's future depends on its GPUs. If the market realizes NVIDIA's "shovels" are no longer the only or best option, the "permanent monopoly" expectation underpinning its multi-trillion-dollar valuation could collapse.
