NVIDIA's Jensen Huang Latest Article: AI Is a Five-Layer Cake, Each Layer Represents a Trillion-Dollar Opportunity

比推Published on 2026-03-11Last updated on 2026-03-11

Abstract

AI is a five-layer infrastructure stack that represents a fundamental shift in computing, moving from pre-recorded software to real-time intelligence generation. The layers are: Energy (the foundational layer converting power into computation), Chips (processors enabling massive parallel computation), Infrastructure (AI factories orchestrating hardware systems), Models (AI systems understanding diverse domains like biology, chemistry, and language), and Applications (where economic value is created in fields like drug discovery, robotics, and autonomous vehicles). Each successful application pulls demand through all underlying layers. This industrial transformation requires trillions in infrastructure investment and a skilled workforce, driving global growth. AI has now crossed a usability threshold, with models generating real economic value and open-source initiatives accelerating adoption. The scale of this build-out is unprecedented, positioning AI as critical infrastructure that will be built and used by every company and nation.

Original Title: AI Is a Five‐Layer Cake

Source: Nvidia

Compiled by: BitpushNews


Artificial intelligence is one of the most powerful forces shaping the world today. It is not just a clever application or a single model; it is infrastructure, like electricity and the internet.

AI runs on real hardware, real energy, and real economics. It consumes raw materials and transforms them into intelligence at scale. Every company will use it. Every country will build it.

To understand why AI is developing in this way, it helps to start from first principles and examine the fundamental changes happening in computing.

From Pre-recorded Software to Real-time Intelligence

For most of computing history, software was pre-recorded. Humans described an algorithm. Computers executed it. Data had to be carefully structured, stored in tables, and retrieved through precise queries. SQL became indispensable because it made that world feasible.

AI breaks this pattern.

For the first time, we have computers that can understand unstructured information. They can see images, read text, listen to sounds, and comprehend meaning. They can reason about context and intent. Most importantly, they generate intelligence in real time.

Every response is newly created. Every answer depends on the context you provide. This is not software retrieving stored instructions. This is software reasoning and generating intelligence on demand.

Because intelligence is generated in real time, the entire computing stack beneath it must be reinvented.

AI as Infrastructure

When you look at AI from an industrial perspective, it can be broken down into a five-layer stack.

Energy

The bottom layer is energy. Real-time generated intelligence requires real-time produced energy. Every generated token is the result of electrons moving, heat being managed, and energy being converted into computation. There is no abstraction layer below this. Energy is the first principle of AI infrastructure and the hard constraint on how much intelligence the system can produce.

Chips

Above energy are chips. These processors are designed to efficiently convert energy into computation at scale. AI workloads require massive parallelism, high-bandwidth memory, and fast interconnects. Advances in the chip layer determine how quickly AI can scale and how cheap intelligence can become.

Infrastructure

Above chips is infrastructure. This includes land, power delivery, cooling, buildings, networking, and systems that orchestrate tens of thousands of processors into a single machine. These systems are AI factories. They are not designed to store information. They are designed to manufacture intelligence.

Models

Above infrastructure are models. AI models understand multiple types of information: language, biology, chemistry, physics, finance, medicine, and the physical world itself. Language models are just one category. Some of the most transformative work is happening in protein AI, chemistry AI, physics simulation, robotics, and autonomous systems.

Applications

The top layer is applications, where economic value is created. Drug discovery platforms. Industrial robots. Legal assistants. Self-driving cars. A self-driving car is an AI application embodied in a machine. A humanoid robot is an AI application embodied in a body. Same stack. Different outcomes.

This is the five-layer cake:

Energy → Chips → Infrastructure → Models → Applications.

Every successful application pulls on every layer below it, all the way down to the power plant that sustain its life.

We are just beginning this construction. We have only invested a few hundred billion dollars. Trillions of dollars more in infrastructure need to be built.

Globally, we are seeing chip factories, computer assembly plants, and AI factories being built on an unprecedented scale. This is becoming the largest infrastructure construction in human history.

The workforce required to support this construction is immense. AI factories need electricians, plumbers, pipefitters, steelworkers, network technicians, installers, and operators.

These are skilled, well-paying jobs that are currently in high demand. You don't need a computer science PhD to participate in this transformation.

Meanwhile, AI is driving productivity across the entire knowledge economy. Take radiology as an example. AI now assists in reading scans, but the demand for radiologists is still growing. This is not a paradox.

The purpose of a radiologist is to care for patients. Reading scans is just one task. As AI takes on more routine work, radiologists can focus on judgment, communication, and care. Hospitals become more efficient. They serve more patients. They hire more staff.

Productivity creates capacity. Capacity creates growth.

What Changed in the Past Year?

Over the past year, AI crossed a significant threshold. Models became good enough for widespread use. Reasoning capabilities improved. Hallucinations decreased. Grounding improved significantly. For the first time, applications built on AI began to generate real economic value.

Applications in drug discovery, logistics, customer service, software development, and manufacturing have shown strong product-market fit. These applications strongly pull on every layer below them.

Open-source models play a key role here. Most models in the world are free. Researchers, startups, enterprises, and entire nations rely on open models to participate in the advancement of AI. When open models reach the frontier, they don't just change software. They activate demand across the entire stack.

DeepSeek-R1 is a powerful example. By making a powerful reasoning model widely available, it accelerates adoption at the application layer and increases demand for the training, infrastructure, chips, and energy beneath it.

What This Means

When you view AI as critical infrastructure, the implications become clear.

AI began with a transformer large language model. But it is far more than that. It is an industrial transformation reshaping how energy is produced and consumed, how factories are built, how work is organized, and how economies grow.

AI factories are being built because intelligence is now generated in real time. Chips are being redesigned because efficiency determines how quickly intelligence can scale. Energy becomes central because it sets the upper limit on how much intelligence can be produced. Applications are accelerating because the models beneath them have crossed the threshold of being useful at scale.

Each layer reinforces the others.

This is why the scale of construction is so vast. This is why it touches so many industries simultaneously. And this is why it will not be confined to a single country or a single domain. Every company will use AI. Every country will build it.

We are still in the early stages. Most of the infrastructure does not yet exist. Most of the workforce is not yet trained. Most of the opportunities have not yet been realized.

But the direction is clear.

AI is becoming the infrastructure of the modern world. And the choices we make now, how quickly we build, how broadly we participate, and how responsibly we deploy, will shape the character of this era.

Original link:https://www.bitpush.news/articles/7618907

Trending Cryptos

Related Questions

QWhat are the five layers of the AI infrastructure stack according to Jensen Huang?

AThe five layers of the AI infrastructure stack are: Energy → Chips → Infrastructure → Models → Applications.

QWhy is energy considered the foundational layer of the AI stack?

AEnergy is the foundational layer because real-time generated intelligence requires real-time energy. Every generated token is a result of electrons moving, heat being managed, and energy being converted into computation. It is the hard constraint on how much intelligence the system can produce.

QHow does AI changed the traditional paradigm of pre-recorded software?

AAI broke the traditional paradigm by enabling computers to understand unstructured information (images, text, sounds) and reason about context and intent. It generates intelligence in real-time, creating new outputs on-demand based on the provided context, rather than just retrieving stored instructions.

QWhat role do open-source models play in the AI ecosystem as described in the article?

AOpen-source models are crucial because they allow researchers, startups, enterprises, and nations to participate in advanced AI development. When open models reach the frontier, they accelerate adoption at the application layer and increase demand for the entire underlying stack, including training, infrastructure, chips, and energy.

QWhat is the economic impact of AI on specialized fields like radiology, according to the article?

AIn fields like radiology, AI assists with routine tasks such as reading scans. This increases hospital efficiency and capacity, allowing professionals like radiologists to focus more on judgment, communication, and patient care. This productivity gain creates capacity, which in turn drives growth and leads to hiring more staff.

Related Reads

GPT-5.6 Countdown: Abandon the Illusion of a Single API, Computational Iteration Can't Outpace a Single Page of Compliance

In mid-June, three seemingly independent industry events—the compliance-driven throttling of Fable 5, the open-sourcing of GLM-5.2, and the leaked release timeline for GPT-5.6—are pushing the global AI industry toward a watershed moment. These shifts signal a fundamental restructuring of the industry's underlying logic. First, **"usability" has substantially overtaken "advanced capabilities"** as the primary weight, pushing the global large language model (LLM) supply chain into a "dual-track" phase of controlled closed-source and local open-source coexistence. Second, **the competitive moats of closed-source giants are shifting**. Their technical focus is moving from "language intelligence" toward "spatial intelligence (world models)"—a domain heavily reliant on computing power. Third, faced with常态化 transnational compliance risks, **a "model-agnostic" decoupled design has become a survival necessity for application-layer developers to maintain business continuity.** The article details how Anthropic's Fable 5, despite its advanced engineering feats, was restricted for non-U.S. citizens within 72 hours of launch, highlighting how geopolitical compliance can instantly limit even the most advanced models. In response, the open-source camp, exemplified by Zhipu AI's MIT-licensed GLM-5.2, is gaining market share by offering stable performance improvements and significant cost advantages (up to 70% savings for enterprises), while achieving full adaptation with domestic semiconductor platforms. Meanwhile, closed-source leaders like OpenAI are pivoting. The anticipated GPT-5.6 reportedly shifts focus from language to spatial intelligence and world models, aiming to rebuild a generational gap in areas like 3D understanding, simulation, and industrial design that demand immense compute. The core conclusion is that the LLM supply chain's logic has changed. Enterprises must now evaluate infrastructure based on a composite of technical performance and policy compliance. For developers, complete reliance on a single closed-source API poses unacceptable risk. Implementing a truly model-agnostic architecture—enabling swift switches to compliant, locally deployable open-source alternatives—is no longer just good practice but a fundamental baseline for business continuity.

marsbit54m ago

GPT-5.6 Countdown: Abandon the Illusion of a Single API, Computational Iteration Can't Outpace a Single Page of Compliance

marsbit54m ago

Is the 'Token Subsidy War' Among AI Giants Almost Over?

The article discusses the ongoing "token subsidy war" among AI giants like OpenAI and Anthropic, questioning whether it's nearing its end. It reveals that current AI subscription prices are heavily subsidized, with some plans offering tokens at up to 70 times the actual cost to attract and retain heavy users, especially developers and enterprises. This strategy mirrors past internet-era subsidy battles, but with a key difference: AI tokens lack "lock-in" effects. Unlike ride-hailing or food delivery apps, users can easily switch between AI providers as APIs become standardized, making it difficult for companies to raise prices post-subsidy. The piece highlights a structural asymmetry in the competition. Giants like Google, with massive advertising revenue, can afford to subsidize tokens indefinitely, akin to using "tokens as a weapon." In contrast, venture-backed companies like OpenAI and Anthropic face pressure to become profitable, especially as they approach IPO. The article cites Google Ventures founder Bill Maris, who suggests Google could slash token prices by 80%, putting immense pressure on competitors. Two potential endgames are presented: the "internet service" model (subsidize, monopolize, then raise prices) and the "utility" model (tokens become a standardized, low-margin commodity like electricity). Given the low switching costs, the latter seems more likely. The competition may not have a single winner but could instead accelerate AI's evolution into a foundational, infrastructure-level technology, akin to a public utility. For now, users continue to benefit from heavily subsidized token costs.

marsbit1h ago

Is the 'Token Subsidy War' Among AI Giants Almost Over?

marsbit1h ago

Beyond the Stadium: The Profitable Games Surrounding the World Cup

"Beyond the Pitch: The Profit Game Around the World Cup" The FIFA World Cup transcends being a sporting spectacle, evolving into a massive global arena for speculation and profit-seeking. The 2026 tournament has amplified this dynamic, creating a multi-layered ecosystem of financial opportunism alongside the football. **Prediction markets** have surged into the mainstream. Platforms like Polymarket and Kalshi saw trading volumes for World Cup contracts soar, attracting new users with their financial trading model and high-profile, chain-based wealth stories that overshadow traditional sports betting in terms of growth and narrative. However, **traditional sportsbooks** remain the dominant force, leveraging established user habits, legal markets, and comprehensive product offerings to handle the vast majority of speculative wagers, with projections suggesting record-breaking betting volumes. Capital markets also react. **"Concept stocks"** in countries like South Korea and Japan experience volatile price swings based on team performance and anticipated fan spending on items like chicken, beer, and viewing parties, effectively becoming a stock market reflecting fan sentiment. The **ticket resale market** has become a sophisticated arena for arbitrage. Prices fluctuate wildly based on team draws and star power, with sellers sometimes listing tickets they don't yet own in a practice akin to short-selling, while FIFA's own "Right to Buy" tokens add another layer of speculative trading. **Collectibles and merchandise** offer another avenue. Panini sticker albums, with their inherent scarcity and nostalgic value, can become high-value collectibles. Limited-edition or locally themed jerseys command significant premiums on secondary markets, and even counterfeit vendors profit from fans' desire for affordable match-day identity. The **cryptocurrency** space has seen a frenzy of speculative, unauthorized World Cup-themed meme coins on chains like Solana. These tokens, often exploiting team names and player imagery, experience extreme pump-and-dump cycles, creating stories of massive gains for a few early entrants and steep losses for many others. Finally, an entire industry thrives on **providing information and tools** to other speculators. Developers create platforms like SeatSidekick to track ticket inventory and prices, while paid Telegram groups and subscriptions sell betting tips and predictions, monetizing the widespread desire for an informational edge. In essence, the World Cup has become a compressed, global laboratory for speculation. While the games determine champions on the field, a parallel, complex network of financial transactions—spanning prediction contracts, bets, stocks, tickets, collectibles, crypto, and information services—settles its own scores in the global market.

marsbit1h ago

Beyond the Stadium: The Profitable Games Surrounding the World Cup

marsbit1h ago

How Does Codex Use a Computer? Three Entry Points and Permission Boundaries

This article explains the three primary methods for Codex to interact with a computer, each with distinct use cases, permission boundaries, and trust levels. **1. Computer Use:** This offers the broadest access, allowing Codex to visually control and interact with the graphical user interface of authorized macOS/Windows apps, system settings, and even iOS simulators. It's ideal for tasks lacking APIs or structured tools, such as operating legacy software or multi-app workflows. However, it's the slowest method and has the widest permission scope, requiring careful supervision for sensitive actions. **2. Chrome Extension:** This grants Codex access to the user's logged-in Chrome browser state, including cookies, profiles, and open tabs. It's best for tasks requiring user identity across websites like Gmail, LinkedIn, Salesforce, or internal dashboards. Its key advantage is multi-tab control for complex workflows. While more powerful for browser-based tasks than Computer Use, it carries higher sensitivity as actions are performed under the user's identity. **3. In-App Browser:** This is a browser isolated within the Codex thread, separate from the user's personal browsing data. It excels in web development and debugging scenarios—previewing local servers, testing responsive layouts, or annotating designs directly on the page. Its isolation is a strength for development but a limitation for tasks requiring login sessions. The core principle is to choose the narrowest, safest, and most structured interface for the task. Use plugins or MCPs first, resort to visual control (Computer Use) only for GUI-dependent tasks, employ the Chrome extension for identity-reliant browser work, and prefer the In-App Browser for isolated development. **Appshots** are clarified as a fourth, complementary tool for *inputting* context—capturing a screenshot of a window to point Codex to something—rather than a method for Codex to *act*. Together, this layered approach highlights a key to AI agent productization: not granting unlimited permissions, but constraining them within clear boundaries for specific tasks while preserving user oversight.

marsbit3h ago

How Does Codex Use a Computer? Three Entry Points and Permission Boundaries

marsbit3h ago

Trading

Spot
Futures

Hot Articles

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of LAYER (LAYER) are presented below.

活动图片