Jensen Huang's 2026 GTC Taipei Speech: The Era of AI Agents is Here, Computing is Revenue

marsbit發佈於 2026-06-03更新於 2026-06-03

文章摘要

NVIDIA CEO Jensen Huang's 2026 GTC Taipei speech announces the arrival of the "Agent AI" era, where AI transitions from content generation to performing useful work. Huang positions tokens as units of profit and GDP, driving massive demand for computing power and "AI factories." NVIDIA's strategy revolves around a new computing paradigm centered on AI agents, which combine large language models (LLMs) with agent frameworks for planning, memory, and tool use. Key announcements include: * **Vera Rubin:** A complete, end-to-end system (not just a GPU) designed from the ground up to run AI agents at scale, representing NVIDIA's evolution into an infrastructure company. * **Vera CPU:** A revolutionary CPU architecture built specifically for impatient AI agents, prioritizing low latency, single-thread performance, and massive bandwidth over traditional multi-core throughput. * **Enterprise AI Agent Toolkit:** A suite including open models (like Nemotron 3 Ultra), frameworks, tools, and a secure runtime (Open Shell) to enable every company to build and deploy its own AI agents. * **Next-Gen PCs with Microsoft:** A new line of Windows desktops, laptops, and workstations co-developed with Microsoft, featuring the N1X chip and designed to run local AI agents, redefining the personal computer. * **Physical AI Foundation Models:** Introduction of Cosmos 3 for robotics and physical AI, Alpamayo 2 for autonomous driving, and the Isaac GR00T platform—a fully integrated humanoid r...

Organized & Compiled: Deep Tide TechFlow

Guest: Jensen Huang, CEO of NVIDIA

Podcast Source: Bonnie Blockchain

Original Title: 7 Core Points from Jensen Huang's 2026 GTC Taipei Speech, NVIDIA's Latest Strategy Cheat Sheet! 【Bonnie Blockchain】

Broadcast Date: June 2, 2026

Summary of Key Points

In his 2026 GTC Taipei speech, Jensen Huang focused NVIDIA's next-phase strategy on one core judgment: AI has transitioned from generating content into the era of functional agents. Tokens are no longer just technical metrics but units of production for revenue, profit, and GDP. Centered around this shift, NVIDIA introduced Vera Rubin, Vera CPU, an enterprise-grade agent toolkit, new-generation PCs in collaboration with Microsoft, and Cosmos 3, Alpamayo 2, and Isaac GR00T for physical AI. Huang emphasized that the computing paradigm for the next decade will be composed of models, agent frameworks, tool skills, and runtimes, diffusing from the cloud, enterprises, and local PCs to robots, factories, satellites, and edge devices. For Taiwan's supply chain, this means AI factories, power efficiency, infrastructure delivery speed, and full-stack synergy will become the keys to the next wave of industry growth.

Excerpts of Highlights

The Arrival of the AI Agent Era

"Useful AI has arrived; AI is now a profit generator and a GDP generator. Behind it is not just large language models, but a brand-new computing paradigm: agents."
"Agents are composed of large language models and an agent framework. The framework connects memory, tools, reasoning, planning, and action like an operating system."
"The breakthrough in agent systems comes from large language models now being capable of thinking, reasoning, planning, and using tools, as well as from agent frameworks capable of managing memory, coordinating workflows, and dispatching tools."
"Every company will become an agent company; every company will run agents internally, and every company will need its own agent operating system."

Tokens, AI Factories, and Infrastructure Economics

"Tokens are now profitable units of revenue. AI companies wanting to produce more tokens will build more AI factories, which is precisely why Taiwan's computing demand is exploding."
"Computing is revenue, computing is profit. Without revenue and profit, it's a loss."
"If an AI factory has only 1 gigawatt of power, that 1 GW is the limit; under this constraint, throughput per watt is revenue, because every token has value."
"Choosing the wrong architecture just because the chip is cheaper doesn't translate into real returns; you need to ensure revenue per watt. The more you buy, the more you earn."

Vera Rubin and NVIDIA's Infrastructure Transformation

"Vera Rubin is not a chip, nor just a GPU; it's a complete system built end-to-end."
"NVIDIA was a GPU company, then became a systems company, and is now further evolving into an infrastructure company, helping customers build AI factories."
"Vera Rubin is NVIDIA's most ambitious engineering project in history. All 40,000 engineers in the company are involved, and Taiwan's supply chain also participated in creating this system."
"Grace Blackwell was built to handle AI, especially inference; Vera Rubin is built to run agents."

Vera CPU and the Computational Needs of Agents

"All CPUs until now were built for humans; this CPU is built for agents."
"Agents have no patience. They live in a world measured not in seconds, but in nanoseconds. When an agent uses a tool, it wants the response as fast as possible; when it accesses a database, it wants results returned instantly."
"Vera CPU is a CPU built for agents, emphasizing single-threaded performance, instructions per clock, bandwidth per core, and total system bandwidth."
"This market will certainly be larger than the previous one because the number of agents will far exceed humans, and agents are extremely impatient. This is the NVIDIA Vera CPU."

Next-Generation Personal Computers

"The future agent computing paradigm will run on the AI cloud, within enterprises, and on your PC."
"The new operating system will be the traditional OS plus a large language model; in many ways, the large language model is the modern equivalent of DirectX, an intelligent extension of the computer."
"Applications will be replaced by agent runtimes; the modern application will become an agent."
"NVIDIA and Microsoft are reinventing the PC, launching a new generation of Windows machines covering desktops, notebooks, and workstations."

Physical AI, Autonomous Driving, and Robotics

"Language models are trained on data from a human perspective, but robots need to understand the world from the robot's own perspective. The biggest problem for physical AI is data."
"Cosmos 3 is a foundational model at the forefront of physical AI, capable of understanding, reasoning, generating, simulating in closed loops, and even becoming the strategy itself."
"With AI, computing itself will also become data; Cosmos 3 can be used to train more AI models and be enhanced into your own proprietary model."
"Whether it's cloud agents, PC agents, autonomous driving systems, or humanoid robots, the underlying computing pattern is the same: model, framework, tool skills, and runtime."

Jensen Huang Names Taiwanese Snacks as Part of AI Supply Chain

Jensen Huang:

The scale of the Taiwan ecosystem's development today is truly incredible. When most people talk about ecosystems, they first think of our software stack, the developer ecosystem built on top of NVIDIA computing systems. But NVIDIA's ecosystem goes beyond that; it extends all the way up to the Taiwan supply chain, where everything begins, and all the way down to the data center, ultimately reaching end users.

Today, we'll discuss almost every part of this ecosystem. There are so many people to thank. I love the ecosystem here; there are many companies, and many of my favorite ecosystem partners. Taiwan has an incredibly rich ecosystem; it's the best supply chain ecosystem in the world.

The AI Agent Era Has Arrived

Jensen Huang:

Two years ago when I came here, I started talking about how AI would move from generative AI to the next wave, which is agentic AI. Today we can say that agentic AI has arrived, useful AI has arrived.

From an industry perspective, this means demand for tokens is becoming extremely strong. Because if AI can actually do things, people will want to produce more of this capability. Tokens are now profitable units, revenue-generating units. Since it can make money, AI companies will want to build more tokens, generate more tokens, construct more AI factories, which is also the reason for the explosive growth in computing demand in Taiwan.

This is exactly why you're all so busy and your business performance is so good. In fact, it seems reflected in the stock prices of some of your companies. The computing paradigm has changed; everything has changed.

The first key point: Useful AI has arrived; AI is now a profit generator and a GDP generator. Behind it is a brand-new computing paradigm. It's not just large language models, but agents. Almost everything we discuss today will be built on this foundation.

Let me take a moment to explain what I mean. Inside is an agent, an agent application. In the past, this would have been an application, code, an operating system—code within the application running on top of the OS. Today, it's an agent, composed of one or more large language models placed within an agent framework. This framework helps coordinate its work, enabling it to truly accomplish productive tasks.

When input enters the system, the agent must understand, observe, reason, act, and use tools. Tools can be spreadsheets, web browsers, data processing engines, or database engines. Every flow of information, whether processing context, understanding what's happening, reasoning what to do next, or forming an actionable plan, needs to be coordinated by some software.

So, the essence of an agent is such a system. It handles short-term memory, or working memory, and also long-term memory, just like humans. The memory management system thus becomes extremely important. The entire system is called the agent. The large language model is responsible for thinking, and the agent framework connects everything, like an operating system.

This is the new computing paradigm and the reason agents can accomplish amazing tasks. This is a major breakthrough: Large language models are now good at thinking, reasoning, planning, using tools; at the same time, we also have agent frameworks capable of managing memory, coordinating workflows, and invoking tools. Therefore, we can now do many things we couldn't before.

What are Tokens in AI Factories?

Jensen Huang:

Tokens, DSX, GPU, CPU, Vera... We've already built the next-generation system Vera Rubin. Vera Rubin is not a chip, nor just a GPU. It starts with the GPU but goes far beyond it. The entire end-to-end system is Vera Rubin.

It includes the GPU, Vera Rubin NVLink 72, coordinated by the Vera CPU which I'll introduce later. It also includes the revolutionary Vera storage system, CX9, our software stack DOCA, and built-in security processors. All data in the system, whether at rest, in transit, or in use, is encrypted. The entire system is secure because AI models are extremely valuable. This is why the whole system follows confidential computing principles.

Any one of these systems alone could be a full revolution. Vera Rubin is NVIDIA's most ambitious engineering project in history. All 40,000 engineers in the company participated in the work on Vera Rubin, not to mention those of you present who also participated in creating the entire system. Vera Rubin is truly a marvel; it's not just a chip, but a system composed of many components.

It goes even further. Long ago, NVIDIA was a GPU company; over the years, we've evolved into a systems company. What you see now is the most complex system we've ever designed from scratch. But ultimately, our customers and partners don't want to buy computers; they want to build AI factories.

This is why NVIDIA is beginning to transform again. As you can see, many of our technologies have expanded to the full infrastructure scale. Our partners are also at the infrastructure scale: power plants, cooling systems, grid suppliers, and many industrial companies are now part of our ecosystem. In the end, we need to build the full technology stack, just like we built GPUs, Grace Blackwell, NVLink 72; now, we need to build full-stack systems enabling customers to build outstanding AI infrastructure.

Doing this well, helping customers build and deploy AI factories, is extremely important. The reason is simple: Computing is revenue, computing is profit. Without revenue and profit, it's a loss.

Everyone needs to understand one thing: When an AI infrastructure goes online, it can go live quickly, or it can drag on; throughput can be high or low; elasticity and reliability can be good or bad; effective service life can be long or short. Because this represents investments of 50, 60, or even 100 billion dollars, this curve is extremely important.

This is also why NVIDIA is a great partner. We have full integration capabilities, not just making a presentation slide, but actually creating the entire infrastructure, connecting everything, and building at scale ourselves to ensure the system runs well. Therefore, our first token time, first inference time, training startup time are all faster.

Second, our throughput per watt, tokens per watt are world-class. The reason is we integrate everything, design everything from scratch, simulate the entire system, and employ extreme co-design. Just like the Vera Rubin rack shown earlier, everything is designed for incredible throughput.

If your data center, your factory has 1 gigawatt of power, it won't get any more; that's all the generation capacity you get. Under 1 GW of power, throughput per watt is revenue, because every token generates profit, every token is revenue.

This is the future. Computing is revenue; performance per watt is your revenue. Choosing the wrong architecture just because the chip is cheaper doesn't translate into real returns; you need to ensure revenue per watt. The more you buy, the more you earn.

Standing before you now, I can tell you: Vera Rubin is in full production. The supply chain scale we've built for Vera Rubin is twice that of Grace Blackwell. Where assembling a Grace Blackwell rack used to take two hours, now it takes only five minutes. So not only is capacity higher, but production throughput is much faster, and we need all of this to meet demand.

This ecosystem is extraordinary. To support Grace Blackwell and prepare for Vera Rubin's ramp, millions of square feet of capacity have come online. I want to thank you all. Vera Rubin is in full production. Thank you.

Vera Rubin System Introduction

Jensen Huang:

Vera Rubin wasn't built just for AI. Vera Rubin wasn't built just to run AI; it was built to run agents. It's an agentic system. Imagine the complexity. And precisely because of this, agents are the final computer science breakthrough. It took so many years to finally realize their potential and become useful. The computer that can run them should also be the world's most advanced.

This is Vera Rubin. Let's take a look. Please bring Vera Rubin up.

This is Vera Rubin, Vera Rubin NVLink 72. This is part of the next-generation system; at the next GTC, I'll talk more about it; we have a lot to cover today. This is the Vera CPU rack, 256 CPUs, all liquid-cooled. I'll introduce Vera later. This is the Vera BlueField storage processing system, also the security system. And of course, our Mellanox networking, the world's first CPO. This is Vera Rubin, an amazing combination of technologies.

When we built Hopper, it was for pre-training. Pre-training was the most important application then, the most important workload we faced. When building Grace Blackwell, people said: "Jensen, NVIDIA is great at pre-training; inference is simple." Remember? Many said: "Inference is simple; we can do it too."

But you know, inference equals money. Models are very complex; achieving excellence simultaneously in high response speed, fast interaction, and high throughput is very difficult. This is why we created NVLink 72.

Today, NVIDIA's token cost is the lowest in the world. Not just 10% lower, but multiples lower, even orders of magnitude. All because we did extreme co-design, because we understood the computational model and pattern of inference, and created NVLink 72.

With Vera Rubin, things have gone beyond inference. Now it's inference within agentic systems. This is Vera Rubin. No cables, no hoses, no fans. Last time I showed it to you, cables were everywhere.

VERA CPU: The CPU for AI Agents

Jensen Huang:

Vera CPU is a CPU built for the AI era. So far, all CPUs have been built for people. We were users, we were tenants. The way humans use CPUs is living in a world measured in seconds. We rent CPU resources in the cloud; more CPU cores mean more resources to rent. The usage scenarios and economics of old CPUs are completely different from those of agents.

Agents have no patience. They live in a world measured not in seconds, but in nanoseconds. When an agent uses a tool, it wants the response as fast as possible; when it accesses a database, it wants results returned instantly. Every moment an agent waits, it's prevented from moving to the next step, and the next, and the next. Therefore, we must make the CPU as low-latency and interactive as possible.

This is why we created Vera CPU for the AI era. In our system, it has three uses. The first, of course, is for thinking within Vera Rubin. In the Vera Rubin rack, there are already two CPUs. You know, we are manufacturing and selling millions of Vera Rubins, and have already sold millions of Grace Blackwells. NVIDIA is already one of the world's largest CPU manufacturers.

The two CPUs in the Vera Rubin rack: one coordinates and manages the GPUs, manages the KV cache, and handles various software running in the rack. We also have Grace BlueField for security and isolation. The Vera compute portion is for the agent framework, responsible for coordinating AI models, tool usage, and database access.

The data server here is Vera BlueField, the world's fastest storage server and storage system. It's crucial because agents access memory at extremely high speeds. Storage servers and CPUs are now on the critical path of the most expensive part of the data center.

There's a good reason why this is the most expensive. The core economics of an AI factory are tokens, and tokens are created here. So, you naturally want to produce and generate as many tokens as possible. Economic value is concentrated here, and the CPU and storage system must not become bottlenecks.

Therefore, Vera CPU puts a lot of pressure on CPU architecture, which is also why we built a completely new architecture from scratch. This is a CPU the world has never seen before; we call it Vera. This is a CPU built for agents. All CPUs until now were built for humans; this CPU is built for agents.

First, Vera's instructions per clock (IPC) must be extremely strong because we need to reduce latency, reduce processing time. We want single-threaded performance, not just throughput. Single-threaded performance must be world-class, the best. So Vera's IPC is extremely high, among the highest in the world: 10 instructions fetched, decoded, and executed per clock cycle.

Second, the bandwidth the CPU needs for data in and out must be world-class. This includes both per-core bandwidth and total bandwidth. As I said earlier, agentic systems are inherently decoupled and distributed. When computing is decoupled and distributed, networking becomes the issue. Therefore, we must move data as fast as possible between CPU cores, between CPU and storage, and between CPU and GPU.

Bandwidth around the system and inside the CPU cores must be world-class because CPU cores are communicating with each other at extremely high bandwidth. They are not rented out one core at a time; they all collaborate together. Vera's cross-sectional bandwidth is amazing. It's the first system to support PCI Express Gen 6, also first to feature LPDDR5, with bandwidth reaching 1.2 to 2 TB per second, 2 to 3 times that of the highest-performance CPUs.

This is a CPU built for agents. This market will certainly be larger than the previous one because the number of agents will far exceed humans, and agents are extremely impatient. This is the NVIDIA Vera CPU.

The Most Important Computing Paradigm for the Next Decade

Jensen Huang:

This is truly the most important slide. The core conclusion here is: This is the application pattern for the next decade, and also the computing pattern for the next decade. Agents, agent frameworks, and the large language models coordinated by the framework—every company will run this. Every company will become an agent company; every company will have agents running internally; every company will find that agents need their own operating system.

Every company is asking us: How to run agents securely? How to build agents for our workloads? So, we have the NVIDIA Enterprise AI Agent Toolkit. You've actually seen me building it publicly step by step.

Almost everything NVIDIA does, as you know, if you look back at my GTC speeches 5 or 10 years ago, you'll see I've been talking about these things for years because we've been preparing for this moment.

For enterprises to build agents as a service, or agents for operations, they need four things. First, they need models. Of course, the smarter, cheaper, and faster the large language model, the better. Second, they need a framework to coordinate the entire system. Third, these models want to use tools, and these tools come with skills. I just showed the CUDA-X libraries; they will become powerful tools for agents in the future. Fourth, they need a runtime, an operating system that ties everything together.

This is the NVIDIA Agent Toolkit. It includes modifiable models, namely NVIDIA's world-class open-source models. I want to show more. You can run agents from anywhere; you can run powerful agents like Claude Code, or powerful agents like Codex. You can place them within a framework called Open Shell for highly secure operation within the enterprise.

This Shell protects the agent, keeping it always constrained by security policies. Privacy is protected, permissions and privileges are explicitly assigned, identity is protected. Therefore, Open Shell is being adopted globally. NVIDIA Open Shell is open-source; you'll see many companies adopting it, including Red Hat, Canonical, and Microsoft. It will be adopted everywhere.

This is an important runtime, and this runtime is fully optimized for the ubiquitous NVIDIA AI platform. You can run Open Shell on any cloud, on-premises, even on devices. Now, you have tools and libraries agents can use, models you can modify or use directly, and agent frameworks. These agent frameworks can now run on-premises or anywhere else.

One of my favorite agent use cases is chip designers. This is one of NVIDIA's most important jobs. So, of course, we worked with Cadence to build a chip design super-agent. It's coordinated by Codex or Claude Code, taking RTL, architecture diagrams, schematics, or specifications as input, helping you fix what needs fixing. We've built some super-agents together and optimized Nemotron for the NVIDIA runtime.

NVIDIA is committed to building open models for the world, so you, all of us, can create our own agents. Today, we announce Nemotron 3 Ultra, our next-generation open model, and it's very smart. Nemotron models not only give you the model, but also all the data we used to train the model.

Because we have a strong partner alliance, you can see all the partners listed here. We work together, contribute data to each other. Through these great partnerships, everything—from the model to the training scripts to the data—will be fully opened to you. This is the best form of open model, the world's best open model system policy. The goal is simple: You can take everything, add to it, make it better, and make it your own model.

Nemotron 3 Ultra is 5 times faster, costs 30% less, and is fully open. We are very firm on this. This is Nemotron 3, and we are also developing Nemotron 4. It's this complete toolkit of models, frameworks, tool skills, and runtimes that enables every enterprise globally to create their own agents, just like Cadence with its super-agent.

NVIDIA's New Generation Personal Computers

Jensen Huang:

Microsoft and NVIDIA will reinvent the PC. This will become the new PC. Tomorrow night, our tomorrow night here, I'll be with Satya to talk more about the work we've been advancing together over the past three years. Microsoft and NVIDIA have spent so much time completely rethinking how the PC operates, precisely to prepare for this moment.

As I mentioned earlier, this agent computing paradigm will run on the AI cloud, within enterprises, and on your PC. What happens when a PC has an autonomous agent? It helps you, understands you. You can talk to it; it can see you. You can have it read files, help you with research. It can do even more, which I'll show later.

The new operating system, of course, is the old OS plus a large language model. In many ways, the large language model is the modern version of DirectX. It has input and output, understands prompts, understands computer vision, can generate video, can generate sound. It's a modern intelligent extension of the PC, of the computer.

On top of that, as I said earlier, applications will be replaced by agent runtimes, and the modern application is the agent.

Everyone, the NVIDIA RTX Spark laptop. Thank you. I have too many things in my pockets. Okay, this is the world's most amazing chip. This is the N1X we built in collaboration with MediaTek. I think I just saw Rick. This is the N1X, a beautiful chip. Frankly, it's a chip that took 33 years to build.

The reason is, 100% of the NVIDIA software stack can run here. Want to do digital biology? No problem. Want to do seismic processing? No problem. Want to do astrophysics? No problem. Everything related to CUDA, all physics, all biology, all genomics, all AI, no problem. All computer graphics, no problem.

Every application NVIDIA has ever created, and every application Windows has ever run, Microsoft and NVIDIA have meticulously optimized so that this computer can truly run everything the world has ever created. On top of that, it can now run agents. This is an incredible computer; I'm very proud of it.

This computer can have a local Nemotron 3 Ultra model, or a Nemotron 3 super model; it can also connect to cloud-based Claude Code, Codex, or other models; it can also connect to models on the network. It will work and accomplish amazing things. RTX Spark is a reinvention of the laptop, but in fact, Microsoft and NVIDIA are reinventing the entire PC.

Today, we announce a brand-new product line: three revolutionary Windows machines, covering desktop, notebook, and workstation. They are 100% compatible with Windows, 100% support CUDA, 100% equipped with NVIDIA AI Tensor Cores. Everything you've seen running on various NVIDIA platforms globally can run here.

We have a roadmap for this. This is a brand-new product family. For each generation architecture, we'll have desktop, notebook, workstation; the next generation will still have desktop, notebook, workstation. I'm very happy and honored that 100% of the global PC industry has joined us in reinventing the PC. This is a new product line and a new beginning.

Cosmos 3: The Foundational Model for Physical AI

Jensen Huang:

In the context of language models, the English and various languages we train on from the internet are from a human perspective. They are written by us and read by us. However, to create data for AI robots, it must be from the robot's perception and perspective. The vast majority of video data in the world is from a third-person perspective, not first-person.

Therefore, for agentic systems, robotic systems, and physical AI, data is the hardest problem. You've seen us climb this ladder. We started with teleoperation, essentially human demonstration. This is no different from the human feedback breakthrough in reinforcement learning. Then, we used simulation, which is where Omniverse comes into play. This is also analogous to verifiable rewards in reinforcement learning.

We use these systems to bootstrap AI models, bootstrap physical AI models. Eventually, we can learn from a third-person perspective and reproject it to a first-person perspective. Through this bootstrapping process, we end up with a world foundation model that can understand the physical world from any perspective you want. Third-person, first-person, outside-in, inside-out, all possible. This is indeed a major breakthrough.

Today, we announce Cosmos 3. Cosmos 3 is the forefront of physical AI. We are at the forefront in language models; many are researching them. But in physical AI, we are absolutely the strongest globally. I'm immensely proud of the team for achieving this.

This is your foundational model for all your work. Whether you want to create robots, factory robots, or robots working in factories, as long as it involves the physical world, you now have a partner: Cosmos 3. It can understand and reason, can generate, can simulate in closed loops, and can even become the strategy itself. It leads in various global benchmarks. I'm very proud of Cosmos. Today, we announce Cosmos 3.

It used to be data plus computing equals AI. Now we have AI, and computing will also become data. So, using Cosmos 3, train a large batch of AI models. Cosmos is a very excellent open model system, exactly like Nemotron. We open the model, open the data, even open the training methods, so you can enhance it for yourself and turn Cosmos into your proprietary model.

Alpamayo 2: Autonomous Vehicle Inference

Jensen Huang:

Today, we announce Alpamayo 2, an open model for autonomous vehicles. We are collaborating with global automotive companies. Looking at these brands that have joined NVIDIA Hyperion, are building NVIDIA Hyperion cars, they represent about 80% of global car production. That is, these manufacturers cover around 80% of global cars.

There will be a large number of NVIDIA Hyperion systems in the future, capable of running Alpamayo and any other autonomous driving technology stack. We also connect to mobility services. About 97% of global mobility services are connecting with us. Therefore, when we deploy Alpamayo on the Hyperion runtime and Halos operating system, we can connect to these global services.

Isaac GR00T: Humanoid Robots

Jensen Huang:

NVIDIA Isaac GR00T is our humanoid robot technology stack, containing models, data generation, simulation, runtime, and operating system. It represents the GR00T platform, the Isaac GR00T platform.

As you can see, every one of our systems follows the exact same pattern: whether it's cloud-based agentic systems, agentic systems on PCs, robotic systems for autonomous vehicles, or robotic systems for humanoid robots, it's the same pattern.

Of course, in each case, we build everything completely. We do vertical integration, complete integration, employ co-design and extreme co-design, then open it up so everyone can use any part as they need. You want to use something; we'll even help you modify it.

But there's still one thing missing: robotic systems need a reference platform. These robotic systems are too complex, with many motors and sensors, and very fragile. However, we need a way to deliver these reference platforms. Just like we did for PCs, DGX, cloud, and autonomous vehicles, now we must do the same for robots.

Today, we announce NVIDIA Isaac GR00T, a fully integrated humanoid robot reference platform. It has 25 degrees of freedom per hand, 31 degrees of freedom for the robot body, stands 6 feet tall, weighs 150 lbs. Just like me, except the first number is smaller than mine, the second larger, otherwise similar.

This platform runs the new Thor, along with our complete software stack, data generation stack, data simulation stack, and runtime. Everything is integrated into a single robot platform for everyone to use. We built it for higher education and university researchers because building such a platform themselves is too difficult.

Recap and Summary

Jensen Huang:

Over the past six months, the computer industry has been completely transformed. The reason for the change is that agents have finally been realized and have converged with the latest frontier models, enabling AI to now do truly useful work.

This computing pattern will repeat over and over: an agent composed of models and a framework, using tools with skills, and running on a certain runtime. The runtime depends on whether it's in the cloud, on-premises enterprise environment, PC, or robot. But the computing pattern is exactly the same.

You will use different frameworks based on your preference, and different models based on your preference. You will improve them for your proprietary uses. You will create super-agents, rent them to others, help others accomplish work. This agentic platform, this agentic pattern, is precisely what the NVIDIA Enterprise AI Toolkit aims to support. For you, this is a great way to participate in AI; for us, it's also a huge growth opportunity.

Vera Rubin is in full production. Grace Blackwell was built to handle AI, especially inference; Vera Rubin was built to run agents. It is in full production. It is far more than just a GPU; it's an entire decoupled, distributed agent processing system.

NVIDIA has truly become an infrastructure company. Not just a GPU company, not just a systems company, but an infrastructure company. Our goal is to help you create maximum revenue, maximum profit, and do so as quickly as possible.

In the world of agents, this new way of computing means CPUs must also be built for agents, not for people. CPUs built for agents have their own special requirements. Our NVIDIA Vera is a revolution. I'm happy to see its ramp and order status; it will be the fastest, most successful product launch in NVIDIA's history.

NVIDIA and Microsoft have created a brand-new PC product line. This is a new beginning. Of course, the same agentic processing pattern, agentic computing pattern I just described, will also run on various devices. I mentioned PCs, but in the future, it will appear in robots, satellites, base stations, factories, cloud, on-premises, edge devices. This agentic AI system, agentic computing pattern, will be replicated in all kinds of computers. Our understanding of the personal computer will likely change.

你可能也喜歡

a16z 全球化转向：VC 正在成为美国科技联盟的「推手」

a16z（Andreessen Horowitz）发布公告，宣布其全球化战略发生重要转向：不再局限于海外寻找项目和投资，而是将自身定位融入更大的技术竞争与国际盟友合作框架中。面对AI、机器人、国防科技等成为国家竞争焦点的领域，创业公司面临复杂的国际监管、产业政策和地缘关系。a16z通过设立东京办公室、任命Anne Neuberger负责全球事务、将投资者关系团队升级为全球合作伙伴团队等举措，主动应对这一变化。公告明确将a16z的全球网络与“美国及其盟友”的技术领导力绑定，标志着技术创新已进入国家安全和国际竞争语境。未来，风投的角色不仅是提供资本和增长建议，更要帮助创始人对接关键市场、政府机构和战略资源，理解多国政策环境。a16z旨在成为连接创业公司、国家能力、产业资源和全球资本的组织者，支持盟友国家在关键创新领域的合作，并助力投资组合公司进行全球扩张。这一布局体现了硅谷资本对全球科技竞争新格局的主动站位。

marsbit10 分鐘前

marsbit10 分鐘前

解读Agent商业、支付与基础设施的真相

作者基于一年来为Agent经济构建基础设施的经验，指出当前Agent商业尚未形成真实、规模化的市场需求，初创公司面临结构性挑战。文章分析了四个关键场景： 1. **Agent对商户**：目前电商体验中，聊天界面在视觉比价购物上逊于传统界面，商户接入多出于防御性“优化”心态。对话式商业在如外卖等高頻、低决策场景有潜力，但受限于平台开放性和成本。 2. **Agent对API**：开发者现有支付方式（如预付）已能处理低频、小额的API调用成本问题。真正的机会在于服务长尾、小众的供应商市场，但规模有限。 3. **Agent对Agent**：这是长期的愿景，涉及机器间的自动交易与结算，需求真实但当前市场几乎为零，需要专用的基础设施。 4. **Agent对金融**：这是唯一存在现成需求和付费客户的领域。将AI嵌入金融工作流是自然演进，但竞争激烈，老牌机构优势明显。文章认为，行业巨头因资金充足和战略防御而持续投入，但对初创公司而言，真正的机会并非单纯构建支付层。支付只是更宏大问题——**Agent与人类的协同工作、验证与结算**——的一部分。未来，解决协同问题的公司将主导市场，而非支付服务商。作者团队已转向一个存在真实需求、快速增长且未被充分服务的领域。

marsbit16 分鐘前

marsbit16 分鐘前

Kalshi、MTS 与 a16z 的野望

本文探讨了预测市场在2025年成为投资、加密和媒体领域共同关注焦点的现象，并着重分析了其精神内核的演变及其与风投机构a16z所倡导的“新媒体”愿景的契合。文章首先回顾了预测市场的思想渊源：从哈耶克关于市场作为分散知识协调机制的观点，到罗宾·汉森设计对数市场评分规则（LMSR）以激励信息真实披露，乃至衍生出的“未来统治”（Futarchy）治理乌托邦构想。然而，作者指出，a16z在2024-2025年投资估值飙升的预测市场平台Kalshi，为此领域注入了新的精神内涵——“在场感”。在人们与现实世界日益疏离的后现代语境下，预测市场提供了一种通过真金白银下注来介入和“预测”未来的方式，使用户从被动观察者转变为主动的“超级观察者”，从而对抗不确定性与无力感。当足够多人使用并依赖这种媒介时，市场本身将对事件的真实性与重要性获得解释权，这正是a16z构建新媒体帝国的关键拼图。最后，文章以媒体公司MTS为例，说明a16z的“新媒体”是一种全频段、高烈度的信息工程，旨在“接管时间线”。而Kalshi的核心价值在于，它通过真实的交易数据构建了一种强大的“现实扭曲力场”，其显示的市场概率能深刻影响公众认知与判断，这种赋予私营公司的社会影响力是其获得高估值的根本原因。

链捕手17 分鐘前

链捕手17 分鐘前

一周代币解锁：HOME解锁超流通量20%代币

本周代币解锁重点关注项目DeFi.app（代币HOME），其将释放7.9亿枚代币，价值约4126万美元，解锁量超过其流通供应量的20%。 DeFi.app旨在简化去中心化金融操作，为用户提供跨链购买代币、一键式跨链交易、访问去中心化衍生品市场以及获取收益机会等功能，并宣称无需支付Gas费或使用跨链桥。项目相关信息可通过其官网与推特获取。文内附有其代币释放曲线图以供参考。

marsbit3 小時前

marsbit3 小時前

美股遭遇2025年来最惨烈暴跌，三大导火索引爆科技股估值重估

2025年6月5日，美股遭遇当年最惨烈暴跌，纳斯达克指数重挫4.18%，标普500跌2.64%。此前市场连续上涨，此次逆转由三条导火索同时引爆。首先，博通发布财报，其AI芯片收入虽大幅增长，但下一季度展望不及预期，暗示AI增速可能放缓，引发市场对半导体板块估值重估的担忧，芯片股集体暴跌。其次，强劲的非农就业数据超出预期，叠加前值上修，令市场担忧美联储可能不仅不降息，甚至转向加息。这直接压缩了高估值科技股的未来现金流价值，并促使资金轮动。第三条是持续的地缘政治风险。伊朗战争导致的霍尔木兹海峡封锁推高油价，通胀阴影挥之不去，使美联储政策面临两难，加剧了市场对流动性收紧的忧虑。三者叠加，攻击了“AI无限增长”、“美联储降息”和“通胀受控”三大市场叙事，引发全球风险资产抛售。此次事件被视为一次“估值重定价”而非“AI叙事崩塌”，市场开始更冷静地区分AI浪潮中的真正赢家。未来走势将取决于美联储政策立场、更多AI公司财报及地缘局势发展。

marsbit4 小時前

marsbit4 小時前

交易

現貨

合約

熱門文章

什麼是 $S$

理解 SPERO：全面概述 SPERO 簡介隨著創新領域的不斷演變，web3 技術和加密貨幣項目的出現在塑造數字未來中扮演著關鍵角色。在這個動態領域中，SPERO（標記為 SPERO,$$s$）是一個引起關注的項目。本文旨在收集並呈現有關 SPERO 的詳細信息，以幫助愛好者和投資者理解其基礎、目標和在 web3 和加密領域內的創新。 SPERO,$$s$ 是什麼？ SPERO,$$s$ 是加密空間中的一個獨特項目，旨在利用去中心化和區塊鏈技術的原則，創建一個促進參與、實用性和金融包容性的生態系統。該項目旨在以新的方式促進點對點互動，為用戶提供創新的金融解決方案和服務。 SPERO,$$s$ 的核心目標是通過提供增強用戶體驗的工具和平台來賦能個人。這包括使交易方式更加靈活、促進社區驅動的倡議，以及通過去中心化應用程序（dApps）創造金融機會的途徑。SPERO,$$s$ 的基本願景圍繞包容性展開，旨在彌合傳統金融中的差距，同時利用區塊鏈技術的優勢。誰是 SPERO,$$s$ 的創建者？ SPERO,$$s$ 的創建者身份仍然有些模糊，因為公開可用的資源對其創始人提供的詳細背景信息有限。這種缺乏透明度可能源於該項目對去中心化的承諾——這是一種許多 web3 項目所共享的精神，優先考慮集體貢獻而非個人認可。通過將討論重心放在社區及其共同目標上，SPERO,$$s$ 體現了賦能的本質，而不特別突出某些個體。因此，理解 SPERO 的精神和使命比識別單一創建者更為重要。誰是 SPERO,$$s$ 的投資者？ SPERO,$$s$ 得到了來自風險投資家到天使投資者的多樣化投資者的支持，他們致力於促進加密領域的創新。這些投資者的關注點通常與 SPERO 的使命一致——優先考慮那些承諾社會技術進步、金融包容性和去中心化治理的項目。這些投資者通常對不僅提供創新產品，還對區塊鏈社區及其生態系統做出積極貢獻的項目感興趣。這些投資者的支持強化了 SPERO,$$s$ 作為快速發展的加密項目領域中的一個重要競爭者。 SPERO,$$s$ 如何運作？ SPERO,$$s$ 採用多面向的框架，使其與傳統的加密貨幣項目區別開來。以下是一些突顯其獨特性和創新的關鍵特徵：去中心化治理：SPERO,$$s$ 整合了去中心化治理模型，賦予用戶積極參與決策過程的權力，關於項目的未來。這種方法促進了社區成員之間的擁有感和責任感。代幣實用性：SPERO,$$s$ 使用其自己的加密貨幣代幣，旨在在生態系統內部提供多種功能。這些代幣使交易、獎勵和平台上提供的服務得以促進，增強了整體參與度和實用性。分層架構：SPERO,$$s$ 的技術架構支持模塊化和可擴展性，允許在項目發展過程中無縫整合額外的功能和應用。這種適應性對於在不斷變化的加密環境中保持相關性至關重要。社區參與：該項目強調社區驅動的倡議，採用激勵合作和反饋的機制。通過培養強大的社區，SPERO,$$s$ 能夠更好地滿足用戶需求並適應市場趨勢。專注於包容性：通過提供低交易費用和用戶友好的界面，SPERO,$$s$ 旨在吸引多樣化的用戶群體，包括那些以前可能未曾參與加密領域的個體。這種對包容性的承諾與其通過可及性賦能的總體使命相一致。 SPERO,$$s$ 的時間線理解一個項目的歷史提供了對其發展軌跡和里程碑的關鍵見解。以下是建議的時間線，映射 SPERO,$$s$ 演變中的重要事件：概念化和構思階段：形成 SPERO,$$s$ 基礎的初步想法被提出，與區塊鏈行業內的去中心化和社區聚焦原則密切相關。項目白皮書的發布：在概念階段之後，發布了一份全面的白皮書，詳細說明了 SPERO,$$s$ 的願景、目標和技術基礎設施，以吸引社區的興趣和反饋。社區建設和早期參與：積極進行外展工作，建立早期採用者和潛在投資者的社區，促進圍繞項目目標的討論並獲得支持。代幣生成事件：SPERO,$$s$ 進行了一次代幣生成事件（TGE），向早期支持者分發其原生代幣，並在生態系統內建立初步流動性。首次 dApp 上線：與 SPERO,$$s$ 相關的第一個去中心化應用程序（dApp）上線，允許用戶參與平台的核心功能。持續發展和夥伴關係：對項目產品的持續更新和增強，包括與區塊鏈領域其他參與者的戰略夥伴關係，使 SPERO,$$s$ 成為加密市場中一個具有競爭力和不斷演變的參與者。結論 SPERO,$$s$ 是 web3 和加密貨幣潛力的見證，能夠徹底改變金融系統並賦能個人。憑藉對去中心化治理、社區參與和創新設計功能的承諾，它為更具包容性的金融環境鋪平了道路。與任何在快速發展的加密領域中的投資一樣，潛在的投資者和用戶都被鼓勵進行徹底研究，並對 SPERO,$$s$ 的持續發展進行深思熟慮的參與。該項目展示了加密行業的創新精神，邀請人們進一步探索其無數可能性。儘管 SPERO,$$s$ 的旅程仍在展開，但其基礎原則確實可能影響我們在互聯網數字生態系統中如何與技術、金融和彼此互動的未來。

85 人學過發佈於 2024.12.17更新於 2024.12.17

什麼是 AGENT S

Agent S：Web3中自主互動的未來介紹在不斷演變的Web3和加密貨幣領域，創新不斷重新定義個人如何與數字平台互動。Agent S是一個開創性的項目，承諾通過其開放的代理框架徹底改變人機互動。Agent S旨在簡化複雜任務，為人工智能（AI）提供變革性的應用，鋪平自主互動的道路。本詳細探索將深入研究該項目的複雜性、其獨特特徵以及對加密貨幣領域的影響。什麼是Agent S？ Agent S是一個突破性的開放代理框架，專門設計用來解決計算機任務自動化中的三個基本挑戰：獲取特定領域知識：該框架智能地從各種外部知識來源和內部經驗中學習。這種雙重方法使其能夠建立豐富的特定領域知識庫，提升其在任務執行中的表現。長期任務規劃：Agent S採用經驗增強的分層規劃，這是一種戰略方法，可以有效地分解和執行複雜任務。此特徵顯著提升了其高效和有效地管理多個子任務的能力。處理動態、不均勻的界面：該項目引入了代理-計算機界面（ACI），這是一種創新的解決方案，增強了代理和用戶之間的互動。利用多模態大型語言模型（MLLMs），Agent S能夠無縫導航和操作各種圖形用戶界面。通過這些開創性特徵，Agent S提供了一個強大的框架，解決了自動化人機互動中涉及的複雜性，為AI及其他領域的無數應用奠定了基礎。誰是Agent S的創建者？儘管Agent S的概念根本上是創新的，但有關其創建者的具體信息仍然難以捉摸。創建者目前尚不清楚，這突顯了該項目的初期階段或戰略選擇將創始成員保密。無論是否匿名，重點仍然在於框架的能力和潛力。誰是Agent S的投資者？由於Agent S在加密生態系統中相對較新，關於其投資者和財務支持者的詳細信息並未明確記錄。缺乏對支持該項目的投資基礎或組織的公開見解，引發了對其資金結構和發展路線圖的質疑。了解其支持背景對於評估該項目的可持續性和潛在市場影響至關重要。 Agent S如何運作？ Agent S的核心是尖端技術，使其能夠在多種環境中有效運作。其運營模型圍繞幾個關鍵特徵構建：類人計算機互動：該框架提供先進的AI規劃，力求使與計算機的互動更加直觀。通過模仿人類在任務執行中的行為，承諾提升用戶體驗。敘事記憶：用於利用高級經驗，Agent S利用敘事記憶來跟蹤任務歷史，從而增強其決策過程。情節記憶：此特徵為用戶提供逐步指導，使框架能夠在任務展開時提供上下文支持。支持OpenACI：Agent S能夠在本地運行，使用戶能夠控制其互動和工作流程，與Web3的去中心化理念相一致。與外部API的輕鬆集成：其多功能性和與各種AI平台的兼容性確保了Agent S能夠無縫融入現有技術生態系統，成為開發者和組織的理想選擇。這些功能共同促成了Agent S在加密領域的獨特地位，因為它以最小的人類干預自動化複雜的多步任務。隨著項目的發展，其在Web3中的潛在應用可能重新定義數字互動的展開方式。 Agent S的時間線 Agent S的發展和里程碑可以用一個時間線來概括，突顯其重要事件： 2024年9月27日：Agent S的概念在一篇名為《一個像人類一樣使用計算機的開放代理框架》的綜合研究論文中推出，展示了該項目的基礎工作。 2024年10月10日：該研究論文在arXiv上公開，提供了對框架及其基於OSWorld基準的性能評估的深入探索。 2024年10月12日：發布了一個視頻演示，提供了對Agent S能力和特徵的視覺洞察，進一步吸引潛在用戶和投資者。這些時間線上的標記不僅展示了Agent S的進展，還表明了其對透明度和社區參與的承諾。有關Agent S的要點隨著Agent S框架的持續演變，幾個關鍵特徵脫穎而出，強調其創新性和潛力：創新框架：旨在提供類似人類互動的直觀計算機使用，Agent S為任務自動化帶來了新穎的方法。自主互動：通過GUI自主與計算機互動的能力標誌著向更智能和高效的計算解決方案邁進了一步。複雜任務自動化：憑藉其強大的方法論，能夠自動化複雜的多步任務，使過程更快且更少出錯。持續改進：學習機制使Agent S能夠從過去的經驗中改進，不斷提升其性能和效率。多功能性：其在OSWorld和WindowsAgentArena等不同操作環境中的適應性確保了它能夠服務於廣泛的應用。隨著Agent S在Web3和加密領域中的定位，其增強互動能力和自動化過程的潛力標誌著AI技術的一次重大進步。通過其創新框架，Agent S展現了數字互動的未來，為各行各業的用戶承諾提供更無縫和高效的體驗。結論 Agent S代表了AI與Web3結合的一次大膽飛躍，具有重新定義我們與技術互動方式的能力。儘管仍處於早期階段，但其應用的可能性廣泛且引人入勝。通過其全面的框架解決關鍵挑戰，Agent S旨在將自主互動帶到數字體驗的最前沿。隨著我們深入加密貨幣和去中心化的領域，像Agent S這樣的項目無疑將在塑造技術和人機協作的未來中發揮關鍵作用。

823 人學過發佈於 2025.01.14更新於 2025.01.14

如何購買S

歡迎來到HTX.com！在這裡，購買Sonic (S)變得簡單而便捷。跟隨我們的逐步指南，放心開始您的加密貨幣之旅。第一步：創建您的HTX帳戶使用您的 Email、手機號碼在HTX註冊一個免費帳戶。體驗無憂的註冊過程並解鎖所有平台功能。立即註冊第二步：前往買幣頁面，選擇您的支付方式信用卡/金融卡購買：使用您的Visa或Mastercard即時購買Sonic (S)。餘額購買：使用您HTX帳戶餘額中的資金進行無縫交易。第三方購買：探索諸如Google Pay或Apple Pay等流行支付方式以增加便利性。C2C購買：在HTX平台上直接與其他用戶交易。HTX 場外交易 (OTC) 購買：為大量交易者提供個性化服務和競爭性匯率。第三步：存儲您的Sonic (S)購買Sonic (S)後，將其存儲在您的HTX帳戶中。您也可以透過區塊鏈轉帳將其發送到其他地址或者用於交易其他加密貨幣。第四步：交易Sonic (S)在HTX的現貨市場輕鬆交易Sonic (S)。前往您的帳戶，選擇交易對，執行交易，並即時監控。HTX為初學者和經驗豐富的交易者提供了友好的用戶體驗。

1.7k 人學過發佈於 2025.01.15更新於 2026.06.02

Jensen Huang's 2026 GTC Taipei Speech: The Era of AI Agents is Here, Computing is Revenue

文章摘要

Summary of Key Points

Excerpts of Highlights

The Arrival of the AI Agent Era

Tokens, AI Factories, and Infrastructure Economics

Vera Rubin and NVIDIA's Infrastructure Transformation

Vera CPU and the Computational Needs of Agents

Next-Generation Personal Computers

Physical AI, Autonomous Driving, and Robotics

Jensen Huang Names Taiwanese Snacks as Part of AI Supply Chain

The AI Agent Era Has Arrived

What are Tokens in AI Factories?

Vera Rubin System Introduction

VERA CPU: The CPU for AI Agents

The Most Important Computing Paradigm for the Next Decade

NVIDIA's New Generation Personal Computers

Cosmos 3: The Foundational Model for Physical AI

Alpamayo 2: Autonomous Vehicle Inference

Isaac GR00T: Humanoid Robots

Recap and Summary

相關問答

你可能也喜歡

a16z 全球化转向：VC 正在成为美国科技联盟的「推手」

解读Agent商业、支付与基础设施的真相

Kalshi、MTS 与 a16z 的野望

一周代币解锁：HOME解锁超流通量20%代币

美股遭遇2025年来最惨烈暴跌，三大导火索引爆科技股估值重估

交易

熱門文章

什麼是 $S$

什麼是 AGENT S

如何購買S

相關討論

熱門問答

熱門分類

熱門標籤