Jensen Huang's 2026 GTC Taipei Speech: The Era of AI Agents is Here, Computing is Revenue

marsbitPublished on 2026-06-03Last updated on 2026-06-03

Abstract

NVIDIA CEO Jensen Huang's 2026 GTC Taipei speech announces the arrival of the "Agent AI" era, where AI transitions from content generation to performing useful work. Huang positions tokens as units of profit and GDP, driving massive demand for computing power and "AI factories." NVIDIA's strategy revolves around a new computing paradigm centered on AI agents, which combine large language models (LLMs) with agent frameworks for planning, memory, and tool use. Key announcements include: * **Vera Rubin:** A complete, end-to-end system (not just a GPU) designed from the ground up to run AI agents at scale, representing NVIDIA's evolution into an infrastructure company. * **Vera CPU:** A revolutionary CPU architecture built specifically for impatient AI agents, prioritizing low latency, single-thread performance, and massive bandwidth over traditional multi-core throughput. * **Enterprise AI Agent Toolkit:** A suite including open models (like Nemotron 3 Ultra), frameworks, tools, and a secure runtime (Open Shell) to enable every company to build and deploy its own AI agents. * **Next-Gen PCs with Microsoft:** A new line of Windows desktops, laptops, and workstations co-developed with Microsoft, featuring the N1X chip and designed to run local AI agents, redefining the personal computer. * **Physical AI Foundation Models:** Introduction of Cosmos 3 for robotics and physical AI, Alpamayo 2 for autonomous driving, and the Isaac GR00T platform—a fully integrated humanoid r...

Organized & Compiled: Deep Tide TechFlow

Guest: Jensen Huang, CEO of NVIDIA

Podcast Source: Bonnie Blockchain

Original Title: 7 Core Points from Jensen Huang's 2026 GTC Taipei Speech, NVIDIA's Latest Strategy Cheat Sheet! 【Bonnie Blockchain】

Broadcast Date: June 2, 2026

Summary of Key Points

In his 2026 GTC Taipei speech, Jensen Huang focused NVIDIA's next-phase strategy on one core judgment: AI has transitioned from generating content into the era of functional agents. Tokens are no longer just technical metrics but units of production for revenue, profit, and GDP. Centered around this shift, NVIDIA introduced Vera Rubin, Vera CPU, an enterprise-grade agent toolkit, new-generation PCs in collaboration with Microsoft, and Cosmos 3, Alpamayo 2, and Isaac GR00T for physical AI. Huang emphasized that the computing paradigm for the next decade will be composed of models, agent frameworks, tool skills, and runtimes, diffusing from the cloud, enterprises, and local PCs to robots, factories, satellites, and edge devices. For Taiwan's supply chain, this means AI factories, power efficiency, infrastructure delivery speed, and full-stack synergy will become the keys to the next wave of industry growth.

Excerpts of Highlights

The Arrival of the AI Agent Era

  • "Useful AI has arrived; AI is now a profit generator and a GDP generator. Behind it is not just large language models, but a brand-new computing paradigm: agents."
  • "Agents are composed of large language models and an agent framework. The framework connects memory, tools, reasoning, planning, and action like an operating system."
  • "The breakthrough in agent systems comes from large language models now being capable of thinking, reasoning, planning, and using tools, as well as from agent frameworks capable of managing memory, coordinating workflows, and dispatching tools."
  • "Every company will become an agent company; every company will run agents internally, and every company will need its own agent operating system."

Tokens, AI Factories, and Infrastructure Economics

  • "Tokens are now profitable units of revenue. AI companies wanting to produce more tokens will build more AI factories, which is precisely why Taiwan's computing demand is exploding."
  • "Computing is revenue, computing is profit. Without revenue and profit, it's a loss."
  • "If an AI factory has only 1 gigawatt of power, that 1 GW is the limit; under this constraint, throughput per watt is revenue, because every token has value."
  • "Choosing the wrong architecture just because the chip is cheaper doesn't translate into real returns; you need to ensure revenue per watt. The more you buy, the more you earn."

Vera Rubin and NVIDIA's Infrastructure Transformation

  • "Vera Rubin is not a chip, nor just a GPU; it's a complete system built end-to-end."
  • "NVIDIA was a GPU company, then became a systems company, and is now further evolving into an infrastructure company, helping customers build AI factories."
  • "Vera Rubin is NVIDIA's most ambitious engineering project in history. All 40,000 engineers in the company are involved, and Taiwan's supply chain also participated in creating this system."
  • "Grace Blackwell was built to handle AI, especially inference; Vera Rubin is built to run agents."

Vera CPU and the Computational Needs of Agents

  • "All CPUs until now were built for humans; this CPU is built for agents."
  • "Agents have no patience. They live in a world measured not in seconds, but in nanoseconds. When an agent uses a tool, it wants the response as fast as possible; when it accesses a database, it wants results returned instantly."
  • "Vera CPU is a CPU built for agents, emphasizing single-threaded performance, instructions per clock, bandwidth per core, and total system bandwidth."
  • "This market will certainly be larger than the previous one because the number of agents will far exceed humans, and agents are extremely impatient. This is the NVIDIA Vera CPU."

Next-Generation Personal Computers

  • "The future agent computing paradigm will run on the AI cloud, within enterprises, and on your PC."
  • "The new operating system will be the traditional OS plus a large language model; in many ways, the large language model is the modern equivalent of DirectX, an intelligent extension of the computer."
  • "Applications will be replaced by agent runtimes; the modern application will become an agent."
  • "NVIDIA and Microsoft are reinventing the PC, launching a new generation of Windows machines covering desktops, notebooks, and workstations."

Physical AI, Autonomous Driving, and Robotics

  • "Language models are trained on data from a human perspective, but robots need to understand the world from the robot's own perspective. The biggest problem for physical AI is data."
  • "Cosmos 3 is a foundational model at the forefront of physical AI, capable of understanding, reasoning, generating, simulating in closed loops, and even becoming the strategy itself."
  • "With AI, computing itself will also become data; Cosmos 3 can be used to train more AI models and be enhanced into your own proprietary model."
  • "Whether it's cloud agents, PC agents, autonomous driving systems, or humanoid robots, the underlying computing pattern is the same: model, framework, tool skills, and runtime."

Jensen Huang Names Taiwanese Snacks as Part of AI Supply Chain

Jensen Huang:

The scale of the Taiwan ecosystem's development today is truly incredible. When most people talk about ecosystems, they first think of our software stack, the developer ecosystem built on top of NVIDIA computing systems. But NVIDIA's ecosystem goes beyond that; it extends all the way up to the Taiwan supply chain, where everything begins, and all the way down to the data center, ultimately reaching end users.

Today, we'll discuss almost every part of this ecosystem. There are so many people to thank. I love the ecosystem here; there are many companies, and many of my favorite ecosystem partners. Taiwan has an incredibly rich ecosystem; it's the best supply chain ecosystem in the world.

The AI Agent Era Has Arrived

Jensen Huang:

Two years ago when I came here, I started talking about how AI would move from generative AI to the next wave, which is agentic AI. Today we can say that agentic AI has arrived, useful AI has arrived.

From an industry perspective, this means demand for tokens is becoming extremely strong. Because if AI can actually do things, people will want to produce more of this capability. Tokens are now profitable units, revenue-generating units. Since it can make money, AI companies will want to build more tokens, generate more tokens, construct more AI factories, which is also the reason for the explosive growth in computing demand in Taiwan.

This is exactly why you're all so busy and your business performance is so good. In fact, it seems reflected in the stock prices of some of your companies. The computing paradigm has changed; everything has changed.

The first key point: Useful AI has arrived; AI is now a profit generator and a GDP generator. Behind it is a brand-new computing paradigm. It's not just large language models, but agents. Almost everything we discuss today will be built on this foundation.

Let me take a moment to explain what I mean. Inside is an agent, an agent application. In the past, this would have been an application, code, an operating system—code within the application running on top of the OS. Today, it's an agent, composed of one or more large language models placed within an agent framework. This framework helps coordinate its work, enabling it to truly accomplish productive tasks.

When input enters the system, the agent must understand, observe, reason, act, and use tools. Tools can be spreadsheets, web browsers, data processing engines, or database engines. Every flow of information, whether processing context, understanding what's happening, reasoning what to do next, or forming an actionable plan, needs to be coordinated by some software.

So, the essence of an agent is such a system. It handles short-term memory, or working memory, and also long-term memory, just like humans. The memory management system thus becomes extremely important. The entire system is called the agent. The large language model is responsible for thinking, and the agent framework connects everything, like an operating system.

This is the new computing paradigm and the reason agents can accomplish amazing tasks. This is a major breakthrough: Large language models are now good at thinking, reasoning, planning, using tools; at the same time, we also have agent frameworks capable of managing memory, coordinating workflows, and invoking tools. Therefore, we can now do many things we couldn't before.

What are Tokens in AI Factories?

Jensen Huang:

Tokens, DSX, GPU, CPU, Vera... We've already built the next-generation system Vera Rubin. Vera Rubin is not a chip, nor just a GPU. It starts with the GPU but goes far beyond it. The entire end-to-end system is Vera Rubin.

It includes the GPU, Vera Rubin NVLink 72, coordinated by the Vera CPU which I'll introduce later. It also includes the revolutionary Vera storage system, CX9, our software stack DOCA, and built-in security processors. All data in the system, whether at rest, in transit, or in use, is encrypted. The entire system is secure because AI models are extremely valuable. This is why the whole system follows confidential computing principles.

Any one of these systems alone could be a full revolution. Vera Rubin is NVIDIA's most ambitious engineering project in history. All 40,000 engineers in the company participated in the work on Vera Rubin, not to mention those of you present who also participated in creating the entire system. Vera Rubin is truly a marvel; it's not just a chip, but a system composed of many components.

It goes even further. Long ago, NVIDIA was a GPU company; over the years, we've evolved into a systems company. What you see now is the most complex system we've ever designed from scratch. But ultimately, our customers and partners don't want to buy computers; they want to build AI factories.

This is why NVIDIA is beginning to transform again. As you can see, many of our technologies have expanded to the full infrastructure scale. Our partners are also at the infrastructure scale: power plants, cooling systems, grid suppliers, and many industrial companies are now part of our ecosystem. In the end, we need to build the full technology stack, just like we built GPUs, Grace Blackwell, NVLink 72; now, we need to build full-stack systems enabling customers to build outstanding AI infrastructure.

Doing this well, helping customers build and deploy AI factories, is extremely important. The reason is simple: Computing is revenue, computing is profit. Without revenue and profit, it's a loss.

Everyone needs to understand one thing: When an AI infrastructure goes online, it can go live quickly, or it can drag on; throughput can be high or low; elasticity and reliability can be good or bad; effective service life can be long or short. Because this represents investments of 50, 60, or even 100 billion dollars, this curve is extremely important.

This is also why NVIDIA is a great partner. We have full integration capabilities, not just making a presentation slide, but actually creating the entire infrastructure, connecting everything, and building at scale ourselves to ensure the system runs well. Therefore, our first token time, first inference time, training startup time are all faster.

Second, our throughput per watt, tokens per watt are world-class. The reason is we integrate everything, design everything from scratch, simulate the entire system, and employ extreme co-design. Just like the Vera Rubin rack shown earlier, everything is designed for incredible throughput.

If your data center, your factory has 1 gigawatt of power, it won't get any more; that's all the generation capacity you get. Under 1 GW of power, throughput per watt is revenue, because every token generates profit, every token is revenue.

This is the future. Computing is revenue; performance per watt is your revenue. Choosing the wrong architecture just because the chip is cheaper doesn't translate into real returns; you need to ensure revenue per watt. The more you buy, the more you earn.

Standing before you now, I can tell you: Vera Rubin is in full production. The supply chain scale we've built for Vera Rubin is twice that of Grace Blackwell. Where assembling a Grace Blackwell rack used to take two hours, now it takes only five minutes. So not only is capacity higher, but production throughput is much faster, and we need all of this to meet demand.

This ecosystem is extraordinary. To support Grace Blackwell and prepare for Vera Rubin's ramp, millions of square feet of capacity have come online. I want to thank you all. Vera Rubin is in full production. Thank you.

Vera Rubin System Introduction

Jensen Huang:

Vera Rubin wasn't built just for AI. Vera Rubin wasn't built just to run AI; it was built to run agents. It's an agentic system. Imagine the complexity. And precisely because of this, agents are the final computer science breakthrough. It took so many years to finally realize their potential and become useful. The computer that can run them should also be the world's most advanced.

This is Vera Rubin. Let's take a look. Please bring Vera Rubin up.

This is Vera Rubin, Vera Rubin NVLink 72. This is part of the next-generation system; at the next GTC, I'll talk more about it; we have a lot to cover today. This is the Vera CPU rack, 256 CPUs, all liquid-cooled. I'll introduce Vera later. This is the Vera BlueField storage processing system, also the security system. And of course, our Mellanox networking, the world's first CPO. This is Vera Rubin, an amazing combination of technologies.

When we built Hopper, it was for pre-training. Pre-training was the most important application then, the most important workload we faced. When building Grace Blackwell, people said: "Jensen, NVIDIA is great at pre-training; inference is simple." Remember? Many said: "Inference is simple; we can do it too."

But you know, inference equals money. Models are very complex; achieving excellence simultaneously in high response speed, fast interaction, and high throughput is very difficult. This is why we created NVLink 72.

Today, NVIDIA's token cost is the lowest in the world. Not just 10% lower, but multiples lower, even orders of magnitude. All because we did extreme co-design, because we understood the computational model and pattern of inference, and created NVLink 72.

With Vera Rubin, things have gone beyond inference. Now it's inference within agentic systems. This is Vera Rubin. No cables, no hoses, no fans. Last time I showed it to you, cables were everywhere.

VERA CPU: The CPU for AI Agents

Jensen Huang:

Vera CPU is a CPU built for the AI era. So far, all CPUs have been built for people. We were users, we were tenants. The way humans use CPUs is living in a world measured in seconds. We rent CPU resources in the cloud; more CPU cores mean more resources to rent. The usage scenarios and economics of old CPUs are completely different from those of agents.

Agents have no patience. They live in a world measured not in seconds, but in nanoseconds. When an agent uses a tool, it wants the response as fast as possible; when it accesses a database, it wants results returned instantly. Every moment an agent waits, it's prevented from moving to the next step, and the next, and the next. Therefore, we must make the CPU as low-latency and interactive as possible.

This is why we created Vera CPU for the AI era. In our system, it has three uses. The first, of course, is for thinking within Vera Rubin. In the Vera Rubin rack, there are already two CPUs. You know, we are manufacturing and selling millions of Vera Rubins, and have already sold millions of Grace Blackwells. NVIDIA is already one of the world's largest CPU manufacturers.

The two CPUs in the Vera Rubin rack: one coordinates and manages the GPUs, manages the KV cache, and handles various software running in the rack. We also have Grace BlueField for security and isolation. The Vera compute portion is for the agent framework, responsible for coordinating AI models, tool usage, and database access.

The data server here is Vera BlueField, the world's fastest storage server and storage system. It's crucial because agents access memory at extremely high speeds. Storage servers and CPUs are now on the critical path of the most expensive part of the data center.

There's a good reason why this is the most expensive. The core economics of an AI factory are tokens, and tokens are created here. So, you naturally want to produce and generate as many tokens as possible. Economic value is concentrated here, and the CPU and storage system must not become bottlenecks.

Therefore, Vera CPU puts a lot of pressure on CPU architecture, which is also why we built a completely new architecture from scratch. This is a CPU the world has never seen before; we call it Vera. This is a CPU built for agents. All CPUs until now were built for humans; this CPU is built for agents.

First, Vera's instructions per clock (IPC) must be extremely strong because we need to reduce latency, reduce processing time. We want single-threaded performance, not just throughput. Single-threaded performance must be world-class, the best. So Vera's IPC is extremely high, among the highest in the world: 10 instructions fetched, decoded, and executed per clock cycle.

Second, the bandwidth the CPU needs for data in and out must be world-class. This includes both per-core bandwidth and total bandwidth. As I said earlier, agentic systems are inherently decoupled and distributed. When computing is decoupled and distributed, networking becomes the issue. Therefore, we must move data as fast as possible between CPU cores, between CPU and storage, and between CPU and GPU.

Bandwidth around the system and inside the CPU cores must be world-class because CPU cores are communicating with each other at extremely high bandwidth. They are not rented out one core at a time; they all collaborate together. Vera's cross-sectional bandwidth is amazing. It's the first system to support PCI Express Gen 6, also first to feature LPDDR5, with bandwidth reaching 1.2 to 2 TB per second, 2 to 3 times that of the highest-performance CPUs.

This is a CPU built for agents. This market will certainly be larger than the previous one because the number of agents will far exceed humans, and agents are extremely impatient. This is the NVIDIA Vera CPU.

The Most Important Computing Paradigm for the Next Decade

Jensen Huang:

This is truly the most important slide. The core conclusion here is: This is the application pattern for the next decade, and also the computing pattern for the next decade. Agents, agent frameworks, and the large language models coordinated by the framework—every company will run this. Every company will become an agent company; every company will have agents running internally; every company will find that agents need their own operating system.

Every company is asking us: How to run agents securely? How to build agents for our workloads? So, we have the NVIDIA Enterprise AI Agent Toolkit. You've actually seen me building it publicly step by step.

Almost everything NVIDIA does, as you know, if you look back at my GTC speeches 5 or 10 years ago, you'll see I've been talking about these things for years because we've been preparing for this moment.

For enterprises to build agents as a service, or agents for operations, they need four things. First, they need models. Of course, the smarter, cheaper, and faster the large language model, the better. Second, they need a framework to coordinate the entire system. Third, these models want to use tools, and these tools come with skills. I just showed the CUDA-X libraries; they will become powerful tools for agents in the future. Fourth, they need a runtime, an operating system that ties everything together.

This is the NVIDIA Agent Toolkit. It includes modifiable models, namely NVIDIA's world-class open-source models. I want to show more. You can run agents from anywhere; you can run powerful agents like Claude Code, or powerful agents like Codex. You can place them within a framework called Open Shell for highly secure operation within the enterprise.

This Shell protects the agent, keeping it always constrained by security policies. Privacy is protected, permissions and privileges are explicitly assigned, identity is protected. Therefore, Open Shell is being adopted globally. NVIDIA Open Shell is open-source; you'll see many companies adopting it, including Red Hat, Canonical, and Microsoft. It will be adopted everywhere.

This is an important runtime, and this runtime is fully optimized for the ubiquitous NVIDIA AI platform. You can run Open Shell on any cloud, on-premises, even on devices. Now, you have tools and libraries agents can use, models you can modify or use directly, and agent frameworks. These agent frameworks can now run on-premises or anywhere else.

One of my favorite agent use cases is chip designers. This is one of NVIDIA's most important jobs. So, of course, we worked with Cadence to build a chip design super-agent. It's coordinated by Codex or Claude Code, taking RTL, architecture diagrams, schematics, or specifications as input, helping you fix what needs fixing. We've built some super-agents together and optimized Nemotron for the NVIDIA runtime.

NVIDIA is committed to building open models for the world, so you, all of us, can create our own agents. Today, we announce Nemotron 3 Ultra, our next-generation open model, and it's very smart. Nemotron models not only give you the model, but also all the data we used to train the model.

Because we have a strong partner alliance, you can see all the partners listed here. We work together, contribute data to each other. Through these great partnerships, everything—from the model to the training scripts to the data—will be fully opened to you. This is the best form of open model, the world's best open model system policy. The goal is simple: You can take everything, add to it, make it better, and make it your own model.

Nemotron 3 Ultra is 5 times faster, costs 30% less, and is fully open. We are very firm on this. This is Nemotron 3, and we are also developing Nemotron 4. It's this complete toolkit of models, frameworks, tool skills, and runtimes that enables every enterprise globally to create their own agents, just like Cadence with its super-agent.

NVIDIA's New Generation Personal Computers

Jensen Huang:

Microsoft and NVIDIA will reinvent the PC. This will become the new PC. Tomorrow night, our tomorrow night here, I'll be with Satya to talk more about the work we've been advancing together over the past three years. Microsoft and NVIDIA have spent so much time completely rethinking how the PC operates, precisely to prepare for this moment.

As I mentioned earlier, this agent computing paradigm will run on the AI cloud, within enterprises, and on your PC. What happens when a PC has an autonomous agent? It helps you, understands you. You can talk to it; it can see you. You can have it read files, help you with research. It can do even more, which I'll show later.

The new operating system, of course, is the old OS plus a large language model. In many ways, the large language model is the modern version of DirectX. It has input and output, understands prompts, understands computer vision, can generate video, can generate sound. It's a modern intelligent extension of the PC, of the computer.

On top of that, as I said earlier, applications will be replaced by agent runtimes, and the modern application is the agent.

Everyone, the NVIDIA RTX Spark laptop. Thank you. I have too many things in my pockets. Okay, this is the world's most amazing chip. This is the N1X we built in collaboration with MediaTek. I think I just saw Rick. This is the N1X, a beautiful chip. Frankly, it's a chip that took 33 years to build.

The reason is, 100% of the NVIDIA software stack can run here. Want to do digital biology? No problem. Want to do seismic processing? No problem. Want to do astrophysics? No problem. Everything related to CUDA, all physics, all biology, all genomics, all AI, no problem. All computer graphics, no problem.

Every application NVIDIA has ever created, and every application Windows has ever run, Microsoft and NVIDIA have meticulously optimized so that this computer can truly run everything the world has ever created. On top of that, it can now run agents. This is an incredible computer; I'm very proud of it.

This computer can have a local Nemotron 3 Ultra model, or a Nemotron 3 super model; it can also connect to cloud-based Claude Code, Codex, or other models; it can also connect to models on the network. It will work and accomplish amazing things. RTX Spark is a reinvention of the laptop, but in fact, Microsoft and NVIDIA are reinventing the entire PC.

Today, we announce a brand-new product line: three revolutionary Windows machines, covering desktop, notebook, and workstation. They are 100% compatible with Windows, 100% support CUDA, 100% equipped with NVIDIA AI Tensor Cores. Everything you've seen running on various NVIDIA platforms globally can run here.

We have a roadmap for this. This is a brand-new product family. For each generation architecture, we'll have desktop, notebook, workstation; the next generation will still have desktop, notebook, workstation. I'm very happy and honored that 100% of the global PC industry has joined us in reinventing the PC. This is a new product line and a new beginning.

Cosmos 3: The Foundational Model for Physical AI

Jensen Huang:

In the context of language models, the English and various languages we train on from the internet are from a human perspective. They are written by us and read by us. However, to create data for AI robots, it must be from the robot's perception and perspective. The vast majority of video data in the world is from a third-person perspective, not first-person.

Therefore, for agentic systems, robotic systems, and physical AI, data is the hardest problem. You've seen us climb this ladder. We started with teleoperation, essentially human demonstration. This is no different from the human feedback breakthrough in reinforcement learning. Then, we used simulation, which is where Omniverse comes into play. This is also analogous to verifiable rewards in reinforcement learning.

We use these systems to bootstrap AI models, bootstrap physical AI models. Eventually, we can learn from a third-person perspective and reproject it to a first-person perspective. Through this bootstrapping process, we end up with a world foundation model that can understand the physical world from any perspective you want. Third-person, first-person, outside-in, inside-out, all possible. This is indeed a major breakthrough.

Today, we announce Cosmos 3. Cosmos 3 is the forefront of physical AI. We are at the forefront in language models; many are researching them. But in physical AI, we are absolutely the strongest globally. I'm immensely proud of the team for achieving this.

This is your foundational model for all your work. Whether you want to create robots, factory robots, or robots working in factories, as long as it involves the physical world, you now have a partner: Cosmos 3. It can understand and reason, can generate, can simulate in closed loops, and can even become the strategy itself. It leads in various global benchmarks. I'm very proud of Cosmos. Today, we announce Cosmos 3.

It used to be data plus computing equals AI. Now we have AI, and computing will also become data. So, using Cosmos 3, train a large batch of AI models. Cosmos is a very excellent open model system, exactly like Nemotron. We open the model, open the data, even open the training methods, so you can enhance it for yourself and turn Cosmos into your proprietary model.

Alpamayo 2: Autonomous Vehicle Inference

Jensen Huang:

Today, we announce Alpamayo 2, an open model for autonomous vehicles. We are collaborating with global automotive companies. Looking at these brands that have joined NVIDIA Hyperion, are building NVIDIA Hyperion cars, they represent about 80% of global car production. That is, these manufacturers cover around 80% of global cars.

There will be a large number of NVIDIA Hyperion systems in the future, capable of running Alpamayo and any other autonomous driving technology stack. We also connect to mobility services. About 97% of global mobility services are connecting with us. Therefore, when we deploy Alpamayo on the Hyperion runtime and Halos operating system, we can connect to these global services.

Isaac GR00T: Humanoid Robots

Jensen Huang:

NVIDIA Isaac GR00T is our humanoid robot technology stack, containing models, data generation, simulation, runtime, and operating system. It represents the GR00T platform, the Isaac GR00T platform.

As you can see, every one of our systems follows the exact same pattern: whether it's cloud-based agentic systems, agentic systems on PCs, robotic systems for autonomous vehicles, or robotic systems for humanoid robots, it's the same pattern.

Of course, in each case, we build everything completely. We do vertical integration, complete integration, employ co-design and extreme co-design, then open it up so everyone can use any part as they need. You want to use something; we'll even help you modify it.

But there's still one thing missing: robotic systems need a reference platform. These robotic systems are too complex, with many motors and sensors, and very fragile. However, we need a way to deliver these reference platforms. Just like we did for PCs, DGX, cloud, and autonomous vehicles, now we must do the same for robots.

Today, we announce NVIDIA Isaac GR00T, a fully integrated humanoid robot reference platform. It has 25 degrees of freedom per hand, 31 degrees of freedom for the robot body, stands 6 feet tall, weighs 150 lbs. Just like me, except the first number is smaller than mine, the second larger, otherwise similar.

This platform runs the new Thor, along with our complete software stack, data generation stack, data simulation stack, and runtime. Everything is integrated into a single robot platform for everyone to use. We built it for higher education and university researchers because building such a platform themselves is too difficult.

Recap and Summary

Jensen Huang:

Over the past six months, the computer industry has been completely transformed. The reason for the change is that agents have finally been realized and have converged with the latest frontier models, enabling AI to now do truly useful work.

This computing pattern will repeat over and over: an agent composed of models and a framework, using tools with skills, and running on a certain runtime. The runtime depends on whether it's in the cloud, on-premises enterprise environment, PC, or robot. But the computing pattern is exactly the same.

You will use different frameworks based on your preference, and different models based on your preference. You will improve them for your proprietary uses. You will create super-agents, rent them to others, help others accomplish work. This agentic platform, this agentic pattern, is precisely what the NVIDIA Enterprise AI Toolkit aims to support. For you, this is a great way to participate in AI; for us, it's also a huge growth opportunity.

Vera Rubin is in full production. Grace Blackwell was built to handle AI, especially inference; Vera Rubin was built to run agents. It is in full production. It is far more than just a GPU; it's an entire decoupled, distributed agent processing system.

NVIDIA has truly become an infrastructure company. Not just a GPU company, not just a systems company, but an infrastructure company. Our goal is to help you create maximum revenue, maximum profit, and do so as quickly as possible.

In the world of agents, this new way of computing means CPUs must also be built for agents, not for people. CPUs built for agents have their own special requirements. Our NVIDIA Vera is a revolution. I'm happy to see its ramp and order status; it will be the fastest, most successful product launch in NVIDIA's history.

NVIDIA and Microsoft have created a brand-new PC product line. This is a new beginning. Of course, the same agentic processing pattern, agentic computing pattern I just described, will also run on various devices. I mentioned PCs, but in the future, it will appear in robots, satellites, base stations, factories, cloud, on-premises, edge devices. This agentic AI system, agentic computing pattern, will be replicated in all kinds of computers. Our understanding of the personal computer will likely change.

Related Questions

QAccording to Jensen Huang's GTC Taipei 2026 speech, what is the core shift in AI that NVIDIA is focusing on, and what does it signify for the industry?

AThe core shift is from generative AI to agentic AI. Jensen Huang states that 'agentic AI has arrived' and that useful, productive AI is now a reality. This signifies that AI is transitioning from being a content generator to an agent capable of performing actual work, making it a 'profit generator' and a 'GDP generator' for the industry.

QWhat is the Vera Rubin, and how does it represent a strategic evolution for NVIDIA?

AThe Vera Rubin is not just a chip or GPU, but a complete end-to-end system designed specifically to run AI agents. It represents NVIDIA's strategic evolution from a GPU company to a system company, and now into an 'infrastructure company' focused on helping customers build and deploy complete 'AI factories' for generating revenue and profit through agentic AI.

QWhat is unique about the new NVIDIA Vera CPU, and why was it developed?

AThe NVIDIA Vera CPU is unique because it is the first CPU designed specifically for AI agents, not humans. It was developed because agents operate on nanosecond timescales and are 'impatient.' The Vera CPU prioritizes extreme single-thread performance, instructions per clock (IPC), per-core bandwidth, and overall system bandwidth to minimize latency and meet the demands of real-time tool use and memory access by agents.

QWhat are the key components of the 'agentic computing model' that Jensen Huang describes as defining the next decade?

AThe key components of the agentic computing model are: 1) a model (large language model), 2) an agent framework (which coordinates tasks like an operating system), 3) tools with skills (like databases or software libraries), and 4) a runtime. This model will be replicated across various platforms including the cloud, enterprises, PCs, robots, and edge devices.

QWhat new hardware platform did NVIDIA announce in collaboration with Microsoft, and what is its significance?

ANVIDIA announced a new line of personal computers in collaboration with Microsoft, including desktops, notebooks, and workstations. This signifies the reinvention of the PC. These machines will be 100% compatible with Windows and CUDA, and feature NVIDIA AI Tensor Cores to natively support the agentic computing model, allowing AI agents to run locally, understand the user, and perform useful tasks.

Related Reads

a16z: Why Prediction Markets Could Become the Infrastructure for 'Future Probabilities'

The article explores the concept and potential of prediction markets, arguing that they are evolving from niche trading tools into a foundational infrastructure for assessing the probability of future events. A prediction market creates tradable contracts on specific event outcomes, using market price to aggregate dispersed information and approximate a collective probability assessment. This mechanism offers advantages over polls or expert forecasts by providing a real-time, incentivized signal, as participants risk real money on their judgments. Key strengths include the ability to generate probabilistic estimates, built-in financial incentives that encourage genuine information gathering, and the capacity to address specialized questions (e.g., AI model performance, geopolitical events) not easily captured by traditional financial markets. The author emphasizes that a prediction market is essentially a market—a tool for both resource allocation and information aggregation. However, the article also outlines significant challenges for reliability and effectiveness. Success depends on participation from well-informed traders, thoughtful contract design, unambiguous outcome resolution, and robust safeguards against manipulation (e.g., by insiders or groups seeking to influence public perception). Without these, prices may be mere noise or tools for propaganda. The future of prediction markets, therefore, lies not simply in scaling up trading volume, but in building more credible and transparent infrastructure. This includes clear rules for participation, auditable settlement mechanisms, and designs that mitigate manipulation. If these challenges can be addressed, prediction markets could become a vital public utility for navigating uncertainty, providing a new class of probability signals about the future.

marsbit20m ago

a16z: Why Prediction Markets Could Become the Infrastructure for 'Future Probabilities'

marsbit20m ago

Trading

Spot
Futures

Hot Articles

What is SONIC

Sonic: Pioneering the Future of Gaming in Web3 Introduction to Sonic In the ever-evolving landscape of Web3, the gaming industry stands out as one of the most dynamic and promising sectors. At the forefront of this revolution is Sonic, a project designed to amplify the gaming ecosystem on the Solana blockchain. Leveraging cutting-edge technology, Sonic aims to deliver an unparalleled gaming experience by efficiently processing millions of requests per second, ensuring that players enjoy seamless gameplay while maintaining low transaction costs. This article delves into the intricate details of Sonic, exploring its creators, funding sources, operational mechanics, and the timeline of significant events that have shaped its journey. What is Sonic? Sonic is an innovative layer-2 network that operates atop the Solana blockchain, specifically tailored to enhance the existing Solana gaming ecosystem. It accomplishes this through a customised, VM-agnostic game engine paired with a HyperGrid interpreter, facilitating sovereign game economies that roll up back to the Solana platform. The primary goals of Sonic include: Enhanced Gaming Experiences: Sonic is committed to offering lightning-fast on-chain gameplay, allowing players and developers to engage with games at previously unattainable speeds. Atomic Interoperability: This feature enables transactions to be executed within Sonic without the need to redeploy Solana programmes and accounts. This makes the process more efficient and directly benefits from Solana Layer1 services and liquidity. Seamless Deployment: Sonic allows developers to write for Ethereum Virtual Machine (EVM) based systems and execute them on Solana’s SVM infrastructure. This interoperability is crucial for attracting a broader range of dApps and decentralised applications to the platform. Support for Developers: By offering native composable gaming primitives and extensible data types - dining within the Entity-Component-System (ECS) framework - game creators can craft intricate business logic with ease. Overall, Sonic's unique approach not only caters to players but also provides an accessible and low-cost environment for developers to innovate and thrive. Creator of Sonic The information regarding the creator of Sonic is somewhat ambiguous. However, it is known that Sonic's SVM is owned by the company Mirror World. The absence of detailed information about the individuals behind Sonic reflects a common trend in several Web3 projects, where collective efforts and partnerships often overshadow individual contributions. Investors of Sonic Sonic has garnered considerable attention and support from various investors within the crypto and gaming sectors. Notably, the project raised an impressive $12 million during its Series A funding round. The round was led by BITKRAFT Ventures, with other notable investors including Galaxy, Okx Ventures, Interactive, Big Brain Holdings, and Mirana. This financial backing signifies the confidence that investment foundations have in Sonic’s potential to revolutionise the Web3 gaming landscape, further validating its innovative approaches and technologies. How Does Sonic Work? Sonic utilises the HyperGrid framework, a sophisticated parallel processing mechanism that enhances its scalability and customisability. Here are the core features that set Sonic apart: Lightning Speed at Low Costs: Sonic offers one of the fastest on-chain gaming experiences compared to other Layer-1 solutions, powered by the scalability of Solana’s virtual machine (SVM). Atomic Interoperability: Sonic enables transaction execution without redeployment of Solana programmes and accounts, effectively streamlining the interaction between users and the blockchain. EVM Compatibility: Developers can effortlessly migrate decentralised applications from EVM chains to the Solana environment using Sonic’s HyperGrid interpreter, increasing the accessibility and integration of various dApps. Ecosystem Support for Developers: By exposing native composable gaming primitives, Sonic facilitates a sandbox-like environment where developers can experiment and implement business logic, greatly enhancing the overall development experience. Monetisation Infrastructure: Sonic natively supports growth and monetisation efforts, providing frameworks for traffic generation, payments, and settlements, thereby ensuring that gaming projects are not only viable but also sustainable financially. Timeline of Sonic The evolution of Sonic has been marked by several key milestones. Below is a brief timeline highlighting critical events in the project's history: 2022: The Sonic cryptocurrency was officially launched, marking the beginning of its journey in the Web3 gaming arena. 2024: June: Sonic SVM successfully raised $12 million in a Series A funding round. This investment allowed Sonic to further develop its platform and expand its offerings. August: The launch of the Sonic Odyssey testnet provided users with the first opportunity to engage with the platform, offering interactive activities such as collecting rings—a nod to gaming nostalgia. October: SonicX, an innovative crypto game integrated with Solana, made its debut on TikTok, capturing the attention of over 120,000 users within a short span. This integration illustrated Sonic’s commitment to reaching a broader, global audience and showcased the potential of blockchain gaming. Key Points Sonic SVM is a revolutionary layer-2 network on Solana explicitly designed to enhance the GameFi landscape, demonstrating great potential for future development. HyperGrid Framework empowers Sonic by introducing horizontal scaling capabilities, ensuring that the network can handle the demands of Web3 gaming. Integration with Social Platforms: The successful launch of SonicX on TikTok displays Sonic’s strategy to leverage social media platforms to engage users, exponentially increasing the exposure and reach of its projects. Investment Confidence: The substantial funding from BITKRAFT Ventures, among others, emphasizes the robust backing Sonic has, paving the way for its ambitious future. In conclusion, Sonic encapsulates the essence of Web3 gaming innovation, striking a balance between cutting-edge technology, developer-centric tools, and community engagement. As the project continues to evolve, it is poised to redefine the gaming landscape, making it a notable entity for gamers and developers alike. As Sonic moves forward, it will undoubtedly attract greater interest and participation, solidifying its place within the broader narrative of blockchain gaming.

1.6k Total ViewsPublished 2024.04.04Updated 2024.12.03

What is SONIC

What is $S$

Understanding SPERO: A Comprehensive Overview Introduction to SPERO As the landscape of innovation continues to evolve, the emergence of web3 technologies and cryptocurrency projects plays a pivotal role in shaping the digital future. One project that has garnered attention in this dynamic field is SPERO, denoted as SPERO,$$s$. This article aims to gather and present detailed information about SPERO, to help enthusiasts and investors understand its foundations, objectives, and innovations within the web3 and crypto domains. What is SPERO,$$s$? SPERO,$$s$ is a unique project within the crypto space that seeks to leverage the principles of decentralisation and blockchain technology to create an ecosystem that promotes engagement, utility, and financial inclusion. The project is tailored to facilitate peer-to-peer interactions in new ways, providing users with innovative financial solutions and services. At its core, SPERO,$$s$ aims to empower individuals by providing tools and platforms that enhance user experience in the cryptocurrency space. This includes enabling more flexible transaction methods, fostering community-driven initiatives, and creating pathways for financial opportunities through decentralised applications (dApps). The underlying vision of SPERO,$$s$ revolves around inclusiveness, aiming to bridge gaps within traditional finance while harnessing the benefits of blockchain technology. Who is the Creator of SPERO,$$s$? The identity of the creator of SPERO,$$s$ remains somewhat obscure, as there are limited publicly available resources providing detailed background information on its founder(s). This lack of transparency can stem from the project's commitment to decentralisation—an ethos that many web3 projects share, prioritising collective contributions over individual recognition. By centring discussions around the community and its collective goals, SPERO,$$s$ embodies the essence of empowerment without singling out specific individuals. As such, understanding the ethos and mission of SPERO remains more important than identifying a singular creator. Who are the Investors of SPERO,$$s$? SPERO,$$s$ is supported by a diverse array of investors ranging from venture capitalists to angel investors dedicated to fostering innovation in the crypto sector. The focus of these investors generally aligns with SPERO's mission—prioritising projects that promise societal technological advancement, financial inclusivity, and decentralised governance. These investor foundations are typically interested in projects that not only offer innovative products but also contribute positively to the blockchain community and its ecosystems. The backing from these investors reinforces SPERO,$$s$ as a noteworthy contender in the rapidly evolving domain of crypto projects. How Does SPERO,$$s$ Work? SPERO,$$s$ employs a multi-faceted framework that distinguishes it from conventional cryptocurrency projects. Here are some of the key features that underline its uniqueness and innovation: Decentralised Governance: SPERO,$$s$ integrates decentralised governance models, empowering users to participate actively in decision-making processes regarding the project’s future. This approach fosters a sense of ownership and accountability among community members. Token Utility: SPERO,$$s$ utilises its own cryptocurrency token, designed to serve various functions within the ecosystem. These tokens enable transactions, rewards, and the facilitation of services offered on the platform, enhancing overall engagement and utility. Layered Architecture: The technical architecture of SPERO,$$s$ supports modularity and scalability, allowing for seamless integration of additional features and applications as the project evolves. This adaptability is paramount for sustaining relevance in the ever-changing crypto landscape. Community Engagement: The project emphasises community-driven initiatives, employing mechanisms that incentivise collaboration and feedback. By nurturing a strong community, SPERO,$$s$ can better address user needs and adapt to market trends. Focus on Inclusion: By offering low transaction fees and user-friendly interfaces, SPERO,$$s$ aims to attract a diverse user base, including individuals who may not previously have engaged in the crypto space. This commitment to inclusion aligns with its overarching mission of empowerment through accessibility. Timeline of SPERO,$$s$ Understanding a project's history provides crucial insights into its development trajectory and milestones. Below is a suggested timeline mapping significant events in the evolution of SPERO,$$s$: Conceptualisation and Ideation Phase: The initial ideas forming the basis of SPERO,$$s$ were conceived, aligning closely with the principles of decentralisation and community focus within the blockchain industry. Launch of Project Whitepaper: Following the conceptual phase, a comprehensive whitepaper detailing the vision, goals, and technological infrastructure of SPERO,$$s$ was released to garner community interest and feedback. Community Building and Early Engagements: Active outreach efforts were made to build a community of early adopters and potential investors, facilitating discussions around the project’s goals and garnering support. Token Generation Event: SPERO,$$s$ conducted a token generation event (TGE) to distribute its native tokens to early supporters and establish initial liquidity within the ecosystem. Launch of Initial dApp: The first decentralised application (dApp) associated with SPERO,$$s$ went live, allowing users to engage with the platform's core functionalities. Ongoing Development and Partnerships: Continuous updates and enhancements to the project's offerings, including strategic partnerships with other players in the blockchain space, have shaped SPERO,$$s$ into a competitive and evolving player in the crypto market. Conclusion SPERO,$$s$ stands as a testament to the potential of web3 and cryptocurrency to revolutionise financial systems and empower individuals. With a commitment to decentralised governance, community engagement, and innovatively designed functionalities, it paves the way toward a more inclusive financial landscape. As with any investment in the rapidly evolving crypto space, potential investors and users are encouraged to research thoroughly and engage thoughtfully with the ongoing developments within SPERO,$$s$. The project showcases the innovative spirit of the crypto industry, inviting further exploration into its myriad possibilities. While the journey of SPERO,$$s$ is still unfolding, its foundational principles may indeed influence the future of how we interact with technology, finance, and each other in interconnected digital ecosystems.

54 Total ViewsPublished 2024.12.17Updated 2024.12.17

What is $S$

What is AGENT S

Agent S: The Future of Autonomous Interaction in Web3 Introduction In the ever-evolving landscape of Web3 and cryptocurrency, innovations are constantly redefining how individuals interact with digital platforms. One such pioneering project, Agent S, promises to revolutionise human-computer interaction through its open agentic framework. By paving the way for autonomous interactions, Agent S aims to simplify complex tasks, offering transformative applications in artificial intelligence (AI). This detailed exploration will delve into the project's intricacies, its unique features, and the implications for the cryptocurrency domain. What is Agent S? Agent S stands as a groundbreaking open agentic framework, specifically designed to tackle three fundamental challenges in the automation of computer tasks: Acquiring Domain-Specific Knowledge: The framework intelligently learns from various external knowledge sources and internal experiences. This dual approach empowers it to build a rich repository of domain-specific knowledge, enhancing its performance in task execution. Planning Over Long Task Horizons: Agent S employs experience-augmented hierarchical planning, a strategic approach that facilitates efficient breakdown and execution of intricate tasks. This feature significantly enhances its ability to manage multiple subtasks efficiently and effectively. Handling Dynamic, Non-Uniform Interfaces: The project introduces the Agent-Computer Interface (ACI), an innovative solution that enhances the interaction between agents and users. Utilizing Multimodal Large Language Models (MLLMs), Agent S can navigate and manipulate diverse graphical user interfaces seamlessly. Through these pioneering features, Agent S provides a robust framework that addresses the complexities involved in automating human interaction with machines, setting the stage for myriad applications in AI and beyond. Who is the Creator of Agent S? While the concept of Agent S is fundamentally innovative, specific information about its creator remains elusive. The creator is currently unknown, which highlights either the nascent stage of the project or the strategic choice to keep founding members under wraps. Regardless of anonymity, the focus remains on the framework's capabilities and potential. Who are the Investors of Agent S? As Agent S is relatively new in the cryptographic ecosystem, detailed information regarding its investors and financial backers is not explicitly documented. The lack of publicly available insights into the investment foundations or organisations supporting the project raises questions about its funding structure and development roadmap. Understanding the backing is crucial for gauging the project's sustainability and potential market impact. How Does Agent S Work? At the core of Agent S lies cutting-edge technology that enables it to function effectively in diverse settings. Its operational model is built around several key features: Human-like Computer Interaction: The framework offers advanced AI planning, striving to make interactions with computers more intuitive. By mimicking human behaviour in tasks execution, it promises to elevate user experiences. Narrative Memory: Employed to leverage high-level experiences, Agent S utilises narrative memory to keep track of task histories, thereby enhancing its decision-making processes. Episodic Memory: This feature provides users with step-by-step guidance, allowing the framework to offer contextual support as tasks unfold. Support for OpenACI: With the ability to run locally, Agent S allows users to maintain control over their interactions and workflows, aligning with the decentralised ethos of Web3. Easy Integration with External APIs: Its versatility and compatibility with various AI platforms ensure that Agent S can fit seamlessly into existing technological ecosystems, making it an appealing choice for developers and organisations. These functionalities collectively contribute to Agent S's unique position within the crypto space, as it automates complex, multi-step tasks with minimal human intervention. As the project evolves, its potential applications in Web3 could redefine how digital interactions unfold. Timeline of Agent S The development and milestones of Agent S can be encapsulated in a timeline that highlights its significant events: September 27, 2024: The concept of Agent S was launched in a comprehensive research paper titled “An Open Agentic Framework that Uses Computers Like a Human,” showcasing the groundwork for the project. October 10, 2024: The research paper was made publicly available on arXiv, offering an in-depth exploration of the framework and its performance evaluation based on the OSWorld benchmark. October 12, 2024: A video presentation was released, providing a visual insight into the capabilities and features of Agent S, further engaging potential users and investors. These markers in the timeline not only illustrate the progress of Agent S but also indicate its commitment to transparency and community engagement. Key Points About Agent S As the Agent S framework continues to evolve, several key attributes stand out, underscoring its innovative nature and potential: Innovative Framework: Designed to provide an intuitive use of computers akin to human interaction, Agent S brings a novel approach to task automation. Autonomous Interaction: The ability to interact autonomously with computers through GUI signifies a leap towards more intelligent and efficient computing solutions. Complex Task Automation: With its robust methodology, it can automate complex, multi-step tasks, making processes faster and less error-prone. Continuous Improvement: The learning mechanisms enable Agent S to improve from past experiences, continually enhancing its performance and efficacy. Versatility: Its adaptability across different operating environments like OSWorld and WindowsAgentArena ensures that it can serve a broad range of applications. As Agent S positions itself in the Web3 and crypto landscape, its potential to enhance interaction capabilities and automate processes signifies a significant advancement in AI technologies. Through its innovative framework, Agent S exemplifies the future of digital interactions, promising a more seamless and efficient experience for users across various industries. Conclusion Agent S represents a bold leap forward in the marriage of AI and Web3, with the capacity to redefine how we interact with technology. While still in its early stages, the possibilities for its application are vast and compelling. Through its comprehensive framework addressing critical challenges, Agent S aims to bring autonomous interactions to the forefront of the digital experience. As we move deeper into the realms of cryptocurrency and decentralisation, projects like Agent S will undoubtedly play a crucial role in shaping the future of technology and human-computer collaboration.

700 Total ViewsPublished 2025.01.14Updated 2025.01.14

What is AGENT S

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of S (S) are presented below.

活动图片