Jensen Huang's 2026 GTC Taipei Speech: The Era of AI Agents is Here, Computing is Revenue

marsbitPublicado em 2026-06-03Última atualização em 2026-06-03

Resumo

NVIDIA CEO Jensen Huang's 2026 GTC Taipei speech announces the arrival of the "Agent AI" era, where AI transitions from content generation to performing useful work. Huang positions tokens as units of profit and GDP, driving massive demand for computing power and "AI factories." NVIDIA's strategy revolves around a new computing paradigm centered on AI agents, which combine large language models (LLMs) with agent frameworks for planning, memory, and tool use. Key announcements include: * **Vera Rubin:** A complete, end-to-end system (not just a GPU) designed from the ground up to run AI agents at scale, representing NVIDIA's evolution into an infrastructure company. * **Vera CPU:** A revolutionary CPU architecture built specifically for impatient AI agents, prioritizing low latency, single-thread performance, and massive bandwidth over traditional multi-core throughput. * **Enterprise AI Agent Toolkit:** A suite including open models (like Nemotron 3 Ultra), frameworks, tools, and a secure runtime (Open Shell) to enable every company to build and deploy its own AI agents. * **Next-Gen PCs with Microsoft:** A new line of Windows desktops, laptops, and workstations co-developed with Microsoft, featuring the N1X chip and designed to run local AI agents, redefining the personal computer. * **Physical AI Foundation Models:** Introduction of Cosmos 3 for robotics and physical AI, Alpamayo 2 for autonomous driving, and the Isaac GR00T platform—a fully integrated humanoid r...

Organized & Compiled: Deep Tide TechFlow

Guest: Jensen Huang, CEO of NVIDIA

Podcast Source: Bonnie Blockchain

Original Title: 7 Core Points from Jensen Huang's 2026 GTC Taipei Speech, NVIDIA's Latest Strategy Cheat Sheet! 【Bonnie Blockchain】

Broadcast Date: June 2, 2026

Summary of Key Points

In his 2026 GTC Taipei speech, Jensen Huang focused NVIDIA's next-phase strategy on one core judgment: AI has transitioned from generating content into the era of functional agents. Tokens are no longer just technical metrics but units of production for revenue, profit, and GDP. Centered around this shift, NVIDIA introduced Vera Rubin, Vera CPU, an enterprise-grade agent toolkit, new-generation PCs in collaboration with Microsoft, and Cosmos 3, Alpamayo 2, and Isaac GR00T for physical AI. Huang emphasized that the computing paradigm for the next decade will be composed of models, agent frameworks, tool skills, and runtimes, diffusing from the cloud, enterprises, and local PCs to robots, factories, satellites, and edge devices. For Taiwan's supply chain, this means AI factories, power efficiency, infrastructure delivery speed, and full-stack synergy will become the keys to the next wave of industry growth.

Excerpts of Highlights

The Arrival of the AI Agent Era

  • "Useful AI has arrived; AI is now a profit generator and a GDP generator. Behind it is not just large language models, but a brand-new computing paradigm: agents."
  • "Agents are composed of large language models and an agent framework. The framework connects memory, tools, reasoning, planning, and action like an operating system."
  • "The breakthrough in agent systems comes from large language models now being capable of thinking, reasoning, planning, and using tools, as well as from agent frameworks capable of managing memory, coordinating workflows, and dispatching tools."
  • "Every company will become an agent company; every company will run agents internally, and every company will need its own agent operating system."

Tokens, AI Factories, and Infrastructure Economics

  • "Tokens are now profitable units of revenue. AI companies wanting to produce more tokens will build more AI factories, which is precisely why Taiwan's computing demand is exploding."
  • "Computing is revenue, computing is profit. Without revenue and profit, it's a loss."
  • "If an AI factory has only 1 gigawatt of power, that 1 GW is the limit; under this constraint, throughput per watt is revenue, because every token has value."
  • "Choosing the wrong architecture just because the chip is cheaper doesn't translate into real returns; you need to ensure revenue per watt. The more you buy, the more you earn."

Vera Rubin and NVIDIA's Infrastructure Transformation

  • "Vera Rubin is not a chip, nor just a GPU; it's a complete system built end-to-end."
  • "NVIDIA was a GPU company, then became a systems company, and is now further evolving into an infrastructure company, helping customers build AI factories."
  • "Vera Rubin is NVIDIA's most ambitious engineering project in history. All 40,000 engineers in the company are involved, and Taiwan's supply chain also participated in creating this system."
  • "Grace Blackwell was built to handle AI, especially inference; Vera Rubin is built to run agents."

Vera CPU and the Computational Needs of Agents

  • "All CPUs until now were built for humans; this CPU is built for agents."
  • "Agents have no patience. They live in a world measured not in seconds, but in nanoseconds. When an agent uses a tool, it wants the response as fast as possible; when it accesses a database, it wants results returned instantly."
  • "Vera CPU is a CPU built for agents, emphasizing single-threaded performance, instructions per clock, bandwidth per core, and total system bandwidth."
  • "This market will certainly be larger than the previous one because the number of agents will far exceed humans, and agents are extremely impatient. This is the NVIDIA Vera CPU."

Next-Generation Personal Computers

  • "The future agent computing paradigm will run on the AI cloud, within enterprises, and on your PC."
  • "The new operating system will be the traditional OS plus a large language model; in many ways, the large language model is the modern equivalent of DirectX, an intelligent extension of the computer."
  • "Applications will be replaced by agent runtimes; the modern application will become an agent."
  • "NVIDIA and Microsoft are reinventing the PC, launching a new generation of Windows machines covering desktops, notebooks, and workstations."

Physical AI, Autonomous Driving, and Robotics

  • "Language models are trained on data from a human perspective, but robots need to understand the world from the robot's own perspective. The biggest problem for physical AI is data."
  • "Cosmos 3 is a foundational model at the forefront of physical AI, capable of understanding, reasoning, generating, simulating in closed loops, and even becoming the strategy itself."
  • "With AI, computing itself will also become data; Cosmos 3 can be used to train more AI models and be enhanced into your own proprietary model."
  • "Whether it's cloud agents, PC agents, autonomous driving systems, or humanoid robots, the underlying computing pattern is the same: model, framework, tool skills, and runtime."

Jensen Huang Names Taiwanese Snacks as Part of AI Supply Chain

Jensen Huang:

The scale of the Taiwan ecosystem's development today is truly incredible. When most people talk about ecosystems, they first think of our software stack, the developer ecosystem built on top of NVIDIA computing systems. But NVIDIA's ecosystem goes beyond that; it extends all the way up to the Taiwan supply chain, where everything begins, and all the way down to the data center, ultimately reaching end users.

Today, we'll discuss almost every part of this ecosystem. There are so many people to thank. I love the ecosystem here; there are many companies, and many of my favorite ecosystem partners. Taiwan has an incredibly rich ecosystem; it's the best supply chain ecosystem in the world.

The AI Agent Era Has Arrived

Jensen Huang:

Two years ago when I came here, I started talking about how AI would move from generative AI to the next wave, which is agentic AI. Today we can say that agentic AI has arrived, useful AI has arrived.

From an industry perspective, this means demand for tokens is becoming extremely strong. Because if AI can actually do things, people will want to produce more of this capability. Tokens are now profitable units, revenue-generating units. Since it can make money, AI companies will want to build more tokens, generate more tokens, construct more AI factories, which is also the reason for the explosive growth in computing demand in Taiwan.

This is exactly why you're all so busy and your business performance is so good. In fact, it seems reflected in the stock prices of some of your companies. The computing paradigm has changed; everything has changed.

The first key point: Useful AI has arrived; AI is now a profit generator and a GDP generator. Behind it is a brand-new computing paradigm. It's not just large language models, but agents. Almost everything we discuss today will be built on this foundation.

Let me take a moment to explain what I mean. Inside is an agent, an agent application. In the past, this would have been an application, code, an operating system—code within the application running on top of the OS. Today, it's an agent, composed of one or more large language models placed within an agent framework. This framework helps coordinate its work, enabling it to truly accomplish productive tasks.

When input enters the system, the agent must understand, observe, reason, act, and use tools. Tools can be spreadsheets, web browsers, data processing engines, or database engines. Every flow of information, whether processing context, understanding what's happening, reasoning what to do next, or forming an actionable plan, needs to be coordinated by some software.

So, the essence of an agent is such a system. It handles short-term memory, or working memory, and also long-term memory, just like humans. The memory management system thus becomes extremely important. The entire system is called the agent. The large language model is responsible for thinking, and the agent framework connects everything, like an operating system.

This is the new computing paradigm and the reason agents can accomplish amazing tasks. This is a major breakthrough: Large language models are now good at thinking, reasoning, planning, using tools; at the same time, we also have agent frameworks capable of managing memory, coordinating workflows, and invoking tools. Therefore, we can now do many things we couldn't before.

What are Tokens in AI Factories?

Jensen Huang:

Tokens, DSX, GPU, CPU, Vera... We've already built the next-generation system Vera Rubin. Vera Rubin is not a chip, nor just a GPU. It starts with the GPU but goes far beyond it. The entire end-to-end system is Vera Rubin.

It includes the GPU, Vera Rubin NVLink 72, coordinated by the Vera CPU which I'll introduce later. It also includes the revolutionary Vera storage system, CX9, our software stack DOCA, and built-in security processors. All data in the system, whether at rest, in transit, or in use, is encrypted. The entire system is secure because AI models are extremely valuable. This is why the whole system follows confidential computing principles.

Any one of these systems alone could be a full revolution. Vera Rubin is NVIDIA's most ambitious engineering project in history. All 40,000 engineers in the company participated in the work on Vera Rubin, not to mention those of you present who also participated in creating the entire system. Vera Rubin is truly a marvel; it's not just a chip, but a system composed of many components.

It goes even further. Long ago, NVIDIA was a GPU company; over the years, we've evolved into a systems company. What you see now is the most complex system we've ever designed from scratch. But ultimately, our customers and partners don't want to buy computers; they want to build AI factories.

This is why NVIDIA is beginning to transform again. As you can see, many of our technologies have expanded to the full infrastructure scale. Our partners are also at the infrastructure scale: power plants, cooling systems, grid suppliers, and many industrial companies are now part of our ecosystem. In the end, we need to build the full technology stack, just like we built GPUs, Grace Blackwell, NVLink 72; now, we need to build full-stack systems enabling customers to build outstanding AI infrastructure.

Doing this well, helping customers build and deploy AI factories, is extremely important. The reason is simple: Computing is revenue, computing is profit. Without revenue and profit, it's a loss.

Everyone needs to understand one thing: When an AI infrastructure goes online, it can go live quickly, or it can drag on; throughput can be high or low; elasticity and reliability can be good or bad; effective service life can be long or short. Because this represents investments of 50, 60, or even 100 billion dollars, this curve is extremely important.

This is also why NVIDIA is a great partner. We have full integration capabilities, not just making a presentation slide, but actually creating the entire infrastructure, connecting everything, and building at scale ourselves to ensure the system runs well. Therefore, our first token time, first inference time, training startup time are all faster.

Second, our throughput per watt, tokens per watt are world-class. The reason is we integrate everything, design everything from scratch, simulate the entire system, and employ extreme co-design. Just like the Vera Rubin rack shown earlier, everything is designed for incredible throughput.

If your data center, your factory has 1 gigawatt of power, it won't get any more; that's all the generation capacity you get. Under 1 GW of power, throughput per watt is revenue, because every token generates profit, every token is revenue.

This is the future. Computing is revenue; performance per watt is your revenue. Choosing the wrong architecture just because the chip is cheaper doesn't translate into real returns; you need to ensure revenue per watt. The more you buy, the more you earn.

Standing before you now, I can tell you: Vera Rubin is in full production. The supply chain scale we've built for Vera Rubin is twice that of Grace Blackwell. Where assembling a Grace Blackwell rack used to take two hours, now it takes only five minutes. So not only is capacity higher, but production throughput is much faster, and we need all of this to meet demand.

This ecosystem is extraordinary. To support Grace Blackwell and prepare for Vera Rubin's ramp, millions of square feet of capacity have come online. I want to thank you all. Vera Rubin is in full production. Thank you.

Vera Rubin System Introduction

Jensen Huang:

Vera Rubin wasn't built just for AI. Vera Rubin wasn't built just to run AI; it was built to run agents. It's an agentic system. Imagine the complexity. And precisely because of this, agents are the final computer science breakthrough. It took so many years to finally realize their potential and become useful. The computer that can run them should also be the world's most advanced.

This is Vera Rubin. Let's take a look. Please bring Vera Rubin up.

This is Vera Rubin, Vera Rubin NVLink 72. This is part of the next-generation system; at the next GTC, I'll talk more about it; we have a lot to cover today. This is the Vera CPU rack, 256 CPUs, all liquid-cooled. I'll introduce Vera later. This is the Vera BlueField storage processing system, also the security system. And of course, our Mellanox networking, the world's first CPO. This is Vera Rubin, an amazing combination of technologies.

When we built Hopper, it was for pre-training. Pre-training was the most important application then, the most important workload we faced. When building Grace Blackwell, people said: "Jensen, NVIDIA is great at pre-training; inference is simple." Remember? Many said: "Inference is simple; we can do it too."

But you know, inference equals money. Models are very complex; achieving excellence simultaneously in high response speed, fast interaction, and high throughput is very difficult. This is why we created NVLink 72.

Today, NVIDIA's token cost is the lowest in the world. Not just 10% lower, but multiples lower, even orders of magnitude. All because we did extreme co-design, because we understood the computational model and pattern of inference, and created NVLink 72.

With Vera Rubin, things have gone beyond inference. Now it's inference within agentic systems. This is Vera Rubin. No cables, no hoses, no fans. Last time I showed it to you, cables were everywhere.

VERA CPU: The CPU for AI Agents

Jensen Huang:

Vera CPU is a CPU built for the AI era. So far, all CPUs have been built for people. We were users, we were tenants. The way humans use CPUs is living in a world measured in seconds. We rent CPU resources in the cloud; more CPU cores mean more resources to rent. The usage scenarios and economics of old CPUs are completely different from those of agents.

Agents have no patience. They live in a world measured not in seconds, but in nanoseconds. When an agent uses a tool, it wants the response as fast as possible; when it accesses a database, it wants results returned instantly. Every moment an agent waits, it's prevented from moving to the next step, and the next, and the next. Therefore, we must make the CPU as low-latency and interactive as possible.

This is why we created Vera CPU for the AI era. In our system, it has three uses. The first, of course, is for thinking within Vera Rubin. In the Vera Rubin rack, there are already two CPUs. You know, we are manufacturing and selling millions of Vera Rubins, and have already sold millions of Grace Blackwells. NVIDIA is already one of the world's largest CPU manufacturers.

The two CPUs in the Vera Rubin rack: one coordinates and manages the GPUs, manages the KV cache, and handles various software running in the rack. We also have Grace BlueField for security and isolation. The Vera compute portion is for the agent framework, responsible for coordinating AI models, tool usage, and database access.

The data server here is Vera BlueField, the world's fastest storage server and storage system. It's crucial because agents access memory at extremely high speeds. Storage servers and CPUs are now on the critical path of the most expensive part of the data center.

There's a good reason why this is the most expensive. The core economics of an AI factory are tokens, and tokens are created here. So, you naturally want to produce and generate as many tokens as possible. Economic value is concentrated here, and the CPU and storage system must not become bottlenecks.

Therefore, Vera CPU puts a lot of pressure on CPU architecture, which is also why we built a completely new architecture from scratch. This is a CPU the world has never seen before; we call it Vera. This is a CPU built for agents. All CPUs until now were built for humans; this CPU is built for agents.

First, Vera's instructions per clock (IPC) must be extremely strong because we need to reduce latency, reduce processing time. We want single-threaded performance, not just throughput. Single-threaded performance must be world-class, the best. So Vera's IPC is extremely high, among the highest in the world: 10 instructions fetched, decoded, and executed per clock cycle.

Second, the bandwidth the CPU needs for data in and out must be world-class. This includes both per-core bandwidth and total bandwidth. As I said earlier, agentic systems are inherently decoupled and distributed. When computing is decoupled and distributed, networking becomes the issue. Therefore, we must move data as fast as possible between CPU cores, between CPU and storage, and between CPU and GPU.

Bandwidth around the system and inside the CPU cores must be world-class because CPU cores are communicating with each other at extremely high bandwidth. They are not rented out one core at a time; they all collaborate together. Vera's cross-sectional bandwidth is amazing. It's the first system to support PCI Express Gen 6, also first to feature LPDDR5, with bandwidth reaching 1.2 to 2 TB per second, 2 to 3 times that of the highest-performance CPUs.

This is a CPU built for agents. This market will certainly be larger than the previous one because the number of agents will far exceed humans, and agents are extremely impatient. This is the NVIDIA Vera CPU.

The Most Important Computing Paradigm for the Next Decade

Jensen Huang:

This is truly the most important slide. The core conclusion here is: This is the application pattern for the next decade, and also the computing pattern for the next decade. Agents, agent frameworks, and the large language models coordinated by the framework—every company will run this. Every company will become an agent company; every company will have agents running internally; every company will find that agents need their own operating system.

Every company is asking us: How to run agents securely? How to build agents for our workloads? So, we have the NVIDIA Enterprise AI Agent Toolkit. You've actually seen me building it publicly step by step.

Almost everything NVIDIA does, as you know, if you look back at my GTC speeches 5 or 10 years ago, you'll see I've been talking about these things for years because we've been preparing for this moment.

For enterprises to build agents as a service, or agents for operations, they need four things. First, they need models. Of course, the smarter, cheaper, and faster the large language model, the better. Second, they need a framework to coordinate the entire system. Third, these models want to use tools, and these tools come with skills. I just showed the CUDA-X libraries; they will become powerful tools for agents in the future. Fourth, they need a runtime, an operating system that ties everything together.

This is the NVIDIA Agent Toolkit. It includes modifiable models, namely NVIDIA's world-class open-source models. I want to show more. You can run agents from anywhere; you can run powerful agents like Claude Code, or powerful agents like Codex. You can place them within a framework called Open Shell for highly secure operation within the enterprise.

This Shell protects the agent, keeping it always constrained by security policies. Privacy is protected, permissions and privileges are explicitly assigned, identity is protected. Therefore, Open Shell is being adopted globally. NVIDIA Open Shell is open-source; you'll see many companies adopting it, including Red Hat, Canonical, and Microsoft. It will be adopted everywhere.

This is an important runtime, and this runtime is fully optimized for the ubiquitous NVIDIA AI platform. You can run Open Shell on any cloud, on-premises, even on devices. Now, you have tools and libraries agents can use, models you can modify or use directly, and agent frameworks. These agent frameworks can now run on-premises or anywhere else.

One of my favorite agent use cases is chip designers. This is one of NVIDIA's most important jobs. So, of course, we worked with Cadence to build a chip design super-agent. It's coordinated by Codex or Claude Code, taking RTL, architecture diagrams, schematics, or specifications as input, helping you fix what needs fixing. We've built some super-agents together and optimized Nemotron for the NVIDIA runtime.

NVIDIA is committed to building open models for the world, so you, all of us, can create our own agents. Today, we announce Nemotron 3 Ultra, our next-generation open model, and it's very smart. Nemotron models not only give you the model, but also all the data we used to train the model.

Because we have a strong partner alliance, you can see all the partners listed here. We work together, contribute data to each other. Through these great partnerships, everything—from the model to the training scripts to the data—will be fully opened to you. This is the best form of open model, the world's best open model system policy. The goal is simple: You can take everything, add to it, make it better, and make it your own model.

Nemotron 3 Ultra is 5 times faster, costs 30% less, and is fully open. We are very firm on this. This is Nemotron 3, and we are also developing Nemotron 4. It's this complete toolkit of models, frameworks, tool skills, and runtimes that enables every enterprise globally to create their own agents, just like Cadence with its super-agent.

NVIDIA's New Generation Personal Computers

Jensen Huang:

Microsoft and NVIDIA will reinvent the PC. This will become the new PC. Tomorrow night, our tomorrow night here, I'll be with Satya to talk more about the work we've been advancing together over the past three years. Microsoft and NVIDIA have spent so much time completely rethinking how the PC operates, precisely to prepare for this moment.

As I mentioned earlier, this agent computing paradigm will run on the AI cloud, within enterprises, and on your PC. What happens when a PC has an autonomous agent? It helps you, understands you. You can talk to it; it can see you. You can have it read files, help you with research. It can do even more, which I'll show later.

The new operating system, of course, is the old OS plus a large language model. In many ways, the large language model is the modern version of DirectX. It has input and output, understands prompts, understands computer vision, can generate video, can generate sound. It's a modern intelligent extension of the PC, of the computer.

On top of that, as I said earlier, applications will be replaced by agent runtimes, and the modern application is the agent.

Everyone, the NVIDIA RTX Spark laptop. Thank you. I have too many things in my pockets. Okay, this is the world's most amazing chip. This is the N1X we built in collaboration with MediaTek. I think I just saw Rick. This is the N1X, a beautiful chip. Frankly, it's a chip that took 33 years to build.

The reason is, 100% of the NVIDIA software stack can run here. Want to do digital biology? No problem. Want to do seismic processing? No problem. Want to do astrophysics? No problem. Everything related to CUDA, all physics, all biology, all genomics, all AI, no problem. All computer graphics, no problem.

Every application NVIDIA has ever created, and every application Windows has ever run, Microsoft and NVIDIA have meticulously optimized so that this computer can truly run everything the world has ever created. On top of that, it can now run agents. This is an incredible computer; I'm very proud of it.

This computer can have a local Nemotron 3 Ultra model, or a Nemotron 3 super model; it can also connect to cloud-based Claude Code, Codex, or other models; it can also connect to models on the network. It will work and accomplish amazing things. RTX Spark is a reinvention of the laptop, but in fact, Microsoft and NVIDIA are reinventing the entire PC.

Today, we announce a brand-new product line: three revolutionary Windows machines, covering desktop, notebook, and workstation. They are 100% compatible with Windows, 100% support CUDA, 100% equipped with NVIDIA AI Tensor Cores. Everything you've seen running on various NVIDIA platforms globally can run here.

We have a roadmap for this. This is a brand-new product family. For each generation architecture, we'll have desktop, notebook, workstation; the next generation will still have desktop, notebook, workstation. I'm very happy and honored that 100% of the global PC industry has joined us in reinventing the PC. This is a new product line and a new beginning.

Cosmos 3: The Foundational Model for Physical AI

Jensen Huang:

In the context of language models, the English and various languages we train on from the internet are from a human perspective. They are written by us and read by us. However, to create data for AI robots, it must be from the robot's perception and perspective. The vast majority of video data in the world is from a third-person perspective, not first-person.

Therefore, for agentic systems, robotic systems, and physical AI, data is the hardest problem. You've seen us climb this ladder. We started with teleoperation, essentially human demonstration. This is no different from the human feedback breakthrough in reinforcement learning. Then, we used simulation, which is where Omniverse comes into play. This is also analogous to verifiable rewards in reinforcement learning.

We use these systems to bootstrap AI models, bootstrap physical AI models. Eventually, we can learn from a third-person perspective and reproject it to a first-person perspective. Through this bootstrapping process, we end up with a world foundation model that can understand the physical world from any perspective you want. Third-person, first-person, outside-in, inside-out, all possible. This is indeed a major breakthrough.

Today, we announce Cosmos 3. Cosmos 3 is the forefront of physical AI. We are at the forefront in language models; many are researching them. But in physical AI, we are absolutely the strongest globally. I'm immensely proud of the team for achieving this.

This is your foundational model for all your work. Whether you want to create robots, factory robots, or robots working in factories, as long as it involves the physical world, you now have a partner: Cosmos 3. It can understand and reason, can generate, can simulate in closed loops, and can even become the strategy itself. It leads in various global benchmarks. I'm very proud of Cosmos. Today, we announce Cosmos 3.

It used to be data plus computing equals AI. Now we have AI, and computing will also become data. So, using Cosmos 3, train a large batch of AI models. Cosmos is a very excellent open model system, exactly like Nemotron. We open the model, open the data, even open the training methods, so you can enhance it for yourself and turn Cosmos into your proprietary model.

Alpamayo 2: Autonomous Vehicle Inference

Jensen Huang:

Today, we announce Alpamayo 2, an open model for autonomous vehicles. We are collaborating with global automotive companies. Looking at these brands that have joined NVIDIA Hyperion, are building NVIDIA Hyperion cars, they represent about 80% of global car production. That is, these manufacturers cover around 80% of global cars.

There will be a large number of NVIDIA Hyperion systems in the future, capable of running Alpamayo and any other autonomous driving technology stack. We also connect to mobility services. About 97% of global mobility services are connecting with us. Therefore, when we deploy Alpamayo on the Hyperion runtime and Halos operating system, we can connect to these global services.

Isaac GR00T: Humanoid Robots

Jensen Huang:

NVIDIA Isaac GR00T is our humanoid robot technology stack, containing models, data generation, simulation, runtime, and operating system. It represents the GR00T platform, the Isaac GR00T platform.

As you can see, every one of our systems follows the exact same pattern: whether it's cloud-based agentic systems, agentic systems on PCs, robotic systems for autonomous vehicles, or robotic systems for humanoid robots, it's the same pattern.

Of course, in each case, we build everything completely. We do vertical integration, complete integration, employ co-design and extreme co-design, then open it up so everyone can use any part as they need. You want to use something; we'll even help you modify it.

But there's still one thing missing: robotic systems need a reference platform. These robotic systems are too complex, with many motors and sensors, and very fragile. However, we need a way to deliver these reference platforms. Just like we did for PCs, DGX, cloud, and autonomous vehicles, now we must do the same for robots.

Today, we announce NVIDIA Isaac GR00T, a fully integrated humanoid robot reference platform. It has 25 degrees of freedom per hand, 31 degrees of freedom for the robot body, stands 6 feet tall, weighs 150 lbs. Just like me, except the first number is smaller than mine, the second larger, otherwise similar.

This platform runs the new Thor, along with our complete software stack, data generation stack, data simulation stack, and runtime. Everything is integrated into a single robot platform for everyone to use. We built it for higher education and university researchers because building such a platform themselves is too difficult.

Recap and Summary

Jensen Huang:

Over the past six months, the computer industry has been completely transformed. The reason for the change is that agents have finally been realized and have converged with the latest frontier models, enabling AI to now do truly useful work.

This computing pattern will repeat over and over: an agent composed of models and a framework, using tools with skills, and running on a certain runtime. The runtime depends on whether it's in the cloud, on-premises enterprise environment, PC, or robot. But the computing pattern is exactly the same.

You will use different frameworks based on your preference, and different models based on your preference. You will improve them for your proprietary uses. You will create super-agents, rent them to others, help others accomplish work. This agentic platform, this agentic pattern, is precisely what the NVIDIA Enterprise AI Toolkit aims to support. For you, this is a great way to participate in AI; for us, it's also a huge growth opportunity.

Vera Rubin is in full production. Grace Blackwell was built to handle AI, especially inference; Vera Rubin was built to run agents. It is in full production. It is far more than just a GPU; it's an entire decoupled, distributed agent processing system.

NVIDIA has truly become an infrastructure company. Not just a GPU company, not just a systems company, but an infrastructure company. Our goal is to help you create maximum revenue, maximum profit, and do so as quickly as possible.

In the world of agents, this new way of computing means CPUs must also be built for agents, not for people. CPUs built for agents have their own special requirements. Our NVIDIA Vera is a revolution. I'm happy to see its ramp and order status; it will be the fastest, most successful product launch in NVIDIA's history.

NVIDIA and Microsoft have created a brand-new PC product line. This is a new beginning. Of course, the same agentic processing pattern, agentic computing pattern I just described, will also run on various devices. I mentioned PCs, but in the future, it will appear in robots, satellites, base stations, factories, cloud, on-premises, edge devices. This agentic AI system, agentic computing pattern, will be replicated in all kinds of computers. Our understanding of the personal computer will likely change.

Perguntas relacionadas

QAccording to Jensen Huang's GTC Taipei 2026 speech, what is the core shift in AI that NVIDIA is focusing on, and what does it signify for the industry?

AThe core shift is from generative AI to agentic AI. Jensen Huang states that 'agentic AI has arrived' and that useful, productive AI is now a reality. This signifies that AI is transitioning from being a content generator to an agent capable of performing actual work, making it a 'profit generator' and a 'GDP generator' for the industry.

QWhat is the Vera Rubin, and how does it represent a strategic evolution for NVIDIA?

AThe Vera Rubin is not just a chip or GPU, but a complete end-to-end system designed specifically to run AI agents. It represents NVIDIA's strategic evolution from a GPU company to a system company, and now into an 'infrastructure company' focused on helping customers build and deploy complete 'AI factories' for generating revenue and profit through agentic AI.

QWhat is unique about the new NVIDIA Vera CPU, and why was it developed?

AThe NVIDIA Vera CPU is unique because it is the first CPU designed specifically for AI agents, not humans. It was developed because agents operate on nanosecond timescales and are 'impatient.' The Vera CPU prioritizes extreme single-thread performance, instructions per clock (IPC), per-core bandwidth, and overall system bandwidth to minimize latency and meet the demands of real-time tool use and memory access by agents.

QWhat are the key components of the 'agentic computing model' that Jensen Huang describes as defining the next decade?

AThe key components of the agentic computing model are: 1) a model (large language model), 2) an agent framework (which coordinates tasks like an operating system), 3) tools with skills (like databases or software libraries), and 4) a runtime. This model will be replicated across various platforms including the cloud, enterprises, PCs, robots, and edge devices.

QWhat new hardware platform did NVIDIA announce in collaboration with Microsoft, and what is its significance?

ANVIDIA announced a new line of personal computers in collaboration with Microsoft, including desktops, notebooks, and workstations. This signifies the reinvention of the PC. These machines will be 100% compatible with Windows and CUDA, and feature NVIDIA AI Tensor Cores to natively support the agentic computing model, allowing AI agents to run locally, understand the user, and perform useful tasks.

Leituras Relacionadas

CPU, Quietly Returning to the Center of the AI Computing Power Stage

Over the past three years, AI computing power narratives have been dominated by GPUs. However, starting in 2026, this story began to shift. While training large models remains GPU-intensive, the rapid growth of inference and AI agent workloads, which require high levels of task orchestration, concurrency, and data flow management, has highlighted a renewed critical role for CPUs. These are tasks GPUs are not designed to handle. Intel's recent launch of the Xeon 6+ processor, built on its Intel 18A process and featuring up to 288 efficiency cores (E-cores), exemplifies this strategic pivot. It is positioned not as a mere companion to GPUs but as the essential "control plane" for AI infrastructure, optimized for high-density, energy-efficient, and high-throughput workloads characteristic of AI agents and inference. This "CPU resurgence" is not about CPUs outperforming GPUs in raw computation. It reflects a systemic bottleneck: as AI scales from training single models to deploying countless intelligent agents, the demand for coordination and data handling surges. Major cloud providers are also developing their own high-density ARM-based server CPUs for similar workloads. However, Intel's success with this strategy faces significant challenges. Competition includes NVIDIA's integrated CPU-GPU solutions, the expanding adoption of cloud vendors' in-house ARM CPUs, and the crucial market test of Intel's 18A manufacturing process against rivals like TSMC's N2. In conclusion, CPUs are indeed reclaiming a central, though redefined, role in AI compute—managing the complex orchestration that enables massive-scale AI deployment. While the trend is clear, which company will ultimately lead this CPU resurgence remains an open question to be decided in the data centers of 2027 and beyond.

marsbitHá 4m

CPU, Quietly Returning to the Center of the AI Computing Power Stage

marsbitHá 4m

After Collaborating with 35+ DeFi Projects, Pink Brains Discovers the New 2026 KOL Marketing Rules

After collaborating with over 35 leading DeFi projects on marketing over three years, Pink Brains identifies a key shift for effective marketing in 2026: prioritizing the user journey over traditional campaign tactics. The most effective marketing mirrors how users actually behave—starting with discovery on social platforms like X (formerly Twitter), followed by data-driven verification on sites like DefiLlama, and finally, participation with small test funds. Success hinges on genuine, verifiable mechanisms, not just marketing hype. Current user interest centers on several key themes: new DeFi trends (RWA, perps, crypto x AI), meaningful airdrops requiring real contributions, real yield from protocol revenue, and tokens with value capture mechanisms directly tied to product usage. Case studies like Hyperliquid's HYPE (with its aggressive buyback program) and Venice's VVV (linking demand to AI compute) exemplify how strong tokenomics foster user retention. New trading venues like prediction markets, collectibles platforms, and GambleFi are also gaining traction, driven by verifiable activity. The article outlines common mistakes in DeFi KOL marketing, such as using creators unfamiliar with the product, generic messaging, or relying on a few top-tier KOLs. Instead, effective strategies align with different KOL types—educators, content creators, airdrop hunters, and niche experts—for various stages of the user journey. Ultimately, long-term user retention depends on a combination of a genuinely useful product, responsive support, community-aligned tokenomics, and strategic community building. The core takeaway is that sustainable growth stems from products whose value is validated by data and real-world utility, not just promotional efforts.

marsbitHá 23m

After Collaborating with 35+ DeFi Projects, Pink Brains Discovers the New 2026 KOL Marketing Rules

marsbitHá 23m

Trading

Spot
Futuros

Artigos em Destaque

O que é $S$

Compreender o SPERO: Uma Visão Abrangente Introdução ao SPERO À medida que o panorama da inovação continua a evoluir, o surgimento de tecnologias web3 e projetos de criptomoeda desempenha um papel fundamental na formação do futuro digital. Um projeto que tem atraído atenção neste campo dinâmico é o SPERO, denotado como SPERO,$$s$. Este artigo tem como objetivo reunir e apresentar informações detalhadas sobre o SPERO, para ajudar entusiastas e investidores a compreender as suas bases, objetivos e inovações nos domínios web3 e cripto. O que é o SPERO,$$s$? O SPERO,$$s$ é um projeto único dentro do espaço cripto que procura aproveitar os princípios da descentralização e da tecnologia blockchain para criar um ecossistema que promove o envolvimento, a utilidade e a inclusão financeira. O projeto é concebido para facilitar interações peer-to-peer de novas maneiras, proporcionando aos utilizadores soluções e serviços financeiros inovadores. No seu núcleo, o SPERO,$$s$ visa capacitar indivíduos ao fornecer ferramentas e plataformas que melhoram a experiência do utilizador no espaço das criptomoedas. Isso inclui a possibilidade de métodos de transação mais flexíveis, a promoção de iniciativas impulsionadas pela comunidade e a criação de caminhos para oportunidades financeiras através de aplicações descentralizadas (dApps). A visão subjacente do SPERO,$$s$ gira em torno da inclusão, visando fechar lacunas dentro das finanças tradicionais enquanto aproveita os benefícios da tecnologia blockchain. Quem é o Criador do SPERO,$$s$? A identidade do criador do SPERO,$$s$ permanece algo obscura, uma vez que existem recursos publicamente disponíveis limitados que fornecem informações detalhadas sobre o(s) seu(s) fundador(es). Esta falta de transparência pode resultar do compromisso do projeto com a descentralização—uma ética que muitos projetos web3 partilham, priorizando contribuições coletivas em vez de reconhecimento individual. Ao centrar as discussões em torno da comunidade e dos seus objetivos coletivos, o SPERO,$$s$ incorpora a essência do empoderamento sem destacar indivíduos específicos. Assim, compreender a ética e a missão do SPERO é mais importante do que identificar um criador singular. Quem são os Investidores do SPERO,$$s$? O SPERO,$$s$ é apoiado por uma diversidade de investidores que vão desde capitalistas de risco a investidores-anjo dedicados a promover a inovação no setor cripto. O foco desses investidores geralmente alinha-se com a missão do SPERO—priorizando projetos que prometem avanço tecnológico social, inclusão financeira e governança descentralizada. Essas fundações de investidores estão tipicamente interessadas em projetos que não apenas oferecem produtos inovadores, mas que também contribuem positivamente para a comunidade blockchain e os seus ecossistemas. O apoio desses investidores reforça o SPERO,$$s$ como um concorrente notável no domínio em rápida evolução dos projetos cripto. Como Funciona o SPERO,$$s$? O SPERO,$$s$ emprega uma estrutura multifacetada que o distingue de projetos de criptomoeda convencionais. Aqui estão algumas das características-chave que sublinham a sua singularidade e inovação: Governança Descentralizada: O SPERO,$$s$ integra modelos de governança descentralizada, capacitando os utilizadores a participar ativamente nos processos de tomada de decisão sobre o futuro do projeto. Esta abordagem promove um sentido de propriedade e responsabilidade entre os membros da comunidade. Utilidade do Token: O SPERO,$$s$ utiliza o seu próprio token de criptomoeda, concebido para servir várias funções dentro do ecossistema. Esses tokens permitem transações, recompensas e a facilitação de serviços oferecidos na plataforma, melhorando o envolvimento e a utilidade gerais. Arquitetura em Camadas: A arquitetura técnica do SPERO,$$s$ suporta modularidade e escalabilidade, permitindo a integração contínua de funcionalidades e aplicações adicionais à medida que o projeto evolui. Esta adaptabilidade é fundamental para manter a relevância no panorama cripto em constante mudança. Envolvimento da Comunidade: O projeto enfatiza iniciativas impulsionadas pela comunidade, empregando mecanismos que incentivam a colaboração e o feedback. Ao nutrir uma comunidade forte, o SPERO,$$s$ pode melhor atender às necessidades dos utilizadores e adaptar-se às tendências do mercado. Foco na Inclusão: Ao oferecer taxas de transação baixas e interfaces amigáveis, o SPERO,$$s$ visa atrair uma base de utilizadores diversificada, incluindo indivíduos que anteriormente podem não ter participado no espaço cripto. Este compromisso com a inclusão alinha-se com a sua missão abrangente de empoderamento através da acessibilidade. Cronologia do SPERO,$$s$ Compreender a história de um projeto fornece insights cruciais sobre a sua trajetória de desenvolvimento e marcos. Abaixo está uma cronologia sugerida que mapeia eventos significativos na evolução do SPERO,$$s$: Fase de Conceituação e Ideação: As ideias iniciais que formam a base do SPERO,$$s$ foram concebidas, alinhando-se de perto com os princípios de descentralização e foco na comunidade dentro da indústria blockchain. Lançamento do Whitepaper do Projeto: Após a fase conceitual, um whitepaper abrangente detalhando a visão, os objetivos e a infraestrutura tecnológica do SPERO,$$s$ foi lançado para atrair o interesse e o feedback da comunidade. Construção da Comunidade e Primeiros Envolvimentos: Esforços ativos de divulgação foram feitos para construir uma comunidade de primeiros adotantes e investidores potenciais, facilitando discussões em torno dos objetivos do projeto e angariando apoio. Evento de Geração de Tokens: O SPERO,$$s$ realizou um evento de geração de tokens (TGE) para distribuir os seus tokens nativos a apoiantes iniciais e estabelecer liquidez inicial dentro do ecossistema. Lançamento da dApp Inicial: A primeira aplicação descentralizada (dApp) associada ao SPERO,$$s$ foi lançada, permitindo que os utilizadores interagissem com as funcionalidades principais da plataforma. Desenvolvimento Contínuo e Parcerias: Atualizações e melhorias contínuas nas ofertas do projeto, incluindo parcerias estratégicas com outros players no espaço blockchain, moldaram o SPERO,$$s$ em um jogador competitivo e em evolução no mercado cripto. Conclusão O SPERO,$$s$ é um testemunho do potencial do web3 e das criptomoedas para revolucionar os sistemas financeiros e capacitar indivíduos. Com um compromisso com a governança descentralizada, o envolvimento da comunidade e funcionalidades inovadoras, abre caminho para um panorama financeiro mais inclusivo. Como em qualquer investimento no espaço cripto em rápida evolução, potenciais investidores e utilizadores são incentivados a pesquisar minuciosamente e a envolver-se de forma ponderada com os desenvolvimentos em curso dentro do SPERO,$$s$. O projeto demonstra o espírito inovador da indústria cripto, convidando a uma exploração mais aprofundada das suas inúmeras possibilidades. Embora a jornada do SPERO,$$s$ ainda esteja a desenrolar-se, os seus princípios fundamentais podem, de facto, influenciar o futuro de como interagimos com a tecnologia, as finanças e uns com os outros em ecossistemas digitais interconectados.

69 Visualizações TotaisPublicado em {updateTime}Atualizado em 2024.12.17

O que é $S$

O que é AGENT S

Agent S: O Futuro da Interação Autónoma no Web3 Introdução No panorama em constante evolução do Web3 e das criptomoedas, as inovações estão constantemente a redefinir a forma como os indivíduos interagem com plataformas digitais. Um projeto pioneiro, o Agent S, promete revolucionar a interação humano-computador através do seu framework aberto e agente. Ao abrir caminho para interações autónomas, o Agent S visa simplificar tarefas complexas, oferecendo aplicações transformadoras em inteligência artificial (IA). Esta exploração detalhada irá aprofundar-se nas complexidades do projeto, nas suas características únicas e nas implicações para o domínio das criptomoedas. O que é o Agent S? O Agent S é um framework aberto e agente, especificamente concebido para abordar três desafios fundamentais na automação de tarefas computacionais: Aquisição de Conhecimento Específico de Domínio: O framework aprende inteligentemente a partir de várias fontes de conhecimento externas e experiências internas. Esta abordagem dupla capacita-o a construir um rico repositório de conhecimento específico de domínio, melhorando o seu desempenho na execução de tarefas. Planeamento ao Longo de Longos Horizontes de Tarefas: O Agent S emprega planeamento hierárquico aumentado por experiência, uma abordagem estratégica que facilita a decomposição e execução eficientes de tarefas intrincadas. Esta característica melhora significativamente a sua capacidade de gerir múltiplas subtarefas de forma eficiente e eficaz. Gestão de Interfaces Dinâmicas e Não Uniformes: O projeto introduz a Interface Agente-Computador (ACI), uma solução inovadora que melhora a interação entre agentes e utilizadores. Utilizando Modelos de Linguagem Multimodais de Grande Escala (MLLMs), o Agent S pode navegar e manipular diversas interfaces gráficas de utilizador de forma fluida. Através destas características pioneiras, o Agent S fornece um framework robusto que aborda as complexidades envolvidas na automação da interação humana com máquinas, preparando o terreno para uma infinidade de aplicações em IA e além. Quem é o Criador do Agent S? Embora o conceito de Agent S seja fundamentalmente inovador, informações específicas sobre o seu criador permanecem elusivas. O criador é atualmente desconhecido, o que destaca ou o estágio nascente do projeto ou a escolha estratégica de manter os membros fundadores em anonimato. Independentemente da anonimidade, o foco permanece nas capacidades e no potencial do framework. Quem são os Investidores do Agent S? Como o Agent S é relativamente novo no ecossistema criptográfico, informações detalhadas sobre os seus investidores e financiadores não estão explicitamente documentadas. A falta de informações disponíveis publicamente sobre as fundações de investimento ou organizações que apoiam o projeto levanta questões sobre a sua estrutura de financiamento e roteiro de desenvolvimento. Compreender o apoio é crucial para avaliar a sustentabilidade do projeto e o seu impacto potencial no mercado. Como Funciona o Agent S? No núcleo do Agent S reside uma tecnologia de ponta que lhe permite funcionar eficazmente em diversos ambientes. O seu modelo operacional é construído em torno de várias características-chave: Interação Humano-Computador Semelhante: O framework oferece planeamento avançado em IA, esforçando-se para tornar as interações com computadores mais intuitivas. Ao imitar o comportamento humano na execução de tarefas, promete elevar as experiências dos utilizadores. Memória Narrativa: Utilizada para aproveitar experiências de alto nível, o Agent S utiliza memória narrativa para acompanhar os históricos de tarefas, melhorando assim os seus processos de tomada de decisão. Memória Episódica: Esta característica fornece aos utilizadores orientações passo a passo, permitindo que o framework ofereça suporte contextual à medida que as tarefas se desenrolam. Suporte para OpenACI: Com a capacidade de funcionar localmente, o Agent S permite que os utilizadores mantenham o controlo sobre as suas interações e fluxos de trabalho, alinhando-se com a ética descentralizada do Web3. Fácil Integração com APIs Externas: A sua versatilidade e compatibilidade com várias plataformas de IA garantem que o Agent S possa integrar-se perfeitamente em ecossistemas tecnológicos existentes, tornando-o uma escolha apelativa para desenvolvedores e organizações. Estas funcionalidades contribuem coletivamente para a posição única do Agent S no espaço cripto, à medida que automatiza tarefas complexas e em múltiplos passos com mínima intervenção humana. À medida que o projeto evolui, as suas potenciais aplicações no Web3 podem redefinir a forma como as interações digitais se desenrolam. Cronologia do Agent S O desenvolvimento e os marcos do Agent S podem ser encapsulados numa cronologia que destaca os seus eventos significativos: 27 de Setembro de 2024: O conceito de Agent S foi lançado num artigo de pesquisa abrangente intitulado “Um Framework Agente Aberto que Usa Computadores como um Humano”, mostrando a base para o projeto. 10 de Outubro de 2024: O artigo de pesquisa foi disponibilizado publicamente no arXiv, oferecendo uma exploração aprofundada do framework e da sua avaliação de desempenho com base no benchmark OSWorld. 12 de Outubro de 2024: Uma apresentação em vídeo foi lançada, proporcionando uma visão visual das capacidades e características do Agent S, envolvendo ainda mais potenciais utilizadores e investidores. Estes marcos na cronologia não apenas ilustram o progresso do Agent S, mas também indicam o seu compromisso com a transparência e o envolvimento da comunidade. Pontos-Chave Sobre o Agent S À medida que o framework Agent S continua a evoluir, várias características-chave destacam-se, sublinhando a sua natureza inovadora e potencial: Framework Inovador: Concebido para proporcionar um uso intuitivo de computadores semelhante à interação humana, o Agent S traz uma abordagem nova à automação de tarefas. Interação Autónoma: A capacidade de interagir autonomamente com computadores através de GUI significa um avanço em direção a soluções computacionais mais inteligentes e eficientes. Automação de Tarefas Complexas: Com a sua metodologia robusta, pode automatizar tarefas complexas e em múltiplos passos, tornando os processos mais rápidos e menos propensos a erros. Melhoria Contínua: Os mecanismos de aprendizagem permitem que o Agent S melhore a partir de experiências passadas, aprimorando continuamente o seu desempenho e eficácia. Versatilidade: A sua adaptabilidade em diferentes ambientes operacionais, como OSWorld e WindowsAgentArena, garante que pode servir uma ampla gama de aplicações. À medida que o Agent S se posiciona no panorama do Web3 e das criptomoedas, o seu potencial para melhorar as capacidades de interação e automatizar processos significa um avanço significativo nas tecnologias de IA. Através do seu framework inovador, o Agent S exemplifica o futuro das interações digitais, prometendo uma experiência mais fluida e eficiente para os utilizadores em diversas indústrias. Conclusão O Agent S representa um ousado avanço na união da IA e do Web3, com a capacidade de redefinir a forma como interagimos com a tecnologia. Embora ainda esteja nas suas fases iniciais, as possibilidades para a sua aplicação são vastas e cativantes. Através do seu framework abrangente que aborda desafios críticos, o Agent S visa trazer interações autónomas para o primeiro plano da experiência digital. À medida que avançamos mais profundamente nos domínios das criptomoedas e da descentralização, projetos como o Agent S desempenharão, sem dúvida, um papel crucial na formação do futuro da tecnologia e da colaboração humano-computador.

653 Visualizações TotaisPublicado em {updateTime}Atualizado em 2025.01.14

O que é AGENT S

Como comprar S

Bem-vindo à HTX.com!Tornámos a compra de Sonic (S) simples e conveniente.Segue o nosso guia passo a passo para iniciar a tua jornada no mundo das criptos.Passo 1: cria a tua conta HTXUtiliza o teu e-mail ou número de telefone para te inscreveres numa conta gratuita na HTX.Desfruta de um processo de inscrição sem complicações e desbloqueia todas as funcionalidades.Obter a minha contaPasso 2: vai para Comprar Cripto e escolhe o teu método de pagamentoCartão de crédito/débito: usa o teu visa ou mastercard para comprar Sonic (S) instantaneamente.Saldo: usa os fundos da tua conta HTX para transacionar sem problemas.Terceiros: adicionamos métodos de pagamento populares, como Google Pay e Apple Pay, para aumentar a conveniência.P2P: transaciona diretamente com outros utilizadores na HTX.Mercado de balcão (OTC): oferecemos serviços personalizados e taxas de câmbio competitivas para os traders.Passo 3: armazena teu Sonic (S)Depois de comprar o teu Sonic (S), armazena-o na tua conta HTX.Alternativamente, podes enviá-lo para outro lugar através de transferência blockchain ou usá-lo para transacionar outras criptomoedas.Passo 4: transaciona Sonic (S)Transaciona facilmente Sonic (S) no mercado à vista da HTX.Acede simplesmente à tua conta, seleciona o teu par de trading, executa as tuas transações e monitoriza em tempo real.Oferecemos uma experiência de fácil utilização tanto para principiantes como para traders experientes.

1.2k Visualizações TotaisPublicado em {updateTime}Atualizado em 2026.06.02

Como comprar S

Discussões

Bem-vindo à Comunidade HTX. Aqui, pode manter-se informado sobre os mais recentes desenvolvimentos da plataforma e obter acesso a análises profissionais de mercado. As opiniões dos utilizadores sobre o preço de S (S) são apresentadas abaixo.

活动图片