In March 2026, AMI Labs, co-founded by Turing Award winner and former Meta Chief AI Scientist Yann LeCun, announced the completion of a $1.03 billion seed funding round.
Almost simultaneously:
- World Labs, founded by Fei-Fei Li, completed a new funding round of approximately $1 billion
- Google DeepMind released the Genie 3 world model
- Tesla continued to advance the deployment of its Optimus humanoid robots in factories
These events did not occur in isolation but collectively point to a clearer trend: AI is moving from 'understanding the digital world' to 'understanding and acting upon the physical world.'
If 2024 was the expansion period for large language models, and 2025 was the exploration period for Agent落地 (Agent implementation), then in 2026, the core narrative in Silicon Valley is shifting to a more fundamental question: Can AI truly understand 'how the world works' and complete tasks in reality?
This is not just a change in technical direction; it also means the industrial value chain is being rewritten. Over the past two years, the main battlefield of AI competition has been concentrated in a few high-barrier areas like models, computing power, and data centers. But when AI truly enters the physical world, competition will no longer occur only at the model layer; it will simultaneously expand to hardware本体 (hardware bodies), system integration, data collection, simulation environments, supply chain coordination, and real-world scenario implementation. In other words, Physical AI brings not a single-point breakthrough, but a reconstruction of an entire infrastructure system.
Precisely because of this, this round of change, for the Chinese-speaking world, especially Chinese entrepreneurs, engineers, and investors, might not just be a new wave of technological enthusiasm, but a rare structural window of opportunity. Unlike the previous competition dominated mainly by large model training resources and super capital, Physical AI inherently relies more on复合能力 (composite capabilities): one must understand algorithms and also engineering; one must be capable of system coordination and also deeply enter manufacturing, supply chains, and industrial scenarios. Teams that possess both technical depth, hardware coordination capabilities, and Sino-US industrial vision反而更有机会 (instead have more opportunity) to occupy key positions in this new cycle.
In other words, Physical AI is not just a new story Silicon Valley is telling; it might also be the most important entry ticket for Chinese talent in the next round of global technological infrastructure transformation.
01 The Century-Long Debate Between Two Paths: The LLM Camp vs. The World Model Camp
Over the past three years, large language models (LLMs) have almost dominated the development path of AI, with their core paradigm being next-token prediction based on massive text data. But the boundaries of this paradigm are gradually becoming apparent: it can 'describe' the physical world but lacks executable understanding; it lacks modeling capabilities for causality and physical constraints; and it performs limitedly in continuous decision-making and long-term tasks.
Therefore, a faction represented by Yann LeCun began promoting another path: World Model—predicting 'state,' not 'text.' The core difference between the two is that LLMs take text as the learning object and language as the output form, essentially停留在 (remaining at) 'cognition and expression'; whereas world models take the state of the physical world as the modeling object, directly pointing to the ability闭环 (closed loop) of 'perception-decision-execution.'
This is not just LeCun's judgment. In Q1 2026, the world model direction almost simultaneously welcomed several key advancements: AMI Labs, with JEPA as its core architecture, clearly bet on a long-term路线 (route) of 'research first, product later'; World Labs切入 (cut into) 'spatial intelligence,' attempting to make AI truly understand relationships, occlusions, and physical constraints in the three-dimensional world; Google DeepMind, through Genie 3, promoted dynamically generated environments for real-time interaction and used them for agent training.
The three companies have different paths, but they point to the same trend: The next leap in AI is not just about generating better text, but about modeling the world more accurately and completing actions within it.
02 The Hardware War: Who is Building the 'Body'?
The world model solves the 'brain' problem—how AI understands the physical world. But the other half of the Physical AI battlefield is equally fierce: who will build the 'body'?
By 2026, the humanoid robot track has moved completely from 'lab demo' to 'factory mass production' stage. A few key numbers:
Tesla Optimus Gen 3: Over 1000 units have been deployed at Gigafactory Texas and Fremont factory, performing parts handling and assembly tasks. This is the largest deployment of humanoid robots in a factory in human history. Tesla is building a dedicated factory at Giga Texas with an annual capacity of 10 million units, aiming to reduce the cost per unit to $20,000—two years ago, the industry average was still $50,000-$250,000.
Boston Dynamics Atlas: The product version Atlas at CES 2026, 6.2 feet tall, 56 degrees of freedom, can lift 110 lbs. More noteworthy is its 'soul'—Boston Dynamics announced a collaboration with Google DeepMind to integrate cutting-edge foundation models into Atlas. The 2026 annual production capacity has been预定了 (booked) by Hyundai and Google DeepMind, and a factory with 30,000 units/year capacity is being planned.
Figure 03: Figure AI raised $1 billion at a $39 billion valuation (2025). Its Figure 02, during an 11-month trial run at the BMW Spartanburg factory, participated in the production of over 30,000 BMW X3s, moved over 90,000 parts, and累计运行 (accumulated operation) 1250 hours. Figure 03 is a comprehensive upgrade based on this, equipped with 48+ degrees of freedom and a proprietary Helix AI platform.
Mind Robotics: Just announced a $500 million funding round in March, focusing on industrial-scale AI robot deployment.
But in this hardware race, an underestimated环节 (link) is emerging: Dexterous Hand.
The legs of humanoid robots solve the mobility problem, the torso solves the承载问题 (load-bearing problem), but what truly determines whether a robot can work in a complex environment is the hand. Taking Tesla Optimus as an example, the hand cost accounts for 17% of the整机 (whole machine), about $9,500—it is the most expensive single component.
The difficulty of the dexterous hand lies in a fundamental contradiction: the finger space is too small to fit large motors; small motors have insufficient torque, requiring high reduction ratio gearboxes to amplify force; and high reduction ratio gearboxes bring inertial distortion, loss of force feedback, and mechanical wear—these three problems will 'poison' the AI's learning process from the physical level.
A batch of new companies is trying to break through this bottleneck. Some use axial flux motor architecture to compress the reduction ratio from 288:1 to 15:1, achieving a fully reversibly drivable dexterous hand; others, through同步设计 (synchronous design) data collection gloves, allow human operation data to be migrated to robot hardware with zero loss. These seemingly small hardware innovations might be one of the most critical infrastructures in the entire Physical AI ecosystem.
03 NVIDIA: The 'Shovel Seller' in the Physical AI Era
Every technological wave has a 'shovel seller.'
In the era of large models, NVIDIA became the biggest beneficiary凭借 (relying on) GPUs and the CUDA ecosystem;而在 (and in) the Physical AI era, its role is being further upgraded—not just providing computing power, but attempting to build an entire infrastructure for the robotics era.
At the GTC conference in March 2026, NVIDIA released a full suite of platform capabilities围绕 (surrounding) Physical AI: including the visual-language-action model Isaac GR00T for humanoid robots, the Cosmos series for generating large-scale synthetic data, and toolchains covering training, evaluation, and deployment (like Isaac Lab and OSMO). These capabilities are not single-point tools but are gradually forming a complete development and operation system.
Including Boston Dynamics, Caterpillar, Franka Robotics, LG, NEURA Robotics, and other robot companies, are already building next-generation systems on the NVIDIA platform.
Its strategy is also very clear:
Do not directly participate in终端产品 (end products), but become the underlying standard for the entire industry.
If Physical AI is a city under construction, then NVIDIA is simultaneously providing the cement, steel bars, and power grid.
04 Data: The Most Scarce 'Oil' of Physical AI
In the world of large language models, the internet provides almost unlimited text data. But in Physical AI, a more fundamental question emerges:
Manipulation data from the real world is extremely scarce.
This makes data one of the most critical and scarce resources in the entire industry chain.
Currently, the industry is mainly exploring three paths.
Real Data Route. Represented by Physical Intelligence, its π0 model is trained on over 10,000 hours of real robot operation data, covering multiple robot形态 (morphologies) and task types, capable of completing complex operations (like folding clothes, assembling cardboard boxes, etc.). Its open-source behavior essentially provides the industry with a 'manipulation pre-training base.'
Synthetic Data Route. Google DeepMind's Genie 3 and NVIDIA's Cosmos attempt to generate大量 (a large number of) simulated environments through world models, complete training in the virtual world, and then migrate to the real world. The core challenge of this path is the sim-to-real gap, but as simulation accuracy improves, this gap is gradually narrowing.
Human Teleoperation Route. Through devices like data collection gloves, human operations are directly mapped to the robot system. This method has the highest data quality but still has limitations in cost and scalability.
Tesla is trying a hybrid path: continuously collecting human operation behaviors through factory videos and using them to train Optimus's motion capabilities.
Long-term, the competitive landscape of Physical AI will likely not depend on whose model is optimal, but on who possesses the most and highest-quality physical world interaction data. Once the data flywheel starts turning, its barriers will增强 exponentially (strengthen exponentially).
05┃ What Money is Saying: A Full Picture of Physical AI Financing in 2026 Q1
Numbers don't lie. Here are the key financing events in the Physical AI field in Q1 2026:
【World Model Layer】
· AMI Labs (LeCun) — $1.03B Seed Round, Valuation $3.5B
· World Labs (Fei-Fei Li) — $1B New Round, Autodesk invested $200M
【Foundation Model Layer】
· Physical Intelligence — Negotiating a $1B new round, valuation will exceed $11B
· RLWRLD — $41M Seed Round Extension
【Humanoid Robot整机 (Whole Machine)】
· Figure AI — Previously raised $1B at $39B valuation (2025)
· Mind Robotics — $500M, industrial-scale deployment
· Galaxea — $434M, Series B Unicorn
· Humanoid — $290M Seed Round,直接独角兽 (direct unicorn)
· Generative Bionics — €70M Seed Round
【Infrastructure & Tools】
· NVIDIA — Continued investment in Isaac GR00T / Cosmos platform
· RoboForce — $52M, Physical AI labor platform
Just the above公开数据 (public data) for Q1 already exceeds $6.4 billion. And this does not include the internal investments of major players like Tesla, Hyundai/Boston Dynamics, Google DeepMind, etc.
The flow of capital说明一件事 (illustrates one thing): Physical AI has moved past the 'proof of concept' stage and entered the 'infrastructure construction' stage. Investors are no longer asking 'can robots be used,' but 'whose infrastructure can scale robots the fastest.'
06 Cold Thinking: Bubble or Inflection Point?
Of course, Silicon Valley is never short of bubbles. Faced with the狂热 (enthusiasm) for Physical AI, a few冷静的问题 (calm questions) are worth considering:
Demo ≠ Deployment. As industry insiders共识 (consensus) at Davos 2026: the gap between a spectacular demo and a system that can run 10,000 times consecutively without error is much larger than the宣传暗示 (publicity implies). Figure 02 did participate in the production of 30,000 cars at the BMW factory, but it performed relatively standardized parts handling, not dexterous assembly.
Sim-to-real is still a hard nut to crack. The fidelity of world models is improving, but the long-tail complexity of the physical world—lighting changes, material differences, unexpected collisions—remains the biggest challenge for the synthetic data route.
Business models have not yet been proven. LeCun himself said AMI Labs will only do research in its first year. World Labs is trying a free + paid model. Physical Intelligence open-sourced its core model. Currently, these companies have almost zero revenue; capital is betting on paradigm垄断 (monopoly) in 3-5 years.
The gray rhino of safety and regulation. When thousands of robots with autonomous decision-making capabilities enter factories and even homes, who is responsible for accidents? The global regulatory framework for Physical AI is almost空白 (a blank slate).
But precisely these problems indicate that we are in the early stages of a technological inflection point, not the top of a bubble. Every true paradigm shift—the internet, smartphones, cloud computing—was accompanied in its early stages by a phase where 'Demo was far better than the product.' The key difference is: is the underlying technology truly advancing, or is it just the PPT that's improving?
From LeCun's JEPA architecture, to Genie 3's real-time world generation, to π0's 68-task generalization capability, to Optimus's factory deployment of 1000 units—the progress in Q1 2026 is real engineering breakthroughs, not castles in the air.
07 Physical AI is Not an Independent Track; It is the Final Form of AI.
Physical AI is not a new track; it is more like one of the endgame forms of AI.
When AI moves from 'understanding the world' to 'entering the world,' what is truly being rewritten is not just the boundary of model capabilities, but also the way industrial division of labor and value distribution occur. Future competition will not only happen in model parameters and computing clusters, but also in robot本体 (bodies), dexterous hands, data collection, simulation systems, industrial scenarios, and supply chain organizational capabilities.
This is also why this round is particularly important for Chinese talent.
Because one of the deepest accumulations of Chinese talent over the past two decades has never been a single-dimensional technical label, but the ability to truly串起来 (string together) cutting-edge technology, engineering execution, hardware manufacturing, and cross-regional industrial coordination. Whether entrepreneurs, engineers, investors, or industrial resource organizers, as long as they can grasp this migration from digital intelligence to physical intelligence, they have the opportunity not only to participate in the trend but to become part of the trend itself at some key layers.
In 2026, Physical AI may still be far from mature; but precisely because it is still early, the window is just opening. For Chinese talent, this might not be another cycle of 'following participation,' but a new starting point with more opportunity to deeply切入 (cut into) the infrastructure layer, platform layer, and key component layer.
This article is from the WeChat public account "硅兔君" (ID: gh_1faae33d0655), author: 硅兔君 (Silicon Rabbit Jun)
















