From a Lunch Table to an Infinite Universe: Fei-Fei Li Bets on AI's Next Dimension

marsbitPublicado a 2026-05-27Actualizado a 2026-05-27

Resumen

From a Lunch Table Conversation to an Infinite Universe: Fei-Fei Li Bets on AI's Next Frontier - Spatial Intelligence In an era dominated by large language models, AI pioneer Fei-Fei Li argues that true understanding requires spatial intelligence — the ability to perceive, reason, and interact within the physical 3D/4D world. She points to evolutionary history: spatial perception drove the Cambrian explosion 540 million years ago, while language is a far more recent, inherently "lossy" way to encode reality. Current models struggle with basic spatial tasks a child can do, like counting chairs in a video. Her company, World Labs, is pioneering this shift with "Marble," a model that generates navigable, consistent 3D worlds from text, images, or simple 3D inputs—distinct from video generators like Sora. Though smaller than models like GPT-5, due to scarce 3D data and early-stage scaling laws, Marble is already used in gaming, robot training (by NVIDIA), architectural design, and personalized therapy for conditions like OCD and acrophobia. Li envisions this technology enabling "infinite universes" for creativity, social interaction, and more. However, she cautions against utopian or dystopian extremes, advocating for a measured vision where AI enhances human dignity and prosperity, akin to how electricity transformed civilization. The journey is long — as evidenced by the 20-year path to viable autonomous vehicles — but the direction is clear: for AI to move from merely talki...

5 million years – that's the evolutionary age of human language. 540 million years – that's the starting point of the Cambrian explosion sparked by vision and spatial perception.

In 2025 and 2026, when nearly every top Silicon Valley lab was fiercely competing on language models, Professor Fei-Fei Li of Stanford University and founder of World Labs repeatedly raised a question that forced the industry to look up: If AI can only talk and look at pictures, it will never truly "understand" this world.

In three key interviews – the a16z Podcast in June 2025, the Cisco AI Summit in February 2026, and the in-depth 1-hour 19-minute Lenny's Podcast conversation released on May 22, 2026 – she systematically elaborated on a judgment that is being rapidly validated: Spatial Intelligence is the next frontier of AI.

Her statements in the a16z dialogue about "creating infinite universes" and "living in a multiverse," along with her views in Lenny's Podcast that "world models are the next frontier" and "AGI is more of a marketing term," have recently been widely reposted again on X.

"We Are Missing a World Model"

According to a16z partner Martin Casado, during a lunch meeting in Silicon Valley, a table full of AI practitioners was excitedly discussing large language models. Sitting at the other end of the table, Fei-Fei Li suddenly turned and asked him:

"Do you know what we are missing? We are missing a world model."

Casado, an early investor in World Labs and a long-time friend of Li's from her Stanford days, recalled that moment: "Everything clicked." He had independently reached a similar conclusion coming out of extensive investment in the image field: language is not the end of the story.

But Li's thinking on this issue goes back much further than most.

In April 2024, she gave a 15-minute TED Talk, using evolution as her starting point: The appearance of trilobites 540 million years ago allowed life to "see" the world for the first time. The birth of vision ignited an evolutionary race of intelligence, the nervous system began to develop, animals became active, and intelligence emerged. Language is merely a very recent product of this long race.

This judgment was repeatedly reinforced in the three interviews. At the Cisco AI Summit, her statement was more direct:

"Language's history is only about 500,000 years old. But 1.5 billion years ago, animals began to perceive light and touch their environment. The ability to understand, reason, interact, and navigate in the real 3D, 4D physical world is fundamental, as important as linguistic intelligence."

Li is not negating the value of linguistic intelligence. Her core argument is: Language is essentially a "lossy" way of encoding the world.

In the a16z interview, Casado conducted a thought experiment: Blindfold yourself, describe a room using language, then try to complete a task – your chance of success is extremely low. Because language's description of reality is always rough. Remove the blindfold, your brain instantly reconstructs the 3D space, and you can operate, touch, and move.

Li supplemented with a more extreme example from scientific history: Rosalind Franklin's X-ray diffraction photo of DNA was a flat, two-dimensional image, showing a pattern that looked like a cross with diffraction. But Watson and Crick reasoned from that two-dimensional photo to deduce the three-dimensional double-helix structure of DNA. "That structure cannot be two-dimensional. You cannot deduce that structure with two-dimensional thinking."

"If you observe human intelligence, much of it is beyond the scope of language. Language is a lossy way of capturing the world. Pure generative 'language' does not exist in nature; we look around, there are no ready-made sentences or words, yet the entire physical, perceptual, visual world exists."

This is a perspective easily overlooked: most capabilities of current large models are built on a format of information compression that is inherently lossy. In Lenny's Podcast, she used a more mundane test to puncture this illusion:

"Today, you take a model, give it a video clip showing a few office rooms, and ask the model to count the number of chairs. This is something a toddler can do, but AI cannot."

Not to mention deducing physical laws from celestial motion: "Let's give AI all the data, including modern instrument data that Newton didn't have, and ask it to create a set of 17th-century equations about the laws of object motion. Today's AI cannot do that."

Marble: Orders of Magnitude Smaller Than GPT-5

Pushing this judgment into a product is World Labs' first-generation model, Marble, released at the end of 2024.

At the Cisco AI Summit, Li detailed Marble's technical positioning: receiving text, images, video, or simple 3D inputs, and generating a "fully navigable, interactive, and permanently consistent 3D world." She specifically emphasized that this is fundamentally different from video generation models like Sora; environments generated by Marble possess geometric structure, not pixel animations that "look like" video.

In Lenny's Podcast, she used Plato's allegory of the cave for a deeper explanation: Prisoners are tied to chairs, only able to see two-dimensional shadows projected on the wall, but the real drama unfolds in the three-dimensional space behind them. Video models are those shadows, while spatial intelligence aims to create and reason about the real world behind those shadows.

A comparison: GPT-5's training compute is roughly on the order of 10^26 FLOPS, while Marble is several orders of magnitude smaller in scale. The reasons are two-fold: data acquisition difficulty is completely different (high-quality 3D physical data is extremely scarce), and this field is still in the early stages of the "scaling law upward curve."

In Lenny's Podcast, she further explained why robot learning cannot simply replicate the "bitter lesson" of language models. There is a famous assertion in AI: simple models with massive data will eventually surpass complex ones. But "language models have a perfect setup: the training data is words, and the output is also words." In robotics, "you want actions, but the training data lacks actions in the 3D world." This fundamental misalignment between training objectives and data form is the core challenge of robot learning.

World Labs employs a hybrid data strategy: internet-scale text, images, and video, plus simulation data, plus real-world captured data. Li admits, "We are still in the relatively early stages of exploring model architectures," but she expects "the next few years will be very exciting."

Right after, in February 2026, World Labs completed a $1 billion funding round, with participation from NVIDIA, AMD, a16z, valuing the company at around $5 billion, up from $1 billion a year earlier. In April, the team open-sourced the 3D Gaussian splatting rendering engine Spark 2.0, capable of real-time rendering of hundred-million-polygon 3D scenes in web browsers, shifting from a closed-source product to a dual-track strategy of "product + open-source ecosystem." The technical barrier for spatial intelligence is being rapidly lowered.

In Lenny's Podcast, Li also rarely revealed the hardships of entrepreneurship: "If I could whisper one thing to myself 18 months ago: 'The intensity of competition in this field, both technologically and for talent, far exceeds your imagination.'"

Infinite Universes and Multiverses

What really made that a16z interview go viral repeatedly on X was Li's statement about "infinite universes":

"In the entire history of human civilization, we have all lived together in one 3D world. Only a handful of people have been to the moon, but very few. And this technology makes digital virtual worlds incredibly rich. Suddenly, we can actually create infinite universes, some for robots, some for creativity, some for social interaction, some for travel, some for storytelling. Suddenly, we are able to live in a multiverse; the space for imagination is infinite."

Casado provided a more concrete technical explanation: from a single two-dimensional photo, the model can generate a complete 360-degree 3D representation, including the back of a table. You can manipulate, measure, stack—anything you can do in space can be achieved.

This is not science fiction. In the two interviews, Li listed applications where Marble is already being used:

• Game developers used early versions to create games

• A virtual production team collaborating with Sony reduced film production cycles by 40 times

• NVIDIA and multiple academic labs used Marble to train robots

• Architects and designers used it for interior design

• Clinical researchers created personalized immersive trigger environments for patients with OCD, acrophobia

• Someone used it to generate personalized yoga training spaces

The last application was particularly surprising. Li mentioned at the summit that OCD patients are triggered by very specific scenes, "for example, personally I am troubled by piles of dirty laundry, but everyone's trigger points are different." In Lenny's Podcast she added that after release, a friend called her overnight asking if Marble could be used to treat acrophobia. Building physical environments is extremely costly, while Marble only needs a prompt to generate various environments in minutes.

Plato's allegory of the cave is also the best entry point for understanding the 2D vs. 3D divergence.

Li used this allegory to explain: Prisoners tied to chairs can only see two-dimensional shadows projected on the wall. Current language models and video models are essentially those shadows, guessing 3D from 2D. The ambition of spatial intelligence is to create, reason about, and interact with the real world behind those shadows.

In terms of technical roadmap, she drew a clear boundary with a concise comparison:

"A car can be seen as a square robot moving on a two-dimensional plane, its goal is not to hit anything. A robot is a three-dimensional entity operating in a three-dimensional world; the goal of a general-purpose robot is to touch objects without breaking them. This is a higher-dimensional problem."

She also provided a timeline from personal experience: In 2006, she helped create the first self-driving car to travel 138 miles in the desert, predicting autonomous vehicles in 20 years. It wasn't until 2025 that Waymo began operating on city streets at scale.

"Seeing the North Star doesn't mean the journey will be short."

Casado added a more business-savvy observation in the a16z conversation: In the autonomous driving sector alone, the industry invested about $100 billion over 20 years to get where it is today. "Our original roadmap was to solve the world navigation problem first, but it turned out to be extremely difficult."

Li even shared a personal experience in the a16z interview to strengthen the point: About five years ago, she lost stereoscopic vision for several months due to a corneal injury. "Even though I knew very well how big my car was, roughly knew the size of my neighbor's parked car, and I had driven this road many years, I could not judge the distance between my car and the parked car very well. I could only drive at ten miles per hour to avoid scratching other cars."

A lifelong researcher of visual intelligence used her own firsthand struggle after losing depth perception to answer the question "why 3D is irreplaceable."

The Double-Edged Sword of Technology and the Measure of Civilization

Between technological optimism and doomsday rhetoric, Li chose a more restrained and actionable stance. She clearly expressed concern about polarized discourse at the Cisco AI Summit:

"The discussion online often tends to be black and white: either full-blown technological utopianism, ignoring that technology is a double-edged sword; or doomsday talk, as if human survival is at risk at any moment. For a technology so profound for human civilization, this way of discussion is irresponsible."

She didn't stop at criticism but offered a quantifiable anchor for value: electricity.

"If we rewind more than a hundred years, imagine how people then defined the success of electricity. I hope the vision then was: schools lit up, homes warm, machines empowered for industrialization, thereby extending human lifespans, allowing more children to be educated."

Then she applied this anchor to AI: "The definition of success should be that civilization becomes more beautiful, and civilization is composed of every individual pursuing happiness, prosperity, and dignity. That is the definition of success for AI and every technology."

At the end of Lenny's Podcast, she brought this concern down to specific people. She said wherever she goes, she is asked the same question: If I am a farmer, nurse, musician, will AI replace me? Her answer: "Ultimately, AI is about people. No technology should strip away human dignity. Human dignity and autonomy should be at the core of the development, deployment, and governance of every technology."

Looking back at the three interviews, a clear thread emerges.

Fei-Fei Li's thinking on spatial intelligence is not a rebellion against the wave of large models, but an extension built upon it. She saw the limits of language models earlier than most – what a lossy information compression format can do is ultimately limited. The problem spatial intelligence aims to solve is: evolving AI from "talking about the world" to "understanding the world," and ultimately to "acting in the world."

The World Labs team has about 30 people and has raised over $1 billion. Marble is the first-generation product, far smaller in scale than top language models. The scarcity of 3D data and the early state of model architectures determine this will not be a path achieved overnight. But Li said another thing in Lenny's Podcast, perhaps the best annotation for this patience:

"Our brains consume only about 20 watts, dimmer than any light bulb in the room, yet can do so much. The more I work in AI, the more I respect humans."

540 million years of evolution gave carbon-based life this 20-watt spatial intelligence. AI's evolution is being compressed to a few years.

Li did not give a timeline in the three interviews. She just repeatedly returned to that judgment extracted from evolution: perception precedes language, space precedes symbols. What is happening in Silicon Valley labs, Stanford labs, and World Labs offices is not a technological iteration, but an accelerated replay of evolution. (This article was first published on Titanium Media APP, author | Silicon Valley Tech News, editor | Zhao Hongyu)

Appendix: The text transcripts of the above three interviews are archived at 【ima Knowledge Base】 Fei-Fei Li Interviews https://ima.qq.com/wiki/?shareId=3f1d4b4c0d6cb2aeca250e2c5d068390e2d45895816ad607309820e25cb2e9c5

Preguntas relacionadas

QAccording to the article, what is the fundamental limitation of current large language models that Li Fei-Fei emphasizes?

AThey are built on a 'lossy' information compression format (language) that inherently fails to capture the full richness of the physical, 3D world. Language is a very recent evolutionary development and a poor representation of spatial understanding, which is foundational to intelligence.

QWhat is the core capability of World Labs' Marble model, and how does it fundamentally differ from video generation models like Sora?

AMarble takes text, images, video, or simple 3D inputs and generates a fully navigable, interactive, and persistent 3D world with geometric structure. It creates a true 3D environment, not just a 'video-like' sequence of pixels that looks 3D, as Sora does. Marble aims to create and reason about the real world behind the 'shadows' (2D projections).

QWhat major challenge in robotics learning does Li Fei-Fei highlight, contrasting it with the success of language models?

ARobotics faces a 'fundamental mismatch' between its training objective (actions in the 3D world) and its available data. Unlike language models where training data (words) perfectly matches the output (text), robotics lacks sufficient 'action' data from the real 3D world to effectively train models to perform physical actions.

QBeyond technological applications, what is the 'civilizational yardstick' or definition of success that Li Fei-Fei proposes for AI technology?

AShe defines success by the broader impact on civilization: AI should make civilization better, where civilization is composed of individuals pursuing happiness, prosperity, and dignity. The ultimate goal is that any technology should not deprive humans of their dignity, and human dignity and autonomy should be central to AI development, deployment, and governance.

QWhat personal experience did Li Fei-Fei share to illustrate the irreplaceable importance of 3D spatial perception?

AShe shared that about five years ago, she temporarily lost her stereoscopic vision (3D depth perception) due to a corneal injury. Even with her full knowledge of her car's size and the familiar road, she could not accurately judge distances and had to drive very slowly (around 10 mph) to avoid hitting parked cars, demonstrating the critical role of innate 3D spatial understanding for basic tasks.

Lecturas Relacionadas

A Trillion-Dollar Frenzy for Memory Sellers, Halved Profits for Memory Buyers

Summary: A stark divide has emerged in the tech industry. While memory chipmaker Micron's stock soared 19% in a single day, pushing its market cap over $1 trillion, smartphone manufacturer Xiaomi reported a 43% plunge in adjusted net profit. The core driver is a severe supply crunch in memory chips, particularly for AI applications. Wall Street analysts, led by UBS and its unprecedented 204% target price hike for Micron, argue that long-term agreements (LTAs) from AI cloud giants are fundamentally ending the sector's notorious boom-and-bust cycles, justifying a re-rating from cyclical to infrastructure-like valuations. However, the "storage" market is now fragmented into three tiers. The first, AI-grade memory like HBM and server DDR5, faces extreme shortages and soaring prices driven by massive cloud capex. The second, mobile memory for smartphones, is also seeing sharp price hikes as manufacturers like Xiaomi are forced to pay more for remaining capacity, severely squeezing their margins. The third, PC retail channels, shows price declines due to existing inventory. The article questions the sustainability of the "supercycle" narrative. It highlights that Micron's revenue surge is driven almost entirely by price increases, not shipment volumes, making it vulnerable to a potential demand slowdown. While LTAs may dampen volatility, history suggests they are often tested during downturns. The current peak earnings, used to justify high valuations, represent a classic cyclical top. The piece concludes with a note of caution: when the entire Street chants "this time is different," it's wise to remember past bubbles, even as it acknowledges AI demand may indeed be structural.

marsbitHace 46 min(s)

A Trillion-Dollar Frenzy for Memory Sellers, Halved Profits for Memory Buyers

marsbitHace 46 min(s)

This New Generation of US Stock Trading Gods No Longer Read Financial Reports

The new generation of "stock gods" in the 2026 US AI bull market are not analyzing traditional financial reports. Instead of focusing on giants like NVIDIA, figures like the 22-year-old Leopold Aschenbrenner (who reportedly turned $200M into $14B) and influencers like Serenity on platforms like Reddit's WallStreetBets, X, and Substack are gaining fame and returns by targeting obscure, low-cap "micro-cap" stocks. Their strategy, dubbed "supply chain sniping," involves identifying critical, often monopolistic, bottlenecks in the AI hardware supply chain—such as specific materials or components essential for giants like Google and NVIDIA—that are missed by mainstream Wall Street analysts. Serenity's call on AXTI, a $700M company supplying indium phosphide substrates crucial for photonics and optical interconnects, saw the stock soar from ~$12 to nearly $150. Similarly, accounts like KawzInvests and PhotonCap focus on thematic, supply-chain-driven research in areas like AI infrastructure, optics, and cloud services for SMEs, bypassing traditional valuation metrics. This shift represents a cultural move away from Warren Buffett-style value investing based on deep financial statement analysis. The new approach thrives on low liquidity, early narratives, and strong community propagation on social media, similar to meme stocks or crypto. However, this "attention economy" strategy carries risks: it depends on sustained information gaps, the underlying companies' ability to deliver fundamental results, and the potential for crowded, volatile exits as narratives shift. The trend also shows crypto traders applying their narrative-sensing skills to US micro-caps, marking a significant evolution in trading culture.

marsbitHace 48 min(s)

This New Generation of US Stock Trading Gods No Longer Read Financial Reports

marsbitHace 48 min(s)

Trillion-Dollar Euphoria for Memory Sellers, Halved Profits for Memory Buyers

Title: The Trillion-Dollar Memory Seller's Carnival vs. The Buyer's Halved Profits On May 26, a stark contrast unfolded. While memory chipmaker Micron's market cap surged past $1 trillion, smartphone maker Xiaomi reported plummeting profits. Xiaomi's Q1 2026 profits fell 43% year-on-year. Executive Lu Weibing cited memory prices quadrupling from last year, adding roughly $210 to a phone's cost. To survive, Xiaomi is cutting entry-level models, sacrificing volume. Micron's stock, however, skyrocketed over 19% in a day, capping an 8x gain in a year. Major banks like UBS and JPMorgan issued bullish reports, raising price targets drastically. Their core thesis: Long-Term Agreements (LTAs) with AI cloud giants (Microsoft, Google, etc.) are eliminating the memory industry's notorious boom-bust cycle. By locking in fixed-price, multi-year contracts for AI-grade memory (HBM, server DDR5), these deals promise stable, utility-like earnings, justifying a higher valuation (20-30x P/E vs. the historical 8-15x). The article reveals a three-tiered memory market in 2026: 1) **AI Storage (HBM/DDR5/Enterprise SSD)**: Extreme shortage, soaring prices, LTAs. This is Micron's story. 2) **Mobile/Embedded Memory**: Also facing sharp price hikes as AI production crowds out capacity, severely pressuring phone makers like Xiaomi. 3) **PC Retail**: Some spot prices are falling due to channel inventory liquidation, creating a divergence from contract markets. The author questions if LTAs truly end the cycle. It hinges on sustained, hyper-growth AI demand. Micron's current profits are at a cycle peak, driven mostly by price hikes, not volume. If AI capital expenditure growth slows, the massive industry capacity expansion (e.g., Micron's $250B+ CapEx plan) could lead to a glut. Historically, using peak-cycle earnings for valuation is a classic trap. While the AI-driven structural shift might be real, the unanimous Wall Street euphoria warrants caution, echoing past bubbles like Cisco's in 2000. The memory seller's trillion-dollar狂欢 (carnival) continues, but the cycle's shadow remains.

链捕手Hace 55 min(s)

Trillion-Dollar Euphoria for Memory Sellers, Halved Profits for Memory Buyers

链捕手Hace 55 min(s)

Trading

Spot
Futuros

Artículos destacados

Qué es $S$

Entendiendo SPERO: Una Visión General Completa Introducción a SPERO A medida que el panorama de la innovación continúa evolucionando, la aparición de tecnologías web3 y proyectos de criptomonedas juega un papel fundamental en la configuración del futuro digital. Un proyecto que ha atraído la atención en este campo dinámico es SPERO, denotado como SPERO,$$s$. Este artículo tiene como objetivo reunir y presentar información detallada sobre SPERO, para ayudar a entusiastas e inversores a comprender sus fundamentos, objetivos e innovaciones dentro de los dominios web3 y cripto. ¿Qué es SPERO,$$s$? SPERO,$$s$ es un proyecto único dentro del espacio cripto que busca aprovechar los principios de descentralización y tecnología blockchain para crear un ecosistema que promueva la participación, la utilidad y la inclusión financiera. El proyecto está diseñado para facilitar interacciones de igual a igual de nuevas maneras, proporcionando a los usuarios soluciones y servicios financieros innovadores. En su esencia, SPERO,$$s$ tiene como objetivo empoderar a los individuos al proporcionar herramientas y plataformas que mejoren la experiencia del usuario en el espacio de las criptomonedas. Esto incluye habilitar métodos de transacción más flexibles, fomentar iniciativas impulsadas por la comunidad y crear caminos para oportunidades financieras a través de aplicaciones descentralizadas (dApps). La visión subyacente de SPERO,$$s$ gira en torno a la inclusividad, buscando cerrar brechas dentro de las finanzas tradicionales mientras aprovecha los beneficios de la tecnología blockchain. ¿Quién es el Creador de SPERO,$$s$? La identidad del creador de SPERO,$$s$ sigue siendo algo oscura, ya que hay recursos públicos limitados que proporcionan información de fondo detallada sobre su(s) fundador(es). Esta falta de transparencia puede derivarse del compromiso del proyecto con la descentralización, una ética que muchos proyectos web3 comparten, priorizando las contribuciones colectivas sobre el reconocimiento individual. Al centrar las discusiones en torno a la comunidad y sus objetivos colectivos, SPERO,$$s$ encarna la esencia del empoderamiento sin señalar a individuos específicos. Como tal, comprender la ética y la misión de SPERO sigue siendo más importante que identificar a un creador singular. ¿Quiénes son los Inversores de SPERO,$$s$? SPERO,$$s$ cuenta con el apoyo de una diversa gama de inversores que van desde capitalistas de riesgo hasta inversores ángeles dedicados a fomentar la innovación en el sector cripto. El enfoque de estos inversores generalmente se alinea con la misión de SPERO, priorizando proyectos que prometen avances tecnológicos sociales, inclusión financiera y gobernanza descentralizada. Estas fundaciones de inversores suelen estar interesadas en proyectos que no solo ofrecen productos innovadores, sino que también contribuyen positivamente a la comunidad blockchain y sus ecosistemas. El respaldo de estos inversores refuerza a SPERO,$$s$ como un contendiente notable en el dominio de proyectos cripto que evoluciona rápidamente. ¿Cómo Funciona SPERO,$$s$? SPERO,$$s$ emplea un marco multifacético que lo distingue de los proyectos de criptomonedas convencionales. Aquí hay algunas de las características clave que subrayan su singularidad e innovación: Gobernanza Descentralizada: SPERO,$$s$ integra modelos de gobernanza descentralizada, empoderando a los usuarios para participar activamente en los procesos de toma de decisiones sobre el futuro del proyecto. Este enfoque fomenta un sentido de propiedad y responsabilidad entre los miembros de la comunidad. Utilidad del Token: SPERO,$$s$ utiliza su propio token de criptomoneda, diseñado para servir diversas funciones dentro del ecosistema. Estos tokens permiten transacciones, recompensas y la facilitación de servicios ofrecidos en la plataforma, mejorando la participación y la utilidad general. Arquitectura en Capas: La arquitectura técnica de SPERO,$$s$ apoya la modularidad y escalabilidad, permitiendo la integración fluida de características y aplicaciones adicionales a medida que el proyecto evoluciona. Esta adaptabilidad es fundamental para mantener la relevancia en el cambiante paisaje cripto. Participación de la Comunidad: El proyecto enfatiza iniciativas impulsadas por la comunidad, empleando mecanismos que incentivan la colaboración y la retroalimentación. Al nutrir una comunidad sólida, SPERO,$$s$ puede abordar mejor las necesidades de los usuarios y adaptarse a las tendencias del mercado. Enfoque en la Inclusión: Al ofrecer tarifas de transacción bajas e interfaces amigables para el usuario, SPERO,$$s$ busca atraer a una base de usuarios diversa, incluyendo a individuos que anteriormente pueden no haber participado en el espacio cripto. Este compromiso con la inclusión se alinea con su misión general de empoderamiento a través de la accesibilidad. Cronología de SPERO,$$s$ Entender la historia de un proyecto proporciona información crucial sobre su trayectoria de desarrollo y hitos. A continuación se presenta una cronología sugerida que mapea eventos significativos en la evolución de SPERO,$$s$: Fase de Conceptualización e Ideación: Las ideas iniciales que forman la base de SPERO,$$s$ fueron concebidas, alineándose estrechamente con los principios de descentralización y enfoque comunitario dentro de la industria blockchain. Lanzamiento del Whitepaper del Proyecto: Tras la fase conceptual, se lanzó un whitepaper completo que detalla la visión, los objetivos y la infraestructura tecnológica de SPERO,$$s$ para generar interés y retroalimentación de la comunidad. Construcción de Comunidad y Primeras Interacciones: Se realizaron esfuerzos de divulgación activa para construir una comunidad de primeros adoptantes y posibles inversores, facilitando discusiones en torno a los objetivos del proyecto y obteniendo apoyo. Evento de Generación de Tokens: SPERO,$$s$ llevó a cabo un evento de generación de tokens (TGE) para distribuir sus tokens nativos a los primeros seguidores y establecer liquidez inicial dentro del ecosistema. Lanzamiento de la dApp Inicial: La primera aplicación descentralizada (dApp) asociada con SPERO,$$s$ se puso en marcha, permitiendo a los usuarios interactuar con las funcionalidades centrales de la plataforma. Desarrollo Continuo y Alianzas: Actualizaciones y mejoras continuas a las ofertas del proyecto, incluyendo alianzas estratégicas con otros actores en el espacio blockchain, han moldeado a SPERO,$$s$ en un jugador competitivo y en evolución en el mercado cripto. Conclusión SPERO,$$s$ se erige como un testimonio del potencial de web3 y las criptomonedas para revolucionar los sistemas financieros y empoderar a los individuos. Con un compromiso con la gobernanza descentralizada, la participación comunitaria y funcionalidades diseñadas de manera innovadora, allana el camino hacia un paisaje financiero más inclusivo. Como con cualquier inversión en el espacio cripto que evoluciona rápidamente, se anima a los posibles inversores y usuarios a investigar a fondo y participar de manera reflexiva con los desarrollos en curso dentro de SPERO,$$s$. El proyecto muestra el espíritu innovador de la industria cripto, invitando a una mayor exploración de sus innumerables posibilidades. Mientras el viaje de SPERO,$$s$ aún se desarrolla, sus principios fundamentales pueden, de hecho, influir en el futuro de cómo interactuamos con la tecnología, las finanzas y entre nosotros en ecosistemas digitales interconectados.

72 Vistas totalesPublicado en 2024.12.17Actualizado en 2024.12.17

Qué es $S$

Qué es AGENT S

Agent S: El Futuro de la Interacción Autónoma en Web3 Introducción En el paisaje en constante evolución de Web3 y las criptomonedas, las innovaciones están redefiniendo constantemente cómo los individuos interactúan con las plataformas digitales. Uno de estos proyectos pioneros, Agent S, promete revolucionar la interacción humano-computadora a través de su marco agente abierto. Al allanar el camino para interacciones autónomas, Agent S busca simplificar tareas complejas, ofreciendo aplicaciones transformadoras en inteligencia artificial (IA). Esta exploración detallada profundizará en las complejidades del proyecto, sus características únicas y las implicaciones para el dominio de las criptomonedas. ¿Qué es Agent S? Agent S se presenta como un marco agente abierto innovador, diseñado específicamente para abordar tres desafíos fundamentales en la automatización de tareas informáticas: Adquisición de Conocimiento Específico del Dominio: El marco aprende inteligentemente de diversas fuentes de conocimiento externas y experiencias internas. Este enfoque dual le permite construir un rico repositorio de conocimiento específico del dominio, mejorando su rendimiento en la ejecución de tareas. Planificación a Largo Plazo de Tareas: Agent S emplea planificación jerárquica aumentada por la experiencia, un enfoque estratégico que facilita la descomposición y ejecución eficiente de tareas complejas. Esta característica mejora significativamente su capacidad para gestionar múltiples subtareas de manera eficiente y efectiva. Manejo de Interfaces Dinámicas y No Uniformes: El proyecto introduce la Interfaz Agente-Computadora (ACI), una solución innovadora que mejora la interacción entre agentes y usuarios. Utilizando Modelos de Lenguaje Multimodal de Gran Escala (MLLMs), Agent S puede navegar y manipular diversas interfaces gráficas de usuario sin problemas. A través de estas características pioneras, Agent S proporciona un marco robusto que aborda las complejidades involucradas en la automatización de la interacción humana con las máquinas, preparando el terreno para una multitud de aplicaciones en IA y más allá. ¿Quién es el Creador de Agent S? Si bien el concepto de Agent S es fundamentalmente innovador, la información específica sobre su creador sigue siendo elusiva. El creador es actualmente desconocido, lo que resalta ya sea la etapa incipiente del proyecto o la elección estratégica de mantener a los miembros fundadores en el anonimato. Independientemente de la anonimidad, el enfoque sigue siendo en las capacidades y el potencial del marco. ¿Quiénes son los Inversores de Agent S? Dado que Agent S es relativamente nuevo en el ecosistema criptográfico, la información detallada sobre sus inversores y patrocinadores financieros no está documentada explícitamente. La falta de información disponible públicamente sobre las bases de inversión u organizaciones que apoyan el proyecto plantea preguntas sobre su estructura de financiamiento y hoja de ruta de desarrollo. Comprender el respaldo es crucial para evaluar la sostenibilidad del proyecto y su posible impacto en el mercado. ¿Cómo Funciona Agent S? En el núcleo de Agent S se encuentra una tecnología de vanguardia que le permite funcionar de manera efectiva en diversos entornos. Su modelo operativo se basa en varias características clave: Interacción Humano-Computadora Similar a la Humana: El marco ofrece planificación avanzada de IA, esforzándose por hacer que las interacciones con las computadoras sean más intuitivas. Al imitar el comportamiento humano en la ejecución de tareas, promete elevar las experiencias de los usuarios. Memoria Narrativa: Empleada para aprovechar experiencias de alto nivel, Agent S utiliza memoria narrativa para hacer un seguimiento de las historias de tareas, mejorando así sus procesos de toma de decisiones. Memoria Episódica: Esta característica proporciona a los usuarios una guía paso a paso, permitiendo que el marco ofrezca apoyo contextual a medida que se desarrollan las tareas. Soporte para OpenACI: Con la capacidad de ejecutarse localmente, Agent S permite a los usuarios mantener el control sobre sus interacciones y flujos de trabajo, alineándose con la ética descentralizada de Web3. Fácil Integración con APIs Externas: Su versatilidad y compatibilidad con varias plataformas de IA aseguran que Agent S pueda encajar sin problemas en ecosistemas tecnológicos existentes, convirtiéndolo en una opción atractiva para desarrolladores y organizaciones. Estas funcionalidades contribuyen colectivamente a la posición única de Agent S dentro del espacio cripto, ya que automatiza tareas complejas y de múltiples pasos con una intervención humana mínima. A medida que el proyecto evoluciona, sus posibles aplicaciones en Web3 podrían redefinir cómo se desarrollan las interacciones digitales. Cronología de Agent S El desarrollo y los hitos de Agent S pueden encapsularse en una cronología que resalta sus eventos significativos: 27 de septiembre de 2024: El concepto de Agent S fue lanzado en un documento de investigación integral titulado “Un Marco Agente Abierto que Usa Computadoras Como un Humano”, mostrando las bases del proyecto. 10 de octubre de 2024: El documento de investigación fue puesto a disposición del público en arXiv, ofreciendo una exploración profunda del marco y su evaluación de rendimiento basada en el benchmark OSWorld. 12 de octubre de 2024: Se lanzó una presentación en video, proporcionando una visión visual de las capacidades y características de Agent S, involucrando aún más a posibles usuarios e inversores. Estos marcadores en la cronología no solo ilustran el progreso de Agent S, sino que también indican su compromiso con la transparencia y la participación comunitaria. Puntos Clave Sobre Agent S A medida que el marco Agent S continúa evolucionando, varios atributos clave destacan, subrayando su naturaleza innovadora y potencial: Marco Innovador: Diseñado para proporcionar un uso intuitivo de las computadoras similar a la interacción humana, Agent S aporta un enfoque novedoso a la automatización de tareas. Interacción Autónoma: La capacidad de interactuar de manera autónoma con las computadoras a través de GUI significa un salto hacia soluciones informáticas más inteligentes y eficientes. Automatización de Tareas Complejas: Con su metodología robusta, puede automatizar tareas complejas y de múltiples pasos, haciendo que los procesos sean más rápidos y menos propensos a errores. Mejora Continua: Los mecanismos de aprendizaje permiten a Agent S mejorar a partir de experiencias pasadas, mejorando continuamente su rendimiento y eficacia. Versatilidad: Su adaptabilidad en diferentes entornos operativos como OSWorld y WindowsAgentArena asegura que pueda servir a una amplia gama de aplicaciones. A medida que Agent S se posiciona en el paisaje de Web3 y criptomonedas, su potencial para mejorar las capacidades de interacción y automatizar procesos significa un avance significativo en las tecnologías de IA. A través de su marco innovador, Agent S ejemplifica el futuro de las interacciones digitales, prometiendo una experiencia más fluida y eficiente para los usuarios en diversas industrias. Conclusión Agent S representa un audaz avance en la unión de la IA y Web3, con la capacidad de redefinir cómo interactuamos con la tecnología. Aunque aún se encuentra en sus primeras etapas, las posibilidades para su aplicación son vastas y atractivas. A través de su marco integral que aborda desafíos críticos, Agent S busca llevar las interacciones autónomas al primer plano de la experiencia digital. A medida que nos adentramos más en los reinos de las criptomonedas y la descentralización, proyectos como Agent S sin duda desempeñarán un papel crucial en la configuración del futuro de la tecnología y la colaboración humano-computadora.

457 Vistas totalesPublicado en 2025.01.14Actualizado en 2025.01.14

Qué es AGENT S

Cómo comprar S

¡Bienvenido a HTX.com! Hemos hecho que comprar Sonic (S) sea simple y conveniente. Sigue nuestra guía paso a paso para iniciar tu viaje de criptos.Paso 1: crea tu cuenta HTXUtiliza tu correo electrónico o número de teléfono para registrarte y obtener una cuenta gratuita en HTX. Experimenta un proceso de registro sin complicaciones y desbloquea todas las funciones.Obtener mi cuentaPaso 2: ve a Comprar cripto y elige tu método de pagoTarjeta de crédito/débito: usa tu Visa o Mastercard para comprar Sonic (S) al instante.Saldo: utiliza fondos del saldo de tu cuenta HTX para tradear sin problemas.Terceros: hemos agregado métodos de pago populares como Google Pay y Apple Pay para mejorar la comodidad.P2P: tradear directamente con otros usuarios en HTX.Over-the-Counter (OTC): ofrecemos servicios personalizados y tipos de cambio competitivos para los traders.Paso 3: guarda tu Sonic (S)Después de comprar tu Sonic (S), guárdalo en tu cuenta HTX. Alternativamente, puedes enviarlo a otro lugar mediante transferencia blockchain o utilizarlo para tradear otras criptomonedas.Paso 4: tradear Sonic (S)Tradear fácilmente con Sonic (S) en HTX's mercado spot. Simplemente accede a tu cuenta, selecciona tu par de trading, ejecuta tus trades y monitorea en tiempo real. Ofrecemos una experiencia fácil de usar tanto para principiantes como para traders experimentados.

872 Vistas totalesPublicado en 2025.01.15Actualizado en 2025.03.21

Cómo comprar S

Discusiones

Bienvenido a la comunidad de HTX. Aquí puedes mantenerte informado sobre los últimos desarrollos de la plataforma y acceder a análisis profesionales del mercado. A continuación se presentan las opiniones de los usuarios sobre el precio de S (S).

活动图片