When AI's Bottleneck Is No Longer the Model: Perseus Yang's Open Source Ecosystem Building Practices and Reflections

marsbitPublicado a 2026-04-13Actualizado a 2026-04-13

Resumen

In 2026, the AI industry's primary bottleneck is no longer model capability but rather the encoding of domain knowledge, agent-world interfaces, and toolchain maturity. The open-source community is rapidly bridging this gap, evidenced by projects like OpenClaw and Claude Code experiencing explosive growth in their Skill ecosystems. Perseus Yang, a contributor to over a dozen AI open-source projects, argues that Skill systems are the most underestimated infrastructure of the AI agent era. They enable non-coders to program AI by writing natural language SKILL.md files, transferring power from engineers to all professionals. His project, GTM Engineer Skills, demonstrates this by automating go-to-market workflows, proving Skills can extend far beyond engineering into areas like product strategy and business analysis. He also identifies a critical blind spot: while browser automation thrives, agent operations are nearly absent from mobile apps, the world's dominant computing interface. His project, OpenPocket, is an open-source framework that allows agents to operate Android devices via ADB. It features human-in-the-loop security, agent isolation, and the ability for agents to autonomously create and save new reusable Skills. Yang believes the value of open source lies not in the code itself, but in defining the infrastructure standards during this formative period. His work validates the SKILL.md format as a portable unit for agent capability and pioneers new architectures for...

Author: Liu Jun

In 2026, a consensus is forming in the AI industry: model capability is no longer the bottleneck. The gap lies outside the model—in the encoding of domain knowledge, in the interface between agents and the real world, in the maturity of toolchains. This gap is being filled by the open-source community, and the speed exceeds everyone's expectations. OpenClaw gained 60,000 GitHub stars within 72 hours, surpassing 350,000 three months later. The Claude Code Skill ecosystem grew from 50 to over 334 Skills within half a year. Hermes Agent is even more radical, enabling agents to autonomously build reusable skills. Data from Vela Partners shows that in the past 90 days, the combined categories of personal AI assistants and Agentic Skill plugins added 244,000 new stars. This is a Skill explosion.

Perseus Yang's work sits at the heart of this explosion. With a background in Mathematics and Computer Science from Cornell, a member of the Forbes Business Council, and a THINC Fellowship recipient, he has participated in and maintained over a dozen AI-related open-source projects on GitHub in recent years, covering areas such as agent skill expansion, mobile device-level control, AI engine optimization toolchains, GEO data analysis agents, content automation workflows, and payment protocol infrastructure. His characteristic is possessing both a deep engineering background and strong product intuition. He doesn't just write code; he defines what a tool should look like based on user needs, then builds it end-to-end and drives its adoption.

Here are several core judgments he has formed during this process.

First Judgment: The Skill System is the Most Underestimated Infrastructure in the AI Agent Era

After Anthropic released Agent Skills as an open standard at the end of 2025, OpenAI's Codex CLI also adopted the same SKILL.md format. OpenClaw's ClawHub registry has accumulated over 13,000 community-contributed Skills, and the Claude Code ecosystem is quickly following suit. The significance of Skills goes far beyond "adding plugins to agents." It essentially enables people who don't know how to code to participate in AI programming. An operations personnel can write a SKILL.md in natural language, enabling an agent to learn a new workflow. This is a paradigm shift: the true power of AI depends not on the model's parameter count, but on what domain knowledge is injected into the model, and Skills extend the power to inject knowledge from engineers to everyone.

But Perseus observed a problem. The vast majority of Skills are concentrated in the engineering field—code review, front-end design, DevOps, testing. Expertise in non-engineering fields has hardly been systematically encoded into Skills. This means the coverage of the Skill ecosystem is far from reaching its potential boundary.

This observation drove a series of his open-source work in the GTM (Go-To-Market) toolchain direction. The most representative is GTM Engineer Skills, a set of Claude Code and Codex skill sets covering the complete workflow of AI engine discoverability, which has accumulated over 600 stars on GitHub. It encodes work that traditionally requires collaboration between SEO experts, content strategists, and front-end developers into an automated process executable by a single person: website AI discoverability audit, content structure optimization, keyword research, a machine-parsable layer for data visualization. The auditor doesn't output suggestions; instead, it automatically detects the front-end framework and generates code fixes that can be directly submitted as a Pull Request. Around the same direction, he also built a supporting GEO analysis tool that can simultaneously send queries to ChatGPT, Claude, Gemini, and Perplexity to analyze brand mention rates, sentiment, market share, and competitive positioning, outputting interactive HTML reports and structured data.

The actual results demonstrate the product value of this toolset. Companies like Articuler AI and Axis Robotics used GTM Engineer Skills to complete the full process from research to Resource Center setup in a few hours, whereas such work traditionally requires dozens of hours of cross-team collaboration. This efficiency gap is not achieved by model capability, but by Perseus's deep understanding and productized breakdown of the GTM workflow: he broke down a vague "improve AI discoverability" requirement into standardized stages executable step-by-step by an agent, each with clear inputs, outputs, and quality checks. This toolchain is currently adopted by over a dozen startups and several Fortune 500 companies. The open-source tool is the entry point, the commercial product is the scaled extension, and both share the same technical core.

The project itself is valuable, but Perseus believes the proposition is more important: the capability boundary of the Skill system extends far beyond the engineering field. Product strategy, go-to-market, business analysis—any expertise that can be structurally described can be encoded into agent capabilities.

Second Judgment: AI Agent's Operational Boundary Should Not Stop at Browsers and APIs

The agent discussion in 2026 is dominated by browser agents and API integrations. LangGraph, CrewAI, and Google ADK constitute a thriving multi-agent orchestration ecosystem. But Perseus noticed a structural blind spot: most global digital activity happens in native mobile apps—social, payment, gaming, communication—and these apps lack public APIs and browser equivalents. Existing frameworks cannot operate WeChat, Douyin, WhatsApp, or Alipay. Mobile is the world's dominant computing interface, but the infrastructure for native mobile agents is almost zero.

Perseus's thinking is: Why is everyone teaching AI to operate browsers, but no one is seriously teaching it to operate phones? The prosperity of browser agents is largely because the web is naturally automation-friendly, with DOM, APIs, and mature toolchains like Playwright. But the phone is a completely different world. Native apps are black boxes, without structured interface descriptions; operations can only be performed by simulating human touches and swipes. The difficulty of this problem lies not in getting the LLM to understand whether a button should be pressed, but in building the entire execution layer infrastructure from scratch: device connection management, screen state parsing, device mutex between multiple agents, security boundaries for sensitive operations.

This judgment drove the birth of OpenPocket. It is an open-source framework that uses ADB to allow LLM-driven agents to autonomously operate Android devices, currently with about a dozen contributors and over 500 commits. What users are really doing with it speaks volumes: automatically managing social media accounts, replying to messages in IMs for you, handling payments and bills on the phone, even automatically playing mobile games. A typical scenario is: the user tells the agent in natural language "Open Slack every morning at 8 am to check in," and the agent will persistently run this task in an isolated session, turning a previously manual, repetitive daily operation into background automation.

Perseus made several key product and architectural choices in this project. First, agents can automatically create new Skills during runtime. When encountering an unfamiliar operation flow, it can save the learned steps as a reusable SKILL.md for direct调用 next time. This means the agent is not a tool with fixed capabilities, but a system that grows stronger with use. Second, all sensitive operations must be approved by a human, rather than letting the agent judge what is safe. In his view, the most dangerous thing about autonomous agents is not that they do the wrong thing, but that they do the wrong thing "confidently" while thinking they are right. Third, each agent is completely isolated, bound to an independent device, configuration, and session state, allowing multiple agents to run simultaneously without interfering with each other. If only TypeScript engineers can extend the agent's capabilities, this ecosystem will never grow large, so OpenPocket, like Claude Code, uses SKILL.md as the standard format for capability extension.

The entire system supports 29+ LLM configurations. Agent phones are completely isolated from users' personal phones, and all data remains local. In 2026, with OWASP listing "Tool Misuse" among the Top 10 Risks for Agentic AI and the high-risk obligations of the EU AI Act about to take effect, this local-first, human-in-the-loop design is not conservative but a prerequisite for agents entering real-world scenarios.

Third Judgment: The Value of Open Source Lies Not in the Code Itself, But in the Definition of Standards at the Infrastructure Layer

Perseus's understanding of open source is not "putting code on GitHub." He repeatedly mentions a viewpoint: The open-source AI ecosystem in 2026 is in a window where standards have not yet solidified. The architectural patterns and interface specifications adopted by the community now will become the industry's default infrastructure in the coming years. In this window, defining a niche is more important than optimizing an existing solution.

Specifically, his Skill project pushed forward something technically meaningful: proving that the SKILL.md format is not just a container for engineering tools, but a sufficiently general standard for encoding domain knowledge. When the same SKILL.md can be loaded and executed by Claude Code, OpenAI Codex CLI, and OpenClaw, it de facto becomes the "portable capability unit" of the AI agent ecosystem. Perseus stuffed the complete workflow of go-to-market—a non-engineering field—into this format and successfully ran end-to-end automation from audit to code fix. This is a significant validation of the generality of the entire Skill standard.

His mobile agent project addresses an architectural gap at the agent execution layer. Existing agent frameworks rely on structured interfaces at the tool-calling level, either APIs or DOM. OpenPocket must operate in an environment without any structured interface, relying purely on screen pixel parsing and touch event injection. This forced the project to redesign the agent's perception-decision-execution loop from the ground up, including real-time parsing of device state, device mutex protocols for multiple agents, and automatic recovery mechanisms after operation failures. These are not simple adaptations of existing agent frameworks, but an architectural solution independently evolved for the problem of "autonomous operation in API-less environments."

The engineering design of the two projects is worth mentioning separately. OpenPocket adopts a three-layer separated architecture of Manager, Gateway, and Agent Runtime, where each layer can be iterated independently, and community contributors only need to focus on the layer they are familiar with. Each Skill within GTM Engineer Skills follows a staged pipeline design internally, where the output of the previous stage is the input of the next, with mandatory quality check gates in between. The workflow can be interrupted and resumed at any stage, and errors can be pinpointed to a specific stage. The purpose of these architectural choices is the same: to make the open-source project trustworthy for real users in production environments.

From a product perspective, these two projects also share a commonality: Perseus always places "who will use it" and "how to extend it" at the forefront of architectural decisions. The target users of GTM Engineer Skills are not engineers but growth teams, so each Skill has clear input-output contracts and built-in quality checks, allowing non-technical users to understand what the agent is doing. OpenPocket's SKILL.md extension mechanism, natural language scheduled tasks, and multi-channel access (Telegram, Discord, WhatsApp, CLI) are all designed to lower the barrier to entry for non-engineering users. In his view, if an open-source infrastructure project can only be used by engineers, its ceiling is the size of the engineering community. The truly leveraged design is to enable the boundary of agent capabilities to be expanded collectively by practitioners from all fields.

This pattern runs through his multiple projects. It's not about doing application-layer development on existing frameworks, but identifying missing components in the infrastructure layer of the agent ecosystem and then building them.

The Bigger Picture

The open-source AI ecosystem in 2026 is experiencing a moment similar to the early cloud-native ecosystem of the 2010s: standards and tools at the infrastructure layer are being defined, and these definitions will constrain the entire industry's development path for years to come. In this window, every Skill format adopted by the community, every agent architectural pattern validated, every ecosystem gap filled, is participating in shaping the next interface layer of AI.

What Perseus Yang is doing is simple: using engineering capability and product thinking to explore the paradigm at the technological frontier of the AI era. Models will continue to become more powerful, but who defines how agents should interact with the real world, who decides in what form domain knowledge should be encoded and distributed—the answers to these questions will not grow out of models. They can only be figured out bit by bit by people who build things.

Preguntas relacionadas

QAccording to the article, what is the current bottleneck in the AI industry as of 2026?

AThe bottleneck is no longer the model capabilities themselves, but rather the gap in encoding domain knowledge, creating interfaces for agents and the real world, and the maturity of toolchains.

QWhat is the significance of the SKILL.md format, as discussed in the article?

AThe SKILL.md format is an open standard that allows non-coders to participate in AI programming. It enables anyone to define a new workflow for an AI agent using natural language, making it a portable unit of capability that can be executed across different AI platforms like Claude Code and OpenAI Codex CLI.

QWhat problem did Perseus Yang identify with the current landscape of AI agents and mobile applications?

AHe identified a structural blind spot: while most digital activity happens within native mobile apps (like WeChat, TikTok, WhatsApp, Alipay), these apps lack public APIs and are not accessible to browser-based agents. This creates a significant gap, as there is almost no infrastructure for native mobile AI agents.

QWhat are the key architectural and safety features of the OpenPocket project?

AKey features include: agents that can autonomously create new Skills from learned operations; a requirement for human approval on sensitive operations; complete isolation of each agent with its own device and session state; and a design that keeps all operations local to the device for security and privacy.

QHow does Perseus Yang view the role of open source in the current AI ecosystem?

AHe believes the value of open source lies not just in sharing code, but in defining the architectural patterns and interface standards that will become the default infrastructure for the entire industry. He focuses on identifying and building missing components at the infrastructure layer to shape how agents interact with the real world.

Lecturas Relacionadas

Crypto’s CLARITY Act Sits At 50-50 For 2026 Passage, Galaxy Says

Galaxy Digital estimates a 50-50 chance that the CLARITY Act, a key U.S. crypto market structure bill, will be passed into law in 2026. The bill, which passed the House with bipartisan support, is now in the Senate Banking Committee. A planned markup was delayed until May, and Galaxy warns that if it slips past mid-May, the odds of passage drop sharply. Key unresolved issues include stablecoin rewards language, protections for software developers, and SEC commissioner vacancies. The report emphasizes that the bill is strong on policy but faces a tight legislative calendar and significant procedural steps. Failure to pass it by May could delay crypto legislation indefinitely, especially with midterm elections potentially shifting priorities.

bitcoinistHace 21 min(s)

Crypto’s CLARITY Act Sits At 50-50 For 2026 Passage, Galaxy Says

bitcoinistHace 21 min(s)

Bitcoin Fees Crash To Lowest Level In A Decade, But What Does This Mean For Price?

Bitcoin transaction fees have plummeted to their lowest levels in over a decade, with the average fee dropping to just $0.22, a figure not seen since 2011. This sharp decline is attributed to reduced network participation and ongoing bear market conditions, leading to decreased liquidity. However, historically, such periods of low fees and diminished activity have often preceded the start of a new bull run. With liquidity at low levels, even a modest influx of new capital could significantly push the price upward due to minimal resistance. While a price recovery may take time, there are early signs of improving market sentiment, as indicated by a shift from Extreme Fear to Fear on the Fear & Greed Index.

bitcoinistHace 1 hora(s)

Bitcoin Fees Crash To Lowest Level In A Decade, But What Does This Mean For Price?

bitcoinistHace 1 hora(s)

IOG Unveils Cardano 2030 Scaling Plan: 27 Million Monthly Transactions With Leios

Input Output Global (IOG) has unveiled a scaling plan aimed at increasing Cardano’s transaction capacity from 800,000 to 27 million monthly transactions by 2030. A key part of this roadmap is the Leios upgrade, which is currently progressing through Software Readiness Levels 5 to 8 to prepare for mainnet deployment. The plan includes developing a release candidate, conducting rigorous validation through load and adversarial testing, and enabling a hard fork. IOG emphasizes that success is measured by completing preparatory work—such as technical documentation and ecosystem coordination—rather than by the timing of the mainnet launch. The upgrade is expected to support network growth, higher total value locked (TVL), and improved sustainability. ADA was trading at $0.25 at the time of writing.

bitcoinistHace 2 hora(s)

IOG Unveils Cardano 2030 Scaling Plan: 27 Million Monthly Transactions With Leios

bitcoinistHace 2 hora(s)

Sentora and Firelight Collaborate to Bring Native DeFi Coverage

Sentora, an institutional DeFi intelligence platform, and Firelight Protocol have partnered to integrate native coverage for Sentora’s public and private vaults. This collaboration introduces a capital-backed protection layer designed to facilitate institutional participation by offering defense against risks like smart contract exploits, oracle failures, and bad debt. The integration addresses a key structural gap in DeFi, where security concerns have limited institutional adoption. Firelight, built on Flare Network, uses FXRP—a 1:1 representation of XRP—as primary collateral, enabling XRP to function as a yield-bearing asset while diversifying the reserve base. The partnership aims to standardize embedded protection within DeFi capital deployment, combining Sentora’s risk models with Firelight’s automated claims and underwriting systems. Flare, a strategic investor in Sentora, supports the alignment of infrastructure and risk layers. Together, the collaboration seeks to enhance trust and accelerate broader institutional adoption of onchain finance.

TheNewsCryptoHace 2 hora(s)

Sentora and Firelight Collaborate to Bring Native DeFi Coverage

TheNewsCryptoHace 2 hora(s)

a16z: Scaling AI Without Cryptographic Verification Is a Dangerous Liability

A16z argues that scaling AI without cryptographic verification is a dangerous liability. As AI agents rapidly evolve from tools into autonomous economic participants, they currently lack standardized, portable identities, verifiable permissions, and programmable payment methods. This creates systemic risks in an economy where non-human entities already vastly outnumber human users in sectors like finance. Blockchain infrastructure offers a solution by providing a neutral coordination layer. It enables verifiable, on-chain credentials for agent identity (a "Know Your Agent" standard), ensures transparent governance to prevent centralized control of AI systems, and facilitates native payments through stablecoins and emerging markets for AI-to-AI commerce. Without cryptographic guarantees—such as auditable transaction records, constrained agent behavior, and proof of origin—scaling AI agents accumulates unmanaged risk. Trust deficit, not intelligence, becomes the bottleneck. The authors conclude that cryptographic verification is essential to maintain user control, ensure accountability, and safely delegate economic activity to autonomous systems.

marsbitHace 2 hora(s)

a16z: Scaling AI Without Cryptographic Verification Is a Dangerous Liability

marsbitHace 2 hora(s)

Trading

Spot

Futuros

Artículos destacados

Qué es $S$

Entendiendo SPERO: Una Visión General Completa Introducción a SPERO A medida que el panorama de la innovación continúa evolucionando, la aparición de tecnologías web3 y proyectos de criptomonedas juega un papel fundamental en la configuración del futuro digital. Un proyecto que ha atraído la atención en este campo dinámico es SPERO, denotado como SPERO,$$s$. Este artículo tiene como objetivo reunir y presentar información detallada sobre SPERO, para ayudar a entusiastas e inversores a comprender sus fundamentos, objetivos e innovaciones dentro de los dominios web3 y cripto. ¿Qué es SPERO,$$s$? SPERO,$$s$ es un proyecto único dentro del espacio cripto que busca aprovechar los principios de descentralización y tecnología blockchain para crear un ecosistema que promueva la participación, la utilidad y la inclusión financiera. El proyecto está diseñado para facilitar interacciones de igual a igual de nuevas maneras, proporcionando a los usuarios soluciones y servicios financieros innovadores. En su esencia, SPERO,$$s$ tiene como objetivo empoderar a los individuos al proporcionar herramientas y plataformas que mejoren la experiencia del usuario en el espacio de las criptomonedas. Esto incluye habilitar métodos de transacción más flexibles, fomentar iniciativas impulsadas por la comunidad y crear caminos para oportunidades financieras a través de aplicaciones descentralizadas (dApps). La visión subyacente de SPERO,$$s$ gira en torno a la inclusividad, buscando cerrar brechas dentro de las finanzas tradicionales mientras aprovecha los beneficios de la tecnología blockchain. ¿Quién es el Creador de SPERO,$$s$? La identidad del creador de SPERO,$$s$ sigue siendo algo oscura, ya que hay recursos públicos limitados que proporcionan información de fondo detallada sobre su(s) fundador(es). Esta falta de transparencia puede derivarse del compromiso del proyecto con la descentralización, una ética que muchos proyectos web3 comparten, priorizando las contribuciones colectivas sobre el reconocimiento individual. Al centrar las discusiones en torno a la comunidad y sus objetivos colectivos, SPERO,$$s$ encarna la esencia del empoderamiento sin señalar a individuos específicos. Como tal, comprender la ética y la misión de SPERO sigue siendo más importante que identificar a un creador singular. ¿Quiénes son los Inversores de SPERO,$$s$? SPERO,$$s$ cuenta con el apoyo de una diversa gama de inversores que van desde capitalistas de riesgo hasta inversores ángeles dedicados a fomentar la innovación en el sector cripto. El enfoque de estos inversores generalmente se alinea con la misión de SPERO, priorizando proyectos que prometen avances tecnológicos sociales, inclusión financiera y gobernanza descentralizada. Estas fundaciones de inversores suelen estar interesadas en proyectos que no solo ofrecen productos innovadores, sino que también contribuyen positivamente a la comunidad blockchain y sus ecosistemas. El respaldo de estos inversores refuerza a SPERO,$$s$ como un contendiente notable en el dominio de proyectos cripto que evoluciona rápidamente. ¿Cómo Funciona SPERO,$$s$? SPERO,$$s$ emplea un marco multifacético que lo distingue de los proyectos de criptomonedas convencionales. Aquí hay algunas de las características clave que subrayan su singularidad e innovación: Gobernanza Descentralizada: SPERO,$$s$ integra modelos de gobernanza descentralizada, empoderando a los usuarios para participar activamente en los procesos de toma de decisiones sobre el futuro del proyecto. Este enfoque fomenta un sentido de propiedad y responsabilidad entre los miembros de la comunidad. Utilidad del Token: SPERO,$$s$ utiliza su propio token de criptomoneda, diseñado para servir diversas funciones dentro del ecosistema. Estos tokens permiten transacciones, recompensas y la facilitación de servicios ofrecidos en la plataforma, mejorando la participación y la utilidad general. Arquitectura en Capas: La arquitectura técnica de SPERO,$$s$ apoya la modularidad y escalabilidad, permitiendo la integración fluida de características y aplicaciones adicionales a medida que el proyecto evoluciona. Esta adaptabilidad es fundamental para mantener la relevancia en el cambiante paisaje cripto. Participación de la Comunidad: El proyecto enfatiza iniciativas impulsadas por la comunidad, empleando mecanismos que incentivan la colaboración y la retroalimentación. Al nutrir una comunidad sólida, SPERO,$$s$ puede abordar mejor las necesidades de los usuarios y adaptarse a las tendencias del mercado. Enfoque en la Inclusión: Al ofrecer tarifas de transacción bajas e interfaces amigables para el usuario, SPERO,$$s$ busca atraer a una base de usuarios diversa, incluyendo a individuos que anteriormente pueden no haber participado en el espacio cripto. Este compromiso con la inclusión se alinea con su misión general de empoderamiento a través de la accesibilidad. Cronología de SPERO,$$s$ Entender la historia de un proyecto proporciona información crucial sobre su trayectoria de desarrollo y hitos. A continuación se presenta una cronología sugerida que mapea eventos significativos en la evolución de SPERO,$$s$: Fase de Conceptualización e Ideación: Las ideas iniciales que forman la base de SPERO,$$s$ fueron concebidas, alineándose estrechamente con los principios de descentralización y enfoque comunitario dentro de la industria blockchain. Lanzamiento del Whitepaper del Proyecto: Tras la fase conceptual, se lanzó un whitepaper completo que detalla la visión, los objetivos y la infraestructura tecnológica de SPERO,$$s$ para generar interés y retroalimentación de la comunidad. Construcción de Comunidad y Primeras Interacciones: Se realizaron esfuerzos de divulgación activa para construir una comunidad de primeros adoptantes y posibles inversores, facilitando discusiones en torno a los objetivos del proyecto y obteniendo apoyo. Evento de Generación de Tokens: SPERO,$$s$ llevó a cabo un evento de generación de tokens (TGE) para distribuir sus tokens nativos a los primeros seguidores y establecer liquidez inicial dentro del ecosistema. Lanzamiento de la dApp Inicial: La primera aplicación descentralizada (dApp) asociada con SPERO,$$s$ se puso en marcha, permitiendo a los usuarios interactuar con las funcionalidades centrales de la plataforma. Desarrollo Continuo y Alianzas: Actualizaciones y mejoras continuas a las ofertas del proyecto, incluyendo alianzas estratégicas con otros actores en el espacio blockchain, han moldeado a SPERO,$$s$ en un jugador competitivo y en evolución en el mercado cripto. Conclusión SPERO,$$s$ se erige como un testimonio del potencial de web3 y las criptomonedas para revolucionar los sistemas financieros y empoderar a los individuos. Con un compromiso con la gobernanza descentralizada, la participación comunitaria y funcionalidades diseñadas de manera innovadora, allana el camino hacia un paisaje financiero más inclusivo. Como con cualquier inversión en el espacio cripto que evoluciona rápidamente, se anima a los posibles inversores y usuarios a investigar a fondo y participar de manera reflexiva con los desarrollos en curso dentro de SPERO,$$s$. El proyecto muestra el espíritu innovador de la industria cripto, invitando a una mayor exploración de sus innumerables posibilidades. Mientras el viaje de SPERO,$$s$ aún se desarrolla, sus principios fundamentales pueden, de hecho, influir en el futuro de cómo interactuamos con la tecnología, las finanzas y entre nosotros en ecosistemas digitales interconectados.

72 Vistas totalesPublicado en 2024.12.17Actualizado en 2024.12.17

Qué es AGENT S

Agent S: El Futuro de la Interacción Autónoma en Web3 Introducción En el paisaje en constante evolución de Web3 y las criptomonedas, las innovaciones están redefiniendo constantemente cómo los individuos interactúan con las plataformas digitales. Uno de estos proyectos pioneros, Agent S, promete revolucionar la interacción humano-computadora a través de su marco agente abierto. Al allanar el camino para interacciones autónomas, Agent S busca simplificar tareas complejas, ofreciendo aplicaciones transformadoras en inteligencia artificial (IA). Esta exploración detallada profundizará en las complejidades del proyecto, sus características únicas y las implicaciones para el dominio de las criptomonedas. ¿Qué es Agent S? Agent S se presenta como un marco agente abierto innovador, diseñado específicamente para abordar tres desafíos fundamentales en la automatización de tareas informáticas: Adquisición de Conocimiento Específico del Dominio: El marco aprende inteligentemente de diversas fuentes de conocimiento externas y experiencias internas. Este enfoque dual le permite construir un rico repositorio de conocimiento específico del dominio, mejorando su rendimiento en la ejecución de tareas. Planificación a Largo Plazo de Tareas: Agent S emplea planificación jerárquica aumentada por la experiencia, un enfoque estratégico que facilita la descomposición y ejecución eficiente de tareas complejas. Esta característica mejora significativamente su capacidad para gestionar múltiples subtareas de manera eficiente y efectiva. Manejo de Interfaces Dinámicas y No Uniformes: El proyecto introduce la Interfaz Agente-Computadora (ACI), una solución innovadora que mejora la interacción entre agentes y usuarios. Utilizando Modelos de Lenguaje Multimodal de Gran Escala (MLLMs), Agent S puede navegar y manipular diversas interfaces gráficas de usuario sin problemas. A través de estas características pioneras, Agent S proporciona un marco robusto que aborda las complejidades involucradas en la automatización de la interacción humana con las máquinas, preparando el terreno para una multitud de aplicaciones en IA y más allá. ¿Quién es el Creador de Agent S? Si bien el concepto de Agent S es fundamentalmente innovador, la información específica sobre su creador sigue siendo elusiva. El creador es actualmente desconocido, lo que resalta ya sea la etapa incipiente del proyecto o la elección estratégica de mantener a los miembros fundadores en el anonimato. Independientemente de la anonimidad, el enfoque sigue siendo en las capacidades y el potencial del marco. ¿Quiénes son los Inversores de Agent S? Dado que Agent S es relativamente nuevo en el ecosistema criptográfico, la información detallada sobre sus inversores y patrocinadores financieros no está documentada explícitamente. La falta de información disponible públicamente sobre las bases de inversión u organizaciones que apoyan el proyecto plantea preguntas sobre su estructura de financiamiento y hoja de ruta de desarrollo. Comprender el respaldo es crucial para evaluar la sostenibilidad del proyecto y su posible impacto en el mercado. ¿Cómo Funciona Agent S? En el núcleo de Agent S se encuentra una tecnología de vanguardia que le permite funcionar de manera efectiva en diversos entornos. Su modelo operativo se basa en varias características clave: Interacción Humano-Computadora Similar a la Humana: El marco ofrece planificación avanzada de IA, esforzándose por hacer que las interacciones con las computadoras sean más intuitivas. Al imitar el comportamiento humano en la ejecución de tareas, promete elevar las experiencias de los usuarios. Memoria Narrativa: Empleada para aprovechar experiencias de alto nivel, Agent S utiliza memoria narrativa para hacer un seguimiento de las historias de tareas, mejorando así sus procesos de toma de decisiones. Memoria Episódica: Esta característica proporciona a los usuarios una guía paso a paso, permitiendo que el marco ofrezca apoyo contextual a medida que se desarrollan las tareas. Soporte para OpenACI: Con la capacidad de ejecutarse localmente, Agent S permite a los usuarios mantener el control sobre sus interacciones y flujos de trabajo, alineándose con la ética descentralizada de Web3. Fácil Integración con APIs Externas: Su versatilidad y compatibilidad con varias plataformas de IA aseguran que Agent S pueda encajar sin problemas en ecosistemas tecnológicos existentes, convirtiéndolo en una opción atractiva para desarrolladores y organizaciones. Estas funcionalidades contribuyen colectivamente a la posición única de Agent S dentro del espacio cripto, ya que automatiza tareas complejas y de múltiples pasos con una intervención humana mínima. A medida que el proyecto evoluciona, sus posibles aplicaciones en Web3 podrían redefinir cómo se desarrollan las interacciones digitales. Cronología de Agent S El desarrollo y los hitos de Agent S pueden encapsularse en una cronología que resalta sus eventos significativos: 27 de septiembre de 2024: El concepto de Agent S fue lanzado en un documento de investigación integral titulado “Un Marco Agente Abierto que Usa Computadoras Como un Humano”, mostrando las bases del proyecto. 10 de octubre de 2024: El documento de investigación fue puesto a disposición del público en arXiv, ofreciendo una exploración profunda del marco y su evaluación de rendimiento basada en el benchmark OSWorld. 12 de octubre de 2024: Se lanzó una presentación en video, proporcionando una visión visual de las capacidades y características de Agent S, involucrando aún más a posibles usuarios e inversores. Estos marcadores en la cronología no solo ilustran el progreso de Agent S, sino que también indican su compromiso con la transparencia y la participación comunitaria. Puntos Clave Sobre Agent S A medida que el marco Agent S continúa evolucionando, varios atributos clave destacan, subrayando su naturaleza innovadora y potencial: Marco Innovador: Diseñado para proporcionar un uso intuitivo de las computadoras similar a la interacción humana, Agent S aporta un enfoque novedoso a la automatización de tareas. Interacción Autónoma: La capacidad de interactuar de manera autónoma con las computadoras a través de GUI significa un salto hacia soluciones informáticas más inteligentes y eficientes. Automatización de Tareas Complejas: Con su metodología robusta, puede automatizar tareas complejas y de múltiples pasos, haciendo que los procesos sean más rápidos y menos propensos a errores. Mejora Continua: Los mecanismos de aprendizaje permiten a Agent S mejorar a partir de experiencias pasadas, mejorando continuamente su rendimiento y eficacia. Versatilidad: Su adaptabilidad en diferentes entornos operativos como OSWorld y WindowsAgentArena asegura que pueda servir a una amplia gama de aplicaciones. A medida que Agent S se posiciona en el paisaje de Web3 y criptomonedas, su potencial para mejorar las capacidades de interacción y automatizar procesos significa un avance significativo en las tecnologías de IA. A través de su marco innovador, Agent S ejemplifica el futuro de las interacciones digitales, prometiendo una experiencia más fluida y eficiente para los usuarios en diversas industrias. Conclusión Agent S representa un audaz avance en la unión de la IA y Web3, con la capacidad de redefinir cómo interactuamos con la tecnología. Aunque aún se encuentra en sus primeras etapas, las posibilidades para su aplicación son vastas y atractivas. A través de su marco integral que aborda desafíos críticos, Agent S busca llevar las interacciones autónomas al primer plano de la experiencia digital. A medida que nos adentramos más en los reinos de las criptomonedas y la descentralización, proyectos como Agent S sin duda desempeñarán un papel crucial en la configuración del futuro de la tecnología y la colaboración humano-computadora.

338 Vistas totalesPublicado en 2025.01.14Actualizado en 2025.01.14

Cómo comprar S

¡Bienvenido a HTX.com! Hemos hecho que comprar Sonic (S) sea simple y conveniente. Sigue nuestra guía paso a paso para iniciar tu viaje de criptos.Paso 1: crea tu cuenta HTXUtiliza tu correo electrónico o número de teléfono para registrarte y obtener una cuenta gratuita en HTX. Experimenta un proceso de registro sin complicaciones y desbloquea todas las funciones.Obtener mi cuentaPaso 2: ve a Comprar cripto y elige tu método de pagoTarjeta de crédito/débito: usa tu Visa o Mastercard para comprar Sonic (S) al instante.Saldo: utiliza fondos del saldo de tu cuenta HTX para tradear sin problemas.Terceros: hemos agregado métodos de pago populares como Google Pay y Apple Pay para mejorar la comodidad.P2P: tradear directamente con otros usuarios en HTX.Over-the-Counter (OTC): ofrecemos servicios personalizados y tipos de cambio competitivos para los traders.Paso 3: guarda tu Sonic (S)Después de comprar tu Sonic (S), guárdalo en tu cuenta HTX. Alternativamente, puedes enviarlo a otro lugar mediante transferencia blockchain o utilizarlo para tradear otras criptomonedas.Paso 4: tradear Sonic (S)Tradear fácilmente con Sonic (S) en HTX's mercado spot. Simplemente accede a tu cuenta, selecciona tu par de trading, ejecuta tus trades y monitorea en tiempo real. Ofrecemos una experiencia fácil de usar tanto para principiantes como para traders experimentados.

713 Vistas totalesPublicado en 2025.01.15Actualizado en 2025.03.21

Discusiones

Bienvenido a la comunidad de HTX. Aquí puedes mantenerte informado sobre los últimos desarrollos de la plataforma y acceder a análisis profesionales del mercado. A continuación se presentan las opiniones de los usuarios sobre el precio de S (S).