Tsinghua '00s Alumnus Wang Guan's New Work: Disrupting Transformer Pretraining Models with 1/900 Tokens, 1/432 Compute Power

marsbitPublished on 2026-05-26Last updated on 2026-05-26

Abstract

Tsinghua alumnus Wang Guan's team proposes HRM-Text, a novel pre-training paradigm using a Hierarchical Recurrent Model to replace standard Transformers. With just 1B parameters and 40B unique tokens trained at a cost of ~$1500, HRM-Text achieves performance comparable to 2B-7B open-source models, using up to 900x fewer tokens and 432x less estimated compute. Key innovations include a dual-timescale recurrent architecture for greater effective depth, a task-completion objective training only on answer tokens with PrefixLM masking, and techniques like MagicNorm and Warmup Deep Credit Assignment for stability. Evaluations show strong results on benchmarks like MMLU (60.7%) and GSM8K (84.5%). The work highlights how architectural priors and targeted objectives can lower pre-training barriers, though limitations include knowledge-reasoning coupling, fixed compute per token, and scalability beyond 3B parameters.

Breaking the traditional paradigm of large model pretraining, Tsinghua '00s alumnus Wang Guan's team has released a new work:

They used a Hierarchical Recurrent Model (HRM) to replace the standard Transformer, proposing HRM-Text, an efficient pretraining method that goes beyond Scaling.

Paper link: https://arxiv.org/abs/2605.20613

While using approximately 100-900x fewer training tokens and 96-432x less estimated compute compared to the standard baseline model, HRM-Text still achieved performance comparable to open-source models with 2B to 7B parameters.

Furthermore, using 1B parameters, 40B non-repeating tokens, and a training cost of about $1500, HRM-Text achieved the following scores on mainstream benchmarks: MMLU 60.7%, ARC-C 81.9%, DROP 82.2%, GSM8K 84.5%, MATH 56.2%.

Figure|Pretraining efficiency.

Based on this, they clearly propose: Structural priors and targeted training objectives can significantly lower the barrier to pretraining. This training approach makes training a foundational model from scratch feasible.

How is HRM-Text designed?

Large Language Model (LLM) pretraining is increasingly reliant on a few institutions with ample computing and data resources. Training a competitive foundational model often requires trillions of tokens, thousands of GPUs, and even tens of millions of dollars in compute investment.

However, the current training paradigm is inefficient. A large amount of computation is consumed on irrelevant tokens such as prompts, formatting padding, and web noise, resulting in a significant portion of training compute not directly serving inference.

In this work, the research team redesigned the architecture and training objective, making the pretraining of HRM-Text relatively more efficient.

Architecture: Employs a Hierarchical Recurrent Model with dual timescales, splitting computation into a slow H module and a fast L module. While a standard Transformer performs one forward pass per token, HRM performs multiple rounds of recursive updates on the same token. The H and L modules each account for only half of the recursive core parameters. The overall computational load is roughly equivalent to performing 4 recursive unrolls on the same set of parameters, increasing computational depth without adding more parameters.

Training Objective: Abandons the standard full-text autoregressive pretraining. Instead, training is performed directly on instruction-answer pairs, with loss calculated only on the answer part, combined with PrefixLM masking, allowing bidirectional attention on the instruction part and causal masked generation for the answer part.

Figure|HRM-Text architecture.

To enhance the stability of recursive training, the research team introduced MagicNorm and Warmup Deep Credit Assignment.

MagicNorm is a hybrid normalization strategy that leverages the asymmetry between forward and backward computation depths under Truncated Backpropagation Through Time (Truncated BPTT). It uses PreNorm internally within modules and adds an extra normalization layer at the module exit, thereby improving the stability of deep recursive training.

Warmup Deep Credit Assignment, on the other hand, only backpropagates gradients from the last 2 recursive steps during the initial training phase, then linearly extends to the last 5 steps. This training mechanism allows the model to converge stably on shorter credit assignment paths before gradually introducing longer dependencies.

How effective is it?

Experimental results show that HRM-Text demonstrates clear advantages in architecture efficiency, training objectives, and overall performance.

1. Under fixed training compute, is the recurrent architecture more effective?

Results show that under FLOPs-aligned conditions, HRM 1B outperforms Transformer 1B, Transformer 3B, Looped Transformer 1B, and RINS 1B on most benchmarks; comparison with TRM also indicates that HRM training is more stable.

Figure|Comparison of performance and stability with Transformer models. HRM maintained stable training dynamics across all scales, while Transformer models exhibited severe instability at the 1-billion-parameter scale. Furthermore, at the 0.6B scale, HRM achieved competitive performance on most benchmarks using only 2x less computation than Transformer models.

2. Does the task completion objective and PrefixLM help?

Ablation studies show that under FLOPs-aligned conditions, the MMLU score for a 1B Transformer increased from 40.55 with standard autoregressive training, to 47.72 after introducing the task completion objective, to 53.15 after adding PrefixLM, and finally to 60.73 after switching to the HRM architecture.

Figure|Performance comparison between different model architectures and training objectives

3. How does HRM-Text's efficiency compare to contemporary open models?

HRM-Text 1B achieved scores of 60.7, 81.9, 82.2, 84.5, and 56.2 on MMLU, ARC-C, DROP, GSM8K, and MATH respectively. Compared to open models that generally have larger training budgets, it entered the performance range of 2B to 7B open-source models using only 40B unique tokens and 1B parameters; training required up to 900x fewer tokens and up to 432x less compute.

Figure|Evaluation results of HRM-Text 1B compared with contemporary fully open-source models and open-weight models

4. Does the recurrent structure bring greater effective depth?

Results show that the standard Transformer and Looped Transformer stabilize at shallower depths, while HRM maintains more pronounced inter-block representation changes, lower cosine similarity, and higher logit lens KL values even at deeper layers.

Figure|Effective depth analysis.

Figure|Layer-wise Logit Lens KL analysis.

Limitations and Future Directions

Although HRM-Text demonstrates strong performance on inference-intensive tasks, this method still has limitations, and future research directions are proposed.

1. Towards Decoupling "Knowledge" and "Reasoning"

Currently, broader factual knowledge coverage still depends more on model scale and data breadth. HRM-Text was only trained on 40B unique tokens, and explicit knowledge sources constitute only part of the task-formatted mixed data. In the future, researchers need to design a compact reasoning core separately from external fact storage, delegating knowledge breadth to curated corpora, retrieval-augmented modules, or learnable memory.

2. Adaptive Computation Time

The recurrent scheduling of HRM-Text brings greater effective serial depth, but this also means the model must execute a fixed number of recursive steps during inference. A promising future direction is to introduce an adaptive computation time mechanism, allowing simple samples to stop computation earlier and reserving the full recursive budget for difficult samples, thereby reducing inference cost.

3. Current Scaling Validation Scope Remains Limited

The current scaling experiments only cover up to the 3B parameter Transformer control group and the 1B parameter HRM-Text. The research team states that it remains to be verified by subsequent work whether similar efficiency advantages can be maintained at larger model scales.

4. PrefixLM and Inference Frameworks

Currently, PrefixLM still faces certain engineering implementation constraints in practical deployment. Although it can run on standard text generation inference frameworks like vLLM, this requires the framework to support custom attention masks during the prefill stage. Extending it to multi-turn dialogue scenarios further requires designing a KV-cache mechanism that ensures bidirectional visibility within user segments while maintaining causal constraints for the assistant's generation process.

For more technical details, please refer to the original paper.

This article comes from the WeChat public account "Academic Headlines" (ID: SciTouTiao), author: Xia Qiansi

Related Questions

QWhat is HRM-Text and how does it differ from the standard Transformer architecture for pre-training large language models?

AHRM-Text is an efficient pre-training model proposed by a Tsinghua University research team. It uses a Hierarchical Recurrent Model (HRM) instead of the standard Transformer. The key difference is that HRM employs a two-timescale hierarchical recurrence, where each token undergoes multiple recursive updates (via slow 'H' and fast 'L' modules), increasing computational depth without adding parameters. This contrasts with the Transformer's single forward pass per token.

QAccording to the article, what are the key efficiency claims of HRM-Text in terms of training tokens and computational cost?

AThe article claims HRM-Text achieves performance comparable to 2B to 7B parameter open-source models while using approximately 100-900 times fewer training tokens and 96-432 times less estimated computational power compared to standard baseline models. A specific example is a 1B parameter model trained on 40B unique tokens at a cost of around $1,500.

QWhat are the two main design choices in HRM-Text's training objective that contribute to its efficiency?

AThe two main design choices in the training objective are: 1) Training directly on instruction-answer pairs and computing the loss only on the answer part, rather than using standard full-sequence autoregressive pre-training. 2) Employing PrefixLM masking, which allows bidirectional attention on the instruction (prefix) part and causal masking for generating the answer.

QWhat techniques did the researchers introduce to improve the stability of deep recurrent training in HRM-Text?

ATo improve stability for deep recurrent training, the researchers introduced two techniques: 1) MagicNorm, a hybrid normalization strategy using PreNorm inside modules and an extra normalization at the module output, leveraging asymmetry in forward/backward depths under Truncated BPTT. 2) Warmup Deep Credit Assignment, which initially backpropagates gradients only from the last 2 recursion steps and linearly extends to the last 5 steps during training.

QWhat are some of the limitations and future research directions mentioned for HRM-Text?

AThe mentioned limitations and future directions include: 1) Decoupling 'knowledge' and 'reasoning', suggesting a need to combine the compact reasoning core with external factual storage (e.g., curated corpora, retrieval-augmented modules). 2) Exploring Adaptive Computation Time to reduce inference cost for easier samples. 3) Validating the efficiency advantage at larger model scales beyond the current 3B/1B experiments. 4) Addressing engineering challenges for deploying PrefixLM in multi-turn dialogue, such as designing a suitable KV-cache mechanism.

Related Reads

BTC Thrice Rejected at $80,000 Threshold, HYPE Reaches New Highs Signaling Opportunity | Guest Analysis

**Bitcoin (BTC) Struggles at $80k; HYPE Reaches New Highs | Key Analysis & Strategy** Bitcoin faces continued resistance in the $78.5k - $79.5k zone after failing to sustain a break above its daily chart rising channel. It has retreated to the channel's midline. A failure to hold here could see a test of the $73.5k - $75k support area. The 4-hour chart shows a complex corrective structure. The strategy is neutral for mid-term positions. For short-term trading, two scenarios are outlined: **A)** Selling on a failed rally into the $78.5k-$79.5k resistance, and **B)** Selling on a confirmed breakdown below the $73.5k-$75k support, both with tight risk management. Meanwhile, **HYPE** has posted consecutive highs. The 4-hour chart indicates its current uptrend may be weakening near $65, with models showing potential bearish divergence. The view is that a short-term top could be forming. The strategy advises against chasing the rally and instead looking for a potential long entry on a pullback to the $47.5 - $50 support zone, provided clear reversal signals appear. Last week, a disciplined short BTC trade based on model signals yielded a 2.78% profit. The article emphasizes that all analysis is for informational purposes only and not investment advice, highlighting the importance of strict stop-loss discipline and dynamic position management in a volatile market. *(Note: The text references proprietary models like the "Price Difference Trading Model" and "Momentum Quantification Model" for generating trade signals.)*

marsbit51m ago

BTC Thrice Rejected at $80,000 Threshold, HYPE Reaches New Highs Signaling Opportunity | Guest Analysis

marsbit51m ago

Tether's New Business: Helping Small Countries Issue Stablecoins

Tether has announced a partnership with the Georgian government to issue GEL₮, a Lari-pegged stablecoin, aiming to reduce costs, accelerate settlements, and promote cross-border payments. This move is part of Tether's broader strategy to establish a replicable, standardized business of issuing sovereign currency-backed stablecoins for smaller nations, alongside its flagship USDT and other regional offerings like MXNT (Mexican Peso) and CNHT (Offshore Yuan). Georgia represents an ideal test case due to its high reliance on remittances (~15% of GDP), established digital asset regulatory framework aligned with U.S. standards, and prior engagement with Tether. The country gains accelerated internationalization of its currency by accessing Tether's global distribution network and liquidity pools, where GEL₮ can be swapped directly with assets like USDT. For Tether, the immediate financial gain from Georgia's small market is minimal. The true value lies in creating a template. Successfully navigating the compliance, reserve, and redemption processes for GEL₮ allows Tether to replicate this model swiftly for other nations with similar profiles, such as Azerbaijan or Nigeria. The deeper strategy involves subtly integrating these national currencies into an informal USDT-anchored dollar system, positioning Tether as the essential routing infrastructure. This partnership highlights a potential new model: the outsourcing of sovereign currency globalization to private stablecoin issuers. It offers smaller states a faster path to digital currency integration than developing a Central Bank Digital Currency (CBDC). However, it raises significant questions about monetary sovereignty, financial stability risks, and increased dependency on a private entity. If more countries adopt this model in the coming year, Tether could evolve from a stablecoin issuer into a unique, cross-sovereign financial infrastructure service provider.

marsbit55m ago

Tether's New Business: Helping Small Countries Issue Stablecoins

marsbit55m ago

Notion CEO: AI companies should be a 'Jazz Band,' and I am a 'Refounder'

Notion CEO Ivan Zhao, in a recent podcast, shared his journey of twice rebuilding the company from near-collapse and now applying the same "Refounder" mindset to reshape the 1000-person organization in the AI era. He argues that AI has commoditized technical capability (Capability). True talent now hinges on Taste (judgment/values) and Agency (proactive drive), necessitating a shift in hiring—e.g., hiring more juniors for curiosity and having sales candidates demonstrate work upfront. Zhao envisions the company as a "Jazz Band"—agile and improvisational—versus a rigid "Marching Band." This is reflected in an engineering "dumbbell" structure (super juniors + top-tier seniors, with middle layers compressed), dissolving the CMO role to let teams operate directly, and integrating entrepreneurs via acquisitions to lead their expertise areas. Notion has abandoned traditional long-term product roadmaps, planning only conservatively for finances while adopting a week-by-week, improvisational approach to product strategy, as longer plans proved futile during rapid AI shifts. He concludes that while human nature and roles remain constants, companies must rewrite their approaches to hiring (valuing Taste/Agency over Capability), organizational design (reducing roles focused on coordination/execution), and planning (embracing flexibility). Modern knowledge work, being only ~150 years old, is ripe for reinvention.

marsbit56m ago

Notion CEO: AI companies should be a 'Jazz Band,' and I am a 'Refounder'

marsbit56m ago

BTC Faces Triple Resistance at $80,000 Milestone, HYPE Hits New Highs Signaling Potential | Invited Analysis

This weekly analysis maintains a structured framework, focusing on Bitcoin (BTC) and HYPE, dissecting their multi-timeframe price action to identify key support and resistance zones and formulate actionable trading plans. The previous week's short position on BTC yielded a 2.78% gain, reinforcing the "signal-driven, disciplined" approach. For Bitcoin, the core scenario revolves around the battle between the 78,500–79,500 USD resistance zone and the 73,500–75,000 USD support area. The daily chart shows BTC within a rising channel; a failure to hold support at the channel's midline could lead to a test of the lower boundary. The 4-hour chart details an 8-segment corrective structure from the 82,850 USD high. Two short-term strategies are proposed: (A) Selling on a failed rally into the 78.5k-79.5k zone, with a stop above 80,600, or (B) Selling a confirmed breakdown below the 73.5k-75k support, with a stop above 76,500. Medium-term positioning remains neutral. For HYPE, the 4-hour chart indicates a five-wave advance from the May 14th low, now showing potential exhaustion and a top warning signal near 65 USD. The core view is to watch for a potential short-term peak formation. The recommended strategy is to avoid chasing the rally and instead look for a long setup upon a pullback to the 47.5–50 USD support zone, provided clear stabilization and model confirmation signals appear. The report concludes with a detailed review of the prior BTC short trade, executed based on model signals and candlestick patterns, and reiterates strict risk management rules, including immediate stop-loss placement and trailing stops to protect profits. All analysis is presented as a personal trading log, not investment advice.

Odaily星球日报1h ago

BTC Faces Triple Resistance at $80,000 Milestone, HYPE Hits New Highs Signaling Potential | Invited Analysis

Odaily星球日报1h ago

Trading

Spot
Futures

Hot Articles

What is SONIC

Sonic: Pioneering the Future of Gaming in Web3 Introduction to Sonic In the ever-evolving landscape of Web3, the gaming industry stands out as one of the most dynamic and promising sectors. At the forefront of this revolution is Sonic, a project designed to amplify the gaming ecosystem on the Solana blockchain. Leveraging cutting-edge technology, Sonic aims to deliver an unparalleled gaming experience by efficiently processing millions of requests per second, ensuring that players enjoy seamless gameplay while maintaining low transaction costs. This article delves into the intricate details of Sonic, exploring its creators, funding sources, operational mechanics, and the timeline of significant events that have shaped its journey. What is Sonic? Sonic is an innovative layer-2 network that operates atop the Solana blockchain, specifically tailored to enhance the existing Solana gaming ecosystem. It accomplishes this through a customised, VM-agnostic game engine paired with a HyperGrid interpreter, facilitating sovereign game economies that roll up back to the Solana platform. The primary goals of Sonic include: Enhanced Gaming Experiences: Sonic is committed to offering lightning-fast on-chain gameplay, allowing players and developers to engage with games at previously unattainable speeds. Atomic Interoperability: This feature enables transactions to be executed within Sonic without the need to redeploy Solana programmes and accounts. This makes the process more efficient and directly benefits from Solana Layer1 services and liquidity. Seamless Deployment: Sonic allows developers to write for Ethereum Virtual Machine (EVM) based systems and execute them on Solana’s SVM infrastructure. This interoperability is crucial for attracting a broader range of dApps and decentralised applications to the platform. Support for Developers: By offering native composable gaming primitives and extensible data types - dining within the Entity-Component-System (ECS) framework - game creators can craft intricate business logic with ease. Overall, Sonic's unique approach not only caters to players but also provides an accessible and low-cost environment for developers to innovate and thrive. Creator of Sonic The information regarding the creator of Sonic is somewhat ambiguous. However, it is known that Sonic's SVM is owned by the company Mirror World. The absence of detailed information about the individuals behind Sonic reflects a common trend in several Web3 projects, where collective efforts and partnerships often overshadow individual contributions. Investors of Sonic Sonic has garnered considerable attention and support from various investors within the crypto and gaming sectors. Notably, the project raised an impressive $12 million during its Series A funding round. The round was led by BITKRAFT Ventures, with other notable investors including Galaxy, Okx Ventures, Interactive, Big Brain Holdings, and Mirana. This financial backing signifies the confidence that investment foundations have in Sonic’s potential to revolutionise the Web3 gaming landscape, further validating its innovative approaches and technologies. How Does Sonic Work? Sonic utilises the HyperGrid framework, a sophisticated parallel processing mechanism that enhances its scalability and customisability. Here are the core features that set Sonic apart: Lightning Speed at Low Costs: Sonic offers one of the fastest on-chain gaming experiences compared to other Layer-1 solutions, powered by the scalability of Solana’s virtual machine (SVM). Atomic Interoperability: Sonic enables transaction execution without redeployment of Solana programmes and accounts, effectively streamlining the interaction between users and the blockchain. EVM Compatibility: Developers can effortlessly migrate decentralised applications from EVM chains to the Solana environment using Sonic’s HyperGrid interpreter, increasing the accessibility and integration of various dApps. Ecosystem Support for Developers: By exposing native composable gaming primitives, Sonic facilitates a sandbox-like environment where developers can experiment and implement business logic, greatly enhancing the overall development experience. Monetisation Infrastructure: Sonic natively supports growth and monetisation efforts, providing frameworks for traffic generation, payments, and settlements, thereby ensuring that gaming projects are not only viable but also sustainable financially. Timeline of Sonic The evolution of Sonic has been marked by several key milestones. Below is a brief timeline highlighting critical events in the project's history: 2022: The Sonic cryptocurrency was officially launched, marking the beginning of its journey in the Web3 gaming arena. 2024: June: Sonic SVM successfully raised $12 million in a Series A funding round. This investment allowed Sonic to further develop its platform and expand its offerings. August: The launch of the Sonic Odyssey testnet provided users with the first opportunity to engage with the platform, offering interactive activities such as collecting rings—a nod to gaming nostalgia. October: SonicX, an innovative crypto game integrated with Solana, made its debut on TikTok, capturing the attention of over 120,000 users within a short span. This integration illustrated Sonic’s commitment to reaching a broader, global audience and showcased the potential of blockchain gaming. Key Points Sonic SVM is a revolutionary layer-2 network on Solana explicitly designed to enhance the GameFi landscape, demonstrating great potential for future development. HyperGrid Framework empowers Sonic by introducing horizontal scaling capabilities, ensuring that the network can handle the demands of Web3 gaming. Integration with Social Platforms: The successful launch of SonicX on TikTok displays Sonic’s strategy to leverage social media platforms to engage users, exponentially increasing the exposure and reach of its projects. Investment Confidence: The substantial funding from BITKRAFT Ventures, among others, emphasizes the robust backing Sonic has, paving the way for its ambitious future. In conclusion, Sonic encapsulates the essence of Web3 gaming innovation, striking a balance between cutting-edge technology, developer-centric tools, and community engagement. As the project continues to evolve, it is poised to redefine the gaming landscape, making it a notable entity for gamers and developers alike. As Sonic moves forward, it will undoubtedly attract greater interest and participation, solidifying its place within the broader narrative of blockchain gaming.

1.5k Total ViewsPublished 2024.04.04Updated 2024.12.03

What is SONIC

What is $S$

Understanding SPERO: A Comprehensive Overview Introduction to SPERO As the landscape of innovation continues to evolve, the emergence of web3 technologies and cryptocurrency projects plays a pivotal role in shaping the digital future. One project that has garnered attention in this dynamic field is SPERO, denoted as SPERO,$$s$. This article aims to gather and present detailed information about SPERO, to help enthusiasts and investors understand its foundations, objectives, and innovations within the web3 and crypto domains. What is SPERO,$$s$? SPERO,$$s$ is a unique project within the crypto space that seeks to leverage the principles of decentralisation and blockchain technology to create an ecosystem that promotes engagement, utility, and financial inclusion. The project is tailored to facilitate peer-to-peer interactions in new ways, providing users with innovative financial solutions and services. At its core, SPERO,$$s$ aims to empower individuals by providing tools and platforms that enhance user experience in the cryptocurrency space. This includes enabling more flexible transaction methods, fostering community-driven initiatives, and creating pathways for financial opportunities through decentralised applications (dApps). The underlying vision of SPERO,$$s$ revolves around inclusiveness, aiming to bridge gaps within traditional finance while harnessing the benefits of blockchain technology. Who is the Creator of SPERO,$$s$? The identity of the creator of SPERO,$$s$ remains somewhat obscure, as there are limited publicly available resources providing detailed background information on its founder(s). This lack of transparency can stem from the project's commitment to decentralisation—an ethos that many web3 projects share, prioritising collective contributions over individual recognition. By centring discussions around the community and its collective goals, SPERO,$$s$ embodies the essence of empowerment without singling out specific individuals. As such, understanding the ethos and mission of SPERO remains more important than identifying a singular creator. Who are the Investors of SPERO,$$s$? SPERO,$$s$ is supported by a diverse array of investors ranging from venture capitalists to angel investors dedicated to fostering innovation in the crypto sector. The focus of these investors generally aligns with SPERO's mission—prioritising projects that promise societal technological advancement, financial inclusivity, and decentralised governance. These investor foundations are typically interested in projects that not only offer innovative products but also contribute positively to the blockchain community and its ecosystems. The backing from these investors reinforces SPERO,$$s$ as a noteworthy contender in the rapidly evolving domain of crypto projects. How Does SPERO,$$s$ Work? SPERO,$$s$ employs a multi-faceted framework that distinguishes it from conventional cryptocurrency projects. Here are some of the key features that underline its uniqueness and innovation: Decentralised Governance: SPERO,$$s$ integrates decentralised governance models, empowering users to participate actively in decision-making processes regarding the project’s future. This approach fosters a sense of ownership and accountability among community members. Token Utility: SPERO,$$s$ utilises its own cryptocurrency token, designed to serve various functions within the ecosystem. These tokens enable transactions, rewards, and the facilitation of services offered on the platform, enhancing overall engagement and utility. Layered Architecture: The technical architecture of SPERO,$$s$ supports modularity and scalability, allowing for seamless integration of additional features and applications as the project evolves. This adaptability is paramount for sustaining relevance in the ever-changing crypto landscape. Community Engagement: The project emphasises community-driven initiatives, employing mechanisms that incentivise collaboration and feedback. By nurturing a strong community, SPERO,$$s$ can better address user needs and adapt to market trends. Focus on Inclusion: By offering low transaction fees and user-friendly interfaces, SPERO,$$s$ aims to attract a diverse user base, including individuals who may not previously have engaged in the crypto space. This commitment to inclusion aligns with its overarching mission of empowerment through accessibility. Timeline of SPERO,$$s$ Understanding a project's history provides crucial insights into its development trajectory and milestones. Below is a suggested timeline mapping significant events in the evolution of SPERO,$$s$: Conceptualisation and Ideation Phase: The initial ideas forming the basis of SPERO,$$s$ were conceived, aligning closely with the principles of decentralisation and community focus within the blockchain industry. Launch of Project Whitepaper: Following the conceptual phase, a comprehensive whitepaper detailing the vision, goals, and technological infrastructure of SPERO,$$s$ was released to garner community interest and feedback. Community Building and Early Engagements: Active outreach efforts were made to build a community of early adopters and potential investors, facilitating discussions around the project’s goals and garnering support. Token Generation Event: SPERO,$$s$ conducted a token generation event (TGE) to distribute its native tokens to early supporters and establish initial liquidity within the ecosystem. Launch of Initial dApp: The first decentralised application (dApp) associated with SPERO,$$s$ went live, allowing users to engage with the platform's core functionalities. Ongoing Development and Partnerships: Continuous updates and enhancements to the project's offerings, including strategic partnerships with other players in the blockchain space, have shaped SPERO,$$s$ into a competitive and evolving player in the crypto market. Conclusion SPERO,$$s$ stands as a testament to the potential of web3 and cryptocurrency to revolutionise financial systems and empower individuals. With a commitment to decentralised governance, community engagement, and innovatively designed functionalities, it paves the way toward a more inclusive financial landscape. As with any investment in the rapidly evolving crypto space, potential investors and users are encouraged to research thoroughly and engage thoughtfully with the ongoing developments within SPERO,$$s$. The project showcases the innovative spirit of the crypto industry, inviting further exploration into its myriad possibilities. While the journey of SPERO,$$s$ is still unfolding, its foundational principles may indeed influence the future of how we interact with technology, finance, and each other in interconnected digital ecosystems.

54 Total ViewsPublished 2024.12.17Updated 2024.12.17

What is $S$

What is AGENT S

Agent S: The Future of Autonomous Interaction in Web3 Introduction In the ever-evolving landscape of Web3 and cryptocurrency, innovations are constantly redefining how individuals interact with digital platforms. One such pioneering project, Agent S, promises to revolutionise human-computer interaction through its open agentic framework. By paving the way for autonomous interactions, Agent S aims to simplify complex tasks, offering transformative applications in artificial intelligence (AI). This detailed exploration will delve into the project's intricacies, its unique features, and the implications for the cryptocurrency domain. What is Agent S? Agent S stands as a groundbreaking open agentic framework, specifically designed to tackle three fundamental challenges in the automation of computer tasks: Acquiring Domain-Specific Knowledge: The framework intelligently learns from various external knowledge sources and internal experiences. This dual approach empowers it to build a rich repository of domain-specific knowledge, enhancing its performance in task execution. Planning Over Long Task Horizons: Agent S employs experience-augmented hierarchical planning, a strategic approach that facilitates efficient breakdown and execution of intricate tasks. This feature significantly enhances its ability to manage multiple subtasks efficiently and effectively. Handling Dynamic, Non-Uniform Interfaces: The project introduces the Agent-Computer Interface (ACI), an innovative solution that enhances the interaction between agents and users. Utilizing Multimodal Large Language Models (MLLMs), Agent S can navigate and manipulate diverse graphical user interfaces seamlessly. Through these pioneering features, Agent S provides a robust framework that addresses the complexities involved in automating human interaction with machines, setting the stage for myriad applications in AI and beyond. Who is the Creator of Agent S? While the concept of Agent S is fundamentally innovative, specific information about its creator remains elusive. The creator is currently unknown, which highlights either the nascent stage of the project or the strategic choice to keep founding members under wraps. Regardless of anonymity, the focus remains on the framework's capabilities and potential. Who are the Investors of Agent S? As Agent S is relatively new in the cryptographic ecosystem, detailed information regarding its investors and financial backers is not explicitly documented. The lack of publicly available insights into the investment foundations or organisations supporting the project raises questions about its funding structure and development roadmap. Understanding the backing is crucial for gauging the project's sustainability and potential market impact. How Does Agent S Work? At the core of Agent S lies cutting-edge technology that enables it to function effectively in diverse settings. Its operational model is built around several key features: Human-like Computer Interaction: The framework offers advanced AI planning, striving to make interactions with computers more intuitive. By mimicking human behaviour in tasks execution, it promises to elevate user experiences. Narrative Memory: Employed to leverage high-level experiences, Agent S utilises narrative memory to keep track of task histories, thereby enhancing its decision-making processes. Episodic Memory: This feature provides users with step-by-step guidance, allowing the framework to offer contextual support as tasks unfold. Support for OpenACI: With the ability to run locally, Agent S allows users to maintain control over their interactions and workflows, aligning with the decentralised ethos of Web3. Easy Integration with External APIs: Its versatility and compatibility with various AI platforms ensure that Agent S can fit seamlessly into existing technological ecosystems, making it an appealing choice for developers and organisations. These functionalities collectively contribute to Agent S's unique position within the crypto space, as it automates complex, multi-step tasks with minimal human intervention. As the project evolves, its potential applications in Web3 could redefine how digital interactions unfold. Timeline of Agent S The development and milestones of Agent S can be encapsulated in a timeline that highlights its significant events: September 27, 2024: The concept of Agent S was launched in a comprehensive research paper titled “An Open Agentic Framework that Uses Computers Like a Human,” showcasing the groundwork for the project. October 10, 2024: The research paper was made publicly available on arXiv, offering an in-depth exploration of the framework and its performance evaluation based on the OSWorld benchmark. October 12, 2024: A video presentation was released, providing a visual insight into the capabilities and features of Agent S, further engaging potential users and investors. These markers in the timeline not only illustrate the progress of Agent S but also indicate its commitment to transparency and community engagement. Key Points About Agent S As the Agent S framework continues to evolve, several key attributes stand out, underscoring its innovative nature and potential: Innovative Framework: Designed to provide an intuitive use of computers akin to human interaction, Agent S brings a novel approach to task automation. Autonomous Interaction: The ability to interact autonomously with computers through GUI signifies a leap towards more intelligent and efficient computing solutions. Complex Task Automation: With its robust methodology, it can automate complex, multi-step tasks, making processes faster and less error-prone. Continuous Improvement: The learning mechanisms enable Agent S to improve from past experiences, continually enhancing its performance and efficacy. Versatility: Its adaptability across different operating environments like OSWorld and WindowsAgentArena ensures that it can serve a broad range of applications. As Agent S positions itself in the Web3 and crypto landscape, its potential to enhance interaction capabilities and automate processes signifies a significant advancement in AI technologies. Through its innovative framework, Agent S exemplifies the future of digital interactions, promising a more seamless and efficient experience for users across various industries. Conclusion Agent S represents a bold leap forward in the marriage of AI and Web3, with the capacity to redefine how we interact with technology. While still in its early stages, the possibilities for its application are vast and compelling. Through its comprehensive framework addressing critical challenges, Agent S aims to bring autonomous interactions to the forefront of the digital experience. As we move deeper into the realms of cryptocurrency and decentralisation, projects like Agent S will undoubtedly play a crucial role in shaping the future of technology and human-computer collaboration.

695 Total ViewsPublished 2025.01.14Updated 2025.01.14

What is AGENT S

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of S (S) are presented below.

活动图片