Hermes Agent Guide: Surpassing OpenClaw, Boosting Productivity by 100x

marsbitDipublikasikan tanggal 2026-04-13Terakhir diperbarui pada 2026-04-13

Abstrak

A guide to Hermes Agent, an open-source AI agent framework by Nous Research, positioned as a powerful alternative to OpenClaw. It is described as a self-evolving agent with a built-in learning loop that autonomously creates skills from experience, continuously improves them, and solidifies knowledge into reusable assets. Its core features include a memory system (storing environment info and user preferences in MEMORY.md and USER.md) and a skill system that generates structured documentation for complex tasks. The agent boasts over 40 built-in tools for web search, browser automation, vision, image generation, and text-to-speech. It supports scheduling automated tasks and can run on various infrastructures, from a $5 VPS to GPU clusters. Popular tools within its ecosystem include the Hindsight memory plugin, the Anthropic Cybersecurity Skills pack, and the mission-control dashboard for agent orchestration. Key differentiators from OpenClaw are its architecture philosophy—centered on the agent's own execution loop rather than a central controller—and its autonomous skill generation versus OpenClaw's manually written skills. Installation is a one-line command, and setup is guided. It integrates with messaging platforms like Telegram, Discord, and Slack. It's suited for scenarios requiring a persistent, context-aware assistant that improves over time, automates workflows, and operates across various deployment environments.

On February 25th, a team called Nous Research quietly released a v0.1.0 on GitHub. Initially, the Hermes model only had a one-line installation command and a one-sentence product positioning: "An agent that grows with you".

At the time, very few people noticed it. Even though Nous Research has a certain reputation in the model community, and their Hermes series models have accumulated 33 million downloads on HuggingFace, the entire developer community's attention was focused on the deified OpenClaw "Crawfish". Gaining stars 33% faster than React to become the historical number one, "Crawfish" became the fastest-growing project in GitHub history, peaking at 710 stars per hour. But at the same time, security researchers were disclosing vulnerabilities at an average rate of 2.2 CVEs per day within the same time window, accumulating 138 security vulnerabilities over 63 days. The entire community began to rethink a question: Can this thing actually be used in a production environment?

Against this backdrop, Hermes Agent, as a competing product, finally found an opportunity and welcomed its own first period of rapid growth.

Hermes wrote a one-click migration tool from OpenClaw into its code. Those developers leaving OpenClaw needed a place to land, and Hermes Agent became a well-recommended choice passed around by word of mouth.

So, starting from early March, Hermes Agent broke into GitHub Trending, reaching a high of 11th place, with stars surpassing 2200. AwesomeAgents called it "the most ambitious open-source Agent release of 2026 so far". Currently, Hermes has 69.9k Stars and 9k Forks on GitHub.

Today, BlockBeats will talk to everyone about what makes this Agent different.

What is Hermes Agent?

Hermes Agent is a self-evolving AI agent built by Nous Research, and it is currently the only Agent with a built-in learning loop.

It can automatically create skills from usage experience, continuously improve these skills during use, actively solidify knowledge into reusable assets, retrieve its own past conversation history, and continuously deepen its understanding of you, the user, across multiple sessions.

So, simply put, the biggest advantage of Hermes Agent is: The more you use it, the smarter and more handy it becomes.

Its positioning is not a programming assistant bound to an IDE, nor is it a chat wrapper for a single API, but a truly autonomous agent that resides on your server, remembers what it has learned, and grows stronger the longer it runs.

Nous Research has positioned itself from the beginning as an open-source first, decentralization-oriented AI lab, aiming to build AI that users can control autonomously, rather than concentrating intelligence in the hands of a few closed companies. Their early work focused on the Hermes model series, while also investing heavily in infrastructure and systems level, exploring technologies like DisTrO for model training across globally distributed consumer-grade GPUs, and simulation environments like WorldSim and Doomscroll for multi-agent interaction and long-term behavior.

The team behind Hermes Agent is the same group that previously built a series of models like Nomos and Psyche.

What are the useful tools?

The core mechanisms of Hermes Agent are its memory system and skill system. The Agent maintains two concise core files: MEMORY.md stores environmental information, conventions, and experiences summarized from past tasks; USER.md stores your preferences and communication style. These two files are automatically injected into the system prompt at the start of each session, acting as the Agent's "long-term working memory". Additionally, all historical sessions are stored in a SQLite full-text search database, allowing the Agent to retrieve conversation content from weeks ago.

Regarding the skill system, each time a complex task is completed (typically involving 5 or more tool calls), the Agent autonomously creates a structured Markdown "skill document" recording the steps, known information, and verification methods for future reuse. Skill files follow a progressive disclosure pattern: the Agent by default only sees the skill name and description (about 3000 tokens), and loads the full content of a specific skill only when needed, thus controlling token consumption.

At the tool level, Hermes Agent comes with over 40 built-in tools, covering web search, browser automation, visual understanding, image generation, text-to-speech, and also supports setting up scheduled tasks via natural language, allowing the Agent to automatically perform periodic work like report generation, data backup, and system monitoring unattended.

Among these, the most popular tools, meaning those with the highest frequency of use by the community users, the most feedback, and based on Hermes's functional architecture and the typical needs of the developer community, the ones ranked at the top are these:

Hindsight is currently the hottest single tool in the ecosystem and is the officially recommended long-term memory plugin for Hermes. It automatically recalls relevant context before each LLM call, supports local PostgreSQL or cloud deployment, and is already integrated into Hermes as a native Memory Provider.

Anthropic-Cybersecurity-Skills is the skill pack with the highest Stars in the ecosystem, containing 753+ structured cybersecurity skills, fully mapping the MITRE ATT&CK framework, suitable for security research and penetration testing scenarios.

mission-control is currently the most popular Agent orchestration dashboard in the ecosystem, supporting Agent fleet management, task distribution, cost tracking, and multi-Agent collaborative workflows, recommended by the community as the standard for production-level deployment.

Hermes Agent Self-Evolution is an evolutionary self-improvement technique using DSPy + GEPA to optimize skills, prompts, and code.

Hermes Workspace is the native workspace for Hermes, integrating a chat interface, terminal, and skill manager, and is the most popular graphical entry point.

Furthermore, it can spawn independent sub-agents, each with its own dialogue context, independent terminal, and Python RPC scripts, enabling zero-context-cost parallel pipelines.

In terms of infrastructure flexibility, it supports six terminal backends: local run, Docker, SSH remote, Daytona serverless, Singularity containers, and Modal cloud functions. Daytona and Modal hibernate when idle, costing almost nothing. You can run it on a $5 VPS or a GPU cluster, issue commands through Telegram, and have it work on cloud servers you never SSH into directly.

Hermes Agent currently constitutes the most direct competition with OpenClaw, as both are open-source Agent frameworks aimed at developers.

The architectural philosophies of the two are截然不同 (distinctly different): OpenClaw's design core is a "control plane", a unified long-running process responsible for managing sessions, routing, tool execution, and state—everything flows through this central controller. Hermes, however, centers around the Agent's own execution loop, building the gateway, task scheduler, tool runtime, etc., all around this "do, learn, improve" iterative cycle.

The difference in the skill systems is particularly significant: OpenClaw's skills are mostly manually written, loaded from different levels like workspace, personal, shared, or plugins; Hermes's approach is to let the Agent generate skills from experience itself, forming a true autonomous learning loop.

How to Install and Use

Getting started is extremely simple. A single command, "curl -fsSL https://raw.githubusercontent.com/NousResearch/hermes-agent/main/scripts/install.sh | bash", completes the installation. It works on Linux, macOS, and WSL2. Hermes Agent automatically completes all configuration without manual operation.

Hermes Official Website

After Hermes Agent is installed, run "hermes setup" to start the setup wizard. Select your model provider (supports Nous Portal, OpenRouter, OpenAI, or any custom endpoint), connect your messaging platform (Telegram, Discord, Slack, or WhatsApp), and then start your first conversation. From the first interaction, Hermes Agent immediately enters learning mode, starting to build memory, create skills, and becoming more capable after each session.

Core commands for daily use include:

hermes (start conversation),

hermes model (select LLM provider and model),

hermes tools (configure which tools to enable),

hermes gateway (start message gateway, connect to Telegram, Discord, etc.),

hermes setup (run the full setup wizard, configure everything at once),

hermes claw migrate (migrate from OpenClaw),

hermes update (update to the latest version),

hermes doctor (diagnose problems);

Hermes Agent is suitable for scenarios including: General AI assistants that need to remember context across sessions and continuously improve capabilities; Custom Agent workflows that need to combine tools, plugins, MCP servers, browsers, or Shell; Deploying Agents on local hardware, cloud VMs, or low-cost serverless infrastructure; and Persistent assistant scenarios that require cross-platform searchable conversation history and acquired skills.

More specifically, you can use it to talk to it on Telegram while it executes tasks on a cloud VM, set up automations and push reports to any platform, have it take over periodic tasks; you can also integrate it into Slack or Discord to provide AI collaboration support for the entire team; or utilize its trajectory export function to generate training data for RL training of next-generation tool-calling models.

Pertanyaan Terkait

QWhat is the core mechanism that makes Hermes Agent unique according to the article?

AIts memory system and skill system. It maintains two core files (MEMORY.md and USER.md) for long-term working memory and autonomously creates structured Markdown skill documents from complex tasks for future reuse.

QWhat was a key factor that contributed to Hermes Agent's initial growth period?

AThe security vulnerabilities discovered in the competing project OpenClaw, which caused the developer community to reconsider its use in production environments. Hermes provided a migration tool for developers leaving OpenClaw.

QName one of the most popular tools in the Hermes Agent ecosystem mentioned in the article.

AHindsight, which is the official long-term memory plugin that automatically recalls relevant context before each LLM call.

QHow does the installation process for Hermes Agent work?

AWith a single command: `curl -fsSL https://raw.githubusercontent.com/NousResearch/hermes-agent/main/scripts/install.sh | bash`. It automatically completes all configuration.

QWhat is the fundamental architectural difference between Hermes Agent and OpenClaw as described in the article?

AOpenClaw is designed around a central 'control plane' that manages everything, while Hermes is built around the agent's own execution loop of 'do, learn, improve', with all components supporting that cycle.

Bacaan Terkait

Setelah Bekerja Sama dengan 35 Proyek DeFi, Pink Brains Menemukan Formula Baru Pemasaran KOL 2026

Setelah bermitra dengan lebih dari 35 proyek DeFi selama tiga tahun, Pink Brains menemukan bahwa kampanye pemasaran yang paling efektif adalah yang memahami perilaku pengguna, bukan sekadar taktik promosi. Pengguna DeFi biasanya menemukan protokol baru melalui postingan media sosial (terutama X), tetapi keputusan mereka didorong oleh data dari platform seperti DefiLlama, DeBank, dan dokumentasi resmi. Pada tahun 2026, minat pengguna terfokus pada tema-tema yang dapat diverifikasi seperti: * **Narasi baru:** Perpetual (kontrak berjangka), RWA (Aset Dunia Nyata), dan konvergensi Crypto×AI, diukur dari pendapatan dan penggunaan nyata (mis., Hyperliquid, Ostium, Bittensor, Venice). * **Airdrop dengan kontribusi nyata,** bukan aktivitas mudah. * **Real Yield (Pendapatan Riil)** yang berasal dari biaya transaksi atau aktivitas ekonomi asli, bukan dari inflasi token. * **Tokenomics yang menangkap nilai** secara langsung dari penggunaan produk, seperti model pembelian kembali dan pembakaran (HYPE, VVV). * **Tempat perdagangan baru** seperti pasar prediksi, kartu koleksi (Collector Crypt), dan iGaming yang terenkripsi. Faktor retensi pengguna yang utama meliputi: 1. **Kegunaan dunia nyata** dalam kehidupan sehari-hari (mis., kartu kripto, neobank). 2. **Tokenomics yang merefleksikan produk** dan memberikan nilai kepada pemegangnya. 3. **Program insentif** yang menghargai penggunaan jangka panjang dan kontribusi otentik, bukan aktivitas jangka pendek. 4. **Pengalaman produk yang baik**, dukungan pelanggan responsif, dan pembangunan komunitas strategis. KOL pemasaran DeFi harus digunakan secara strategis sesuai dengan tahapan perjalanan pengguna (pendidik, pembuat konten, ahli airdrop, spesialis vertikal). Kesalahan umum termasuk menggunakan KOL yang tidak memahami produk, konten yang terlalu umum, dan ketergantungan berlebihan pada beberapa KOL besar. Kesimpulannya, pemasaran DeFi yang sukses mencerminkan perilaku pengguna sebenarnya: penemuan melalui sumber tepercaya, ketertarikan berdasarkan mekanisme yang dapat diverifikasi, dan retensi yang didorong oleh tokenomics kuat dan desain produk yang solid, bukan sekadar jargon pemasaran.

foresightnews_api3m yang lalu

Setelah Bekerja Sama dengan 35 Proyek DeFi, Pink Brains Menemukan Formula Baru Pemasaran KOL 2026

foresightnews_api3m yang lalu

Aturan Berubah Dadakan, Pasar Prediksi BTC Senilai Miliaran di Polymarket Terjebak dalam Kekacauan Penyelesaian

Sengketa penyelesaian kontrak prediksi senilai hampir $150 juta di platform Polymarket memicu kontroversi besar. Kontrak ini mempertaruhkan apakah MicroStrategy (kini Strategy) akan menjual Bitcoin sebelum 31 Mei. Meskipun perusahaan mengonfirmasi penjualan 32 BTC dalam pengajuan reguler 8-K, penjualan terjadi antara 26-31 Mei namun diumumkan pada 1 Juni. Polymarket, melalui oracle UMA, memutuskan kontrak dimenangkan oleh pihak "Tidak" yang bertaruh tidak akan ada penjualan. Alasannya, pengumuman resmi terjadi setelah batas waktu kontrak. Keputusan ini ditentang keras oleh pedagang yang memilih "Ya", seperti pengguna willo2 yang kehilangan $527,000. Mereka menuduh platform mengubah aturan secara sepihak setelah faktanya, karena kontrak tetap diperdagangkan setelah 31 Mei dan aturan tentang tanggal pengumuman tidak jelas sejak awal. Kasus ini menyoroti kelemahan struktural dalam pasar prediksi terdesentralisasi. Mekanisme penyelesaian Polymarket bergantung pada oracle optimis UMA, di mana hasil akhir ditentukan oleh suara pemegang token. Sistem ini dikritik rentan dimanipulasi oleh "paus" besar yang dapat mempengaruhi hasil untuk melindungi posisi mereka sendiri, terlepas dari fakta objektif. Data menunjukkan bahwa dompet teratas sering mengendalikan mayoritas suara, dan banyak pemilih memiliki kepentingan langsung dalam kontrak yang mereka putuskan. Perselisihan ini terjadi di tengah pertumbuhan pesat industri pasar prediksi dan upaya kerasnya untuk mendapatkan legitimasi regulasi. Insiden ini mempertanyakan keandalan mekanisme penyelesaian berbasis token dalam menangani kontrak bernilai tinggi, meninggalkan ketidakpastian bagi para pedagang di ekosistem yang sedang berkembang ini.

foresightnews_api8m yang lalu

Aturan Berubah Dadakan, Pasar Prediksi BTC Senilai Miliaran di Polymarket Terjebak dalam Kekacauan Penyelesaian

foresightnews_api8m yang lalu

Peneliti Ethereum Foundation: Hari Kuantum Akan Tiba, Rencanakan Migrasi Tahan Kuantum Selesai Tahun 2029

**Judul Riset: Hari Kuantum (Q-Day) Mendekat, Ethereum Targetkan Migrasi Tahan Kuantum pada 2029** Pada 31 Maret, tim Google Quantum AI menerbitkan terobosan signifikan dalam algoritma Shor untuk kriptografi kurva eliptis, meningkatkan efisiensi 10x. Mereka menggunakan kurva secp256k1 (dasar tanda tangan Bitcoin/Ethereum) sebagai demonstrasi, menjadi peringatan bagi industri blockchain. Uniknya, detail teknis optimasi ini dirahasiakan dan hanya divalidasi menggunakan bukti tanpa pengetahuan (ZK), sebuah preseden dalam publikasi akademik yang melibatkan koordinasi dengan pemerintah AS. Namun, upaya pembatasan informasi ini memicu "Efek Streisand". Hanya dalam dua bulan, peneliti Prancis André Schrottenloher berhasil mereproduksi logika inti optimasi tersebut. Selain itu, tantangan open-source global (ecdsa.fail) diluncurkan dan dengan cepat memecahkan rekor optimasi algoritma Shor, bahkan melibatkan kontribusi dari amatir dan AI. Di sisi lain, perusahaan startup Oratomic menerbitkan penelitian yang menyimpulkan bahwa dengan arsitektur fisik atom netral, hanya diperlukan 10.000 qubit fisik untuk menjalankan algoritma Shor dan memecahkan kriptografi secp256k1 — angka yang sangat rendah dan mengganggu. Teknologi atom netral ini dianggap layak dan didukung oleh lab baru Google. Berdasarkan analisis informasi publik dan rahasia, penulis memperkirakan ada peluang 50% Hari Kuantum (Q-Day — saat komputer kuantum memecahkan kriptografi komersial) akan tiba sebelum 2032, dan peluang 10% sebelum 2030. Perkiraan resmi AS saat ini (2035) dianggap sudah tidak realistis. Menghadapi ancaman ini, respons panik bukanlah solusi. Migrasi terburu-buru ke kriptografi pascakuantum (PQC) yang belum matang justru berisiko. Tahun 2029 dinilai sebagai jendela waktu yang tepat untuk migrasi bertahap, sebuah target yang juga diadopsi oleh Google, Cloudflare, dan Yayasan Ethereum. Penulis, sebagai peneliti Yayasan Ethereum, saat ini berkonsentrasi pada persiapan migrasi penuh Ethereum ke PQC, yang mencakup lapisan konsensus (tanda tangan BLS), lapisan data (komitmen KZG), dan lapisan eksekusi (tanda tangan ECDSA). Rencananya akan dibangun di atas kriptografi berbasis hash. Yayasan Ethereum juga mengembangkan leanVM, zkVM yang sangat minimalis dan dirancang untuk keamanan tertinggi, serta menawarkan dua program hadiah $1 juta (Proximity Prize dan Poseidon Initiative) untuk mendorong inovasi dalam SNARK berbasis hash dan fungsi hash yang ramah-SNARK.

foresightnews_api9m yang lalu

Peneliti Ethereum Foundation: Hari Kuantum Akan Tiba, Rencanakan Migrasi Tahan Kuantum Selesai Tahun 2029

foresightnews_api9m yang lalu

Peter Todd Peringatkan Teknologi Zcash Terlalu Berisiko Untuk Dorongan Privasi Bitcoin

Pengembang Bitcoin, Peter Todd, menentang usulan untuk memasukkan fitur privasi ala Zcash ke dalam lapisan konsensus Bitcoin, dengan alasan profil risiko kriptografinya terlalu tinggi untuk protokol inti jaringan. Debat ini muncul setelah pengembang ZODL mengungkap masalah yang memengaruhi "shielded pool" Orchard di Zcash, yang memicu diskusi lebih luas tentang privasi, kemampuan audit, dan konsep "osifikasi" Bitcoin. Todd berargumen bahwa sistem akuntansi transparan Bitcoin memungkinkan bug kritis lebih mudah dideteksi dan diperbaiki, seperti pada insiden "value overflow" 2010, karena koin palsu dapat terlihat di blockchain. Sebaliknya, dalam sistem privasi seperti Zcash yang "terlindungi", sebuah bug dapat lebih sulit diamati, diatribusikan, dan dibatalkan, sehingga berpotensi menghancurkan saldo pengguna tanpa mudah dikembalikan. Ia menekankan bahwa meskipun Bitcoin tidak kebal bug, jenis kriptografi yang digunakan Zcash membawa risiko operasional yang jauh lebih tinggi. Todd mencontohkan bahwa sekitar 30% pasokan ZEC sudah berada dalam shielded pool, sehingga kerusakan di sana akan menjadi bencana bagi banyak pengguna. Para pendukung Zcash membantah, menyatakan bahwa bug tidak dapat memengaruhi total pasokan ZEC dan menyoroti bahwa Bitcoin juga pernah mengalami masalah serius. Namun, Todd bersikeras bahwa tidak satu pun insiden Bitcoin mengancam kelangsungan mata uangnya seperti yang mungkin terjadi pada sistem privasi yang kompleks.

bitcoinist1j yang lalu

Peter Todd Peringatkan Teknologi Zcash Terlalu Berisiko Untuk Dorongan Privasi Bitcoin

bitcoinist1j yang lalu

Trading

Spot
Futures
活动图片