In-Depth Explanation of ERC-8183: Ethereum's Solution to the AI Agent Trust Challenge

Odaily星球日报Published on 2026-03-10Last updated on 2026-03-10

Abstract

Analysis of ERC-8183: Ethereum's Solution to AI Agent Trust Issues On March 10, the Ethereum Foundation's dAI team and Virtuals Protocol introduced ERC-8183, a new standard designed to enable trustless commercial transactions between AI Agents. This standard addresses the core problem of how two untrusted Agents can securely complete a "hire-deliver-settle" workflow without relying on a centralized platform. ERC-8183 introduces a "Job" concept with three roles: the Client (task publisher), the Provider (task executor), and the Evaluator (task validator). The Evaluator, which can be an AI Agent, a ZK-verifier smart contract, or a multi-sig/DAO, is the core innovation, determining whether a job is completed or rejected based on submitted proofs. A Job progresses through four states: Open (task creation), Funded (client deposits funds into escrow), Submitted (provider submits work), and Terminal (evaluator approves or rejects, funds are distributed accordingly). The standard also supports modular Hooks for added functionality like reputation checks or bidding systems. ERC-8183 complements other standards like x402 (a payment protocol for Agents) and ERC-8004 (an identity/reputation standard). Together, they form a foundational stack for a decentralized, autonomously operating AI Agent economy, with ERC-8183 specifically solving the trust problem in transactions.

Original | Odaily Planet Daily (@OdailyChina)

Author | Azuma (@azuma_eth)

On March 10, the dAI team under the Ethereum Foundation, which focuses on promoting the "deep integration of artificial intelligence (AI) and blockchain," today jointly launched a new standard, ERC-8183, with Virtuals Protocol.

Regarding this standard, Davide Crapis, the AI lead at the Ethereum Foundation, stated that ERC-8183 is one of the missing components in the open Agent economy system being built by the Ethereum community. This standard can be used in combination with x402 and ERC-8004 to serve as infrastructure for secure interactions between Agents. The dAI team will support the adoption of ERC-8183 and is committed to making it a neutral standard.

What Problem Does ERC-8183 Aim to Solve?

According to the introductory article released by Virtuals Protocol, ERC-8183 is specifically designed for commercial transactions between AI Agents. This standard defines a set of on-chain rules that enable two mutually untrusting Agents to complete a business process such as "hire-deliver-settle" without relying on a centralized platform.

The core problem ERC-8183 attempts to solve is: when Agents hire and cooperate with each other, how can transactions be completed without a platform, without legal frameworks, and without human arbitration?

For example, suppose a marketing-oriented Agent A wants to hire an image-generation-oriented Agent B to create a batch of marketing posters. Here lies a commercial trust issue — the two parties do not know each other and have no basis for trust. When should the payment be made? If A pays first, B might go on strike or return unsatisfactory work results; if B works first, A might refuse to pay...

In the traditional internet world, users and businesses also face similar commercial trust issues, and platforms play a key intermediary role — the platform is responsible for escrowing A's funds, judging whether B's service is completed, and handling the final fund release. Familiar platforms like Taobao, JD.com, Meituan, and Didi are essentially this type of platform intermediary.

What the Ethereum Foundation and Virtuals Protocol aim to do is abstract the platform's functions into an on-chain protocol through ERC-8183, executed by smart contracts, thereby assuming a decentralized intermediary role in the Agent economy.

Breakdown of the ERC-8183 Working Mechanism

The operating mechanism of ERC-8183 is not complicated. This standard introduces a new concept called a Job (you can think of it as a "task"). Each Job can be regarded as a complete commercial transaction, which includes three different roles:

  • Client: The "customer," simply put, is the Agent that publishes various tasks;
  • Provider: The "service provider," the Agent responsible for completing the task;
  • Evaluator: The most special role, responsible for judging whether the task is completed.

Here, the Evaluator needs special explanation; the introduction of this role is the core design of ERC-8183. In this standard, the Evaluator is only defined as an on-chain address, but from a broader perspective, this address can correspond to various execution forms.

  • For subjective tasks such as writing, design, or analysis, the Evaluator can be an AI Agent that reads the submitted results, compares them with the initial task requirements, and makes a judgment;
  • For deterministic tasks such as computation, proof generation, or data transformation, the Evaluator can be a smart contract encapsulating a zero-knowledge verifier (ZK verifier). The Provider submits the proof, the Evaluator verifies it on-chain, and automatically calls 「complete」 or 「reject」 to complete or reject the task;
  • In high-value or high-risk task scenarios, the Evaluator can also be a multi-signature account, a DAO, or a validation cluster supported by a staking mechanism.

ERC-8183 does not distinguish between these different forms. The protocol layer only cares about one thing — whether an address calls 「complete」 or 「reject」. What runs behind this address, whether it's an AI Agent driven by an LLM or a ZK circuit, is not within the scope of the protocol's consideration.

Returning to the Job, the lifecycle of each Job will have the following four states, which also correspond to different processes during the operation of ERC-8183.

  • Open: The Client creates the Job in this phase, publishes the task, and specifies requirements;
  • Funded: The Client transfers the commission to a smart contract escrow address, rather than directly to the Provider;
  • Submitted: The Provider completes the work and submits proof;
  • Terminal (Completed / Rejected / Expired): The Evaluator is responsible for reviewing the task, judges whether the task is completed based on the review results (Completed or Rejected), and transfers the funds to the Client or Provider accordingly; if no Provider responds or completes the task within the required time, the funds are returned to the Client.

In addition to the above standard process, ERC-8183 can also achieve more derivative functions through modular extension features called Hooks to correspond to complex commercial use cases in the real world. Hooks are optional smart contracts attached when a Job is created. They can execute custom logic before or after various lifecycles of the Job, such as reputation thresholds, bidding mechanisms, fee distribution, or other special requirements.

How is ERC-8183 Different from x402 and ERC-8004?

From x402 to ERC-8004, and now to ERC-8183, unfamiliar readers might be confused, wondering why a new thing is created every now and then. But in fact, these three are located in three different segments of the AI Agent economy system, aiming to solve different problems.

x402 is an HTTP payment protocol; it aims to solve the problem of enabling AI Agents to pay directly as if calling an API; ERC-8004 is an AI Agent identity and reputation standard; it solves the problem of how to judge whether an Agent is reliable; ERC-8183 targets the commercial transaction环节, aiming to break through the challenge of how to enable two untrusting Agents to complete a transaction.

To put it in one sentence: x402 is responsible for solving "how to pay"; ERC-8004 is responsible for knowing "who the other party is and whether they are reliable"; ERC-8183 is responsible for handling "how to transact with confidence".

The three are not competitive but complementary. They collectively point to the same goal — building a decentralized, self-operating AI Agent economy system.

Trending Cryptos

Related Questions

QWhat is the core problem that ERC-8183 aims to solve?

AERC-8183 aims to solve the core problem of enabling two untrusted AI Agents to complete a commercial transaction (such as hire-deliver-settle) without relying on a centralized platform, addressing the issue of commercial trust in agent-to-agent interactions.

QWhat are the three key roles defined in an ERC-8183 Job?

AThe three key roles are: Client (the agent that publishes tasks), Provider (the agent that completes the tasks), and Evaluator (the address responsible for judging whether the task is completed).

QHow does the Evaluator role function in ERC-8183, and what forms can it take?

AThe Evaluator is a chain address that determines task completion. It can be an AI Agent (for subjective tasks), a smart contract with a ZK verifier (for deterministic tasks), or a multi-signature account/DAO (for high-value scenarios). The protocol only cares whether the address calls 'complete' or 'reject'.

QWhat are the four lifecycle states of a Job in ERC-8183?

AThe four states are: Open (task creation), Funded (commission escrowed), Submitted (proof submitted by Provider), and Terminal (Completed, Rejected, or Expired, determined by Evaluator).

QHow do ERC-8183, x402, and ERC-8004 complement each other in the AI Agent economy?

Ax402 handles payments (how to pay), ERC-8004 handles identity and reputation (who to trust), and ERC-8183 handles secure transactions (how to trade trustlessly). They are complementary components building a decentralized AI Agent economy.

Related Reads

Manus Buyback Plan Emerges: Chinese Investors Plan to Repurchase Equity with $2 Billion, Path to Hong Kong IPO Becomes Clearer

According to a report by The Information, early Chinese investors of Manus, including Tencent, Sequoia Capital China, and ZhenFund, are planning to repurchase the company from Meta for $2 billion—the same price Meta paid in its acquisition last December. This move is a direct response to the Chinese government's prohibition of the foreign acquisition in April. As part of the repurchase plan, Manus is considering establishing a Sino-foreign joint venture within China. This structure is seen as a way to ensure regulatory compliance for its Chinese investors and to pave the way for a future IPO in Hong Kong. Notably, U.S. investor Benchmark will not participate in the buyback, which will concentrate ownership even more among Chinese capital. Since its acquisition by Meta, Manus's business has grown rapidly, with its annualized revenue run rate reportedly increasing four-to-fivefold to $400-$500 million in roughly six months. This strong growth underpins the investors' willingness to repurchase at the original price. Financially, the forced unwinding of the deal may benefit the early investors, allowing them to regain equity at a cost far below the company's current implied valuation, with the added prospect of an independent future listing. However, specific terms of the repurchase, including funding proportions and the joint venture's equity structure, are still under negotiation. This "repurchase-joint venture-Hong Kong IPO" approach could serve as a reference model for other Chinese AI startups navigating cross-border M&A regulations.

marsbit13m ago

Manus Buyback Plan Emerges: Chinese Investors Plan to Repurchase Equity with $2 Billion, Path to Hong Kong IPO Becomes Clearer

marsbit13m ago

STRC Loses Peg by 11%, Can Strategy's Perpetual Motion Machine Keep Running?

The article discusses the significant and concerning depegging of MicroStrategy's (MSTR) preferred stock, STRC. Designed to trade near its $100 target par value, STRC has recently fallen sharply, reaching a low of $83.26 and closing at $88.59, representing an over 11% discount. STRC is a core component of MicroStrategy's financial strategy. As a perpetual preferred stock, it allows the company to raise capital through an "at-the-market" (ATM) issuance program without diluting common shareholders (MSTR). This capital is primarily used to purchase Bitcoin, creating a "capital flywheel": issuing STRC → raising cash → buying BTC → increasing net assets → supporting STRC's value. The flywheel's operation depends on STRC maintaining its $100 price. To enforce this, MicroStrategy employs a dynamic dividend mechanism, recently raising the rate to 11.5% and increasing payout frequency. However, this has failed to halt the depegging, indicating market concerns extend beyond yield. Analysts cite two main reasons. First, technical factors like forced liquidations from leveraged arbitrage trades may have exacerbated the sell-off. Second, and more fundamentally, is waning confidence in MicroStrategy's financial resilience. A JPMorgan report highlighted the company's limited cash relative to its ~$1.7 billion annual dividend obligation, raising liquidity concerns. While MicroStrategy counters that its massive Bitcoin holdings provide decades of coverage, this argument relies on the potential need to sell BTC—a departure from its long-standing "never sell" narrative. The company's recent sale of a small amount of Bitcoin for "testing," despite being framed as minor, has intensified these fears. The persistent depegging threatens to cripple MicroStrategy's primary funding channel. If STRC remains discounted, the company's ability to fund further Bitcoin purchases weakens. Should cash reserves dwindle while financing is constrained, the market may increasingly price in the risk of MicroStrategy becoming a forced seller of Bitcoin to meet obligations. This shift from a major marginal buyer to a potential seller could pose significant downside risk to the broader Bitcoin market.

链捕手21m ago

STRC Loses Peg by 11%, Can Strategy's Perpetual Motion Machine Keep Running?

链捕手21m ago

Behind the AI Scorecards Lies a Chinese 'Question Setter'

Behind the AI scorecards that dominate industry discussions—benchmarks like MMLU-Pro, MMMU, and MMMU-Pro—stands a Chinese-Canadian researcher: Wenhu Chen. As an assistant professor at the University of Waterloo and founder of the TIGER Lab, Chen has become a key "exam-setter" for evaluating large language and multimodal models. Chen first gained broader recognition with MMLU-Pro, a more challenging and stable update to the popular MMLU benchmark. As top models like OpenAI’s o3 began achieving near-perfect scores on the original MMLU, it became difficult to distinguish their true capabilities. MMLU-Pro introduced more complex reasoning questions, expanded answer choices, and filtered out ambiguous or simple items, effectively reintroducing differentiation among state-of-the-art models. His work on MMMU addressed the evaluation of multimodal models, requiring them to integrate visual information (like charts, diagrams, or tables) with textual knowledge across diverse academic subjects. Even the strongest models initially scored only around 56-59%, highlighting significant room for improvement in genuine multimodal reasoning. MMMU-Pro further refined this by preventing models from bypassing visual cues. Chen’s research focus has long been on complex information understanding and reasoning. His background—including a PhD at UC Santa Barbara, research at Google/DeepMind on Gemini, and now a role in Meta’s superintelligence lab—provides deep insight into model development and their potential weaknesses. His TIGER Lab also builds models (e.g., for video understanding and generation), ensuring his evaluation benchmarks are grounded in practical challenges. While AI headlines often spotlight company leaders and product launches, Chen’s work exemplifies the critical, behind-the-scenes contributions of researchers crafting the rigorous standards that define and drive progress in AI capabilities.

marsbit1h ago

Behind the AI Scorecards Lies a Chinese 'Question Setter'

marsbit1h ago

Trading

Spot
Futures

Hot Articles

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of ETH (ETH) are presented below.

活动图片