Who Funds the Agents?

marsbitОпубликовано 2026-06-04Обновлено 2026-06-04

Введение

**Summary: Who Funds AI Agents?** OpenAI recently shut down a feature allowing AI agents to shop for users, highlighting the challenge of creating a secure and regulated environment for agent-driven transactions. While payment infrastructure exists, a crucial governance layer—defining spending limits, fraud detection, tax handling, and return policies—is largely missing. The potential is enormous: AI agents already processed $73M across 176M transactions last year, with McKinsey forecasting this could grow to $3-5T in global consumer commerce by 2030. The core competition isn't just about processing payments, which can be very cheap (especially with crypto-based settlement), but about controlling the rules that govern agent spending. Key players like Stripe and Coinbase are racing to dominate this governance layer. Stripe's acquisition of wallet provider Privy allows it to set spending policies, identity checks, and human-in-the-loop approvals directly at the wallet level. Similarly, Coinbase's stack, including its x402 protocol and AgentKit, embeds governance rules. This vertical integration across settlement, wallet, and governance layers is becoming the dominant strategy. Control over the governance layer is where significant future value lies. If agents handle trillions in transactions, even a small fee for managing compliance, fraud prevention, and policy enforcement could generate billions in annual revenue. The companies that successfully integrate across the payme...

Written by: Prathik Desai

Compiled by: Block unicorn

In March of this year, OpenAI shut down a feature that allowed AI agents to shop on behalf of users. Since its launch, in just five months, fewer than 30 Shopify merchants had used this feature. The payment infrastructure itself was not the problem; the issue lay in the lack of rules to ensure a smooth shopping experience. Which products could agents purchase? Who was responsible for collecting sales tax? How to identify fraudulent behavior? Who would handle returns—all these questions were not properly addressed.

Providing wallets for agents or building payment infrastructure is relatively easy. However, enabling individuals or businesses to use agents for consumption in a trustworthy and regulated manner is not. Only programmability and rules can ensure a trusted environment. This governance-level gap presents an opportunity for the agent economy.

Last year, AI agents processed 176 million transactions, totaling $73 million. While this number might seem insignificant now, McKinsey predicts that by 2030, AI agents will facilitate $3 trillion to $5 trillion in global consumer commercial transactions.

Companies building this economic system are racing to control the governance layer, which includes spending controls, identity verification, and policy enforcement—these determine which agents have the authority to manage budgets.

Today, we will analyze who is building the banking layer for robots and what benefits those who dominate this layer can gain.

Why Own a Multi-Layered Architecture?

The economics of handling agent payments are very low. Over the past 12 months, the average AI agent payment was just 31 cents.

Consider how much profit is left for those processing transactions through multiple backend layers for a 31-cent payment. Stripe's standard pricing is 2.9% plus a 30-cent fixed fee, leaving the merchant with less than one-tenth of a cent. Visa's interchange fees would eat away another third. In contrast, a Layer-2 stablecoin payment system processing the same transaction charges only 0.0001 USD.

These economic factors provide a rationale for the use of cryptocurrency at the settlement layer.

The payment infrastructure at the settlement layer is largely in place. Coinbase's x402 protocol handled the vast majority of last year's 176 million transactions, and currently, about 3,900 merchants accept agent payments. Stripe and Tempo jointly developed a competing protocol—the Machine Pay Protocol (MPP), launched in March, which integrates over 100 services. Google, Visa, and Mastercard also launched agent payment products around the same time. This means five competing payment architectures emerged within just 12 months.

But the problem with agent payments is that no one gets rich processing 31-cent payments. Therefore, value concentrates in the funds in circulation and the enforcement of rules governing how agents make payments.

Last week, we explained how companies capture value by owning the wallet layer that holds AI agents' stablecoin balances. But floating balances are just one of many valuable layers worth capturing. Another value layer is the rules governing how these floating balances are used.

These rules include spending controls, agent identification, policy enforcement, audit trails, and allocating liability for failed transactions. This layer is wide open.

In April this year, American Express launched the "Agent Purchase Protection" plan, an insurance product designed to cover losses from erroneous purchases made by AI agents. This effectively acknowledges the current state of the AI agent governance layer. Solving the missing governance issue holds immense value in an industry projected to reach $3-5 trillion within five years.

This is why incumbent players are now racing to seize control of the governance layer.

But at which level should this layer be built? It could be a bank, a developer API, or even a wallet.

Wallet as the Governance Layer

Every purchase an agent makes must go through a wallet. Therefore, the wallet is the optimal point to implement spending limits, identity verification, and human approval. Control the wallet, and you control the governance. Payment infrastructure company Stripe realized this early on.

In June 2025, Stripe acquired Privy, a company building embedded wallets for consumer crypto applications. Through this acquisition, Stripe gained 75 million wallets distributed across over 1,000 development teams. These wallets now sit at a critical choke point for fund flow, where all policies, spending limits, and human approvals must be executed before funds move.

Stripe has also built an entire agent payment technology stack. It acquired Bridge to handle stablecoin coordination and fiat conversion. Furthermore, it co-incubated Tempo with Paradigm, a Layer-1 blockchain focused on payments. Stripe and Tempo jointly authored the Machine Pay Protocol (MPP), an open standard for how agents request, authorize, and settle payments.

Stripe's Agent-Ready Financial Solutions now allow software to check balances, pay bills, store funds, create virtual cards, and transfer money. Agents can handle routine payments themselves, but any action outside their policy triggers a human review. Fund balances are powered by non-custodial Privy wallets available in over 150 markets.

Even Amazon, when forced to allow its developers to give AI agents spending power, chose two wallet companies—Privy and Coinbase. Not an experienced financial institution like a bank or card network, but a five-year-old wallet provider.

This is because wallets act as the ideal checkpoint, allowing an appropriate level of human intervention to ensure necessary checks and balances.

Keyrock, in its report "Who Pays for the Agent," states that the agent commerce market will "settle on an equilibrium where agents have considerable autonomy but operate within boundaries enforced by cryptography that humans can audit and revoke."

This is the position Privy holds within Stripe's stack. The wallet is responsible for setting the boundaries within which agents must operate.

Here's how governance policies work on this stack.

Privy offers two smart wallet models. In the first, the agent has full control of the wallet and executes transactions within policy constraints without human approval—ideal for fully autonomous agents like trading bots and portfolio managers. In the second, the user retains wallet ownership but grants the agent limited permissions to operate as a signer. The user can revoke access at any time.

Stripe's MPP follows similar governance logic.

The MPP introduces a feature called "Sessions" for high-frequency agent tasks. In Session mode, the agent pre-authorizes a spending budget and then makes continuous payments within that limit without needing a separate request for each on-chain transaction. The MPP has implemented sub-cent billing for LLM inference and per-query billing for data APIs.

This is a level of governance granularity that card networks cannot support.

Vertical Stack Expansion

Although Coinbase's x402 currently leads in AI agent payments, Privy's advantage is less about cryptocurrency itself and more about its distribution moat built through Stripe.

Coinbase has 3,900 merchants accepting agent payments. For every merchant accepting agent payments via Coinbase, Stripe has roughly 1,000. In February this year, Privy stated that if all Stripe merchants opted to accept machine payments, agent commerce could scale today via Privy wallets. Stripe merchants do not need to build custom crypto infrastructure.

Competition between Stripe and Coinbase is intensifying, and other traditional giants have joined the race for vertical expansion, aiming to grow across the entire technology stack.

Keyrock mapped 179 projects across the six layers of the agent payment stack (Settlement, Wallet, Routing, Protocol, Governance, and Application).

Coinbase and Stripe each cover five of these six layers. Circle covers four. Despite their size, Google covers only two layers, and Visa just one.

Over the past twelve months, incumbent payment giants spent over $80 billion filling gaps in their own stacks. Capital One acquired AI-native software platform Brex for $5.15 billion. Mastercard acquired BVNK for $1.8 billion. Among these, the wallet layer and the AI software layer saw the most active M&A activity. Stripe acquired Privy, Fireblocks acquired Dynamic, Arbitrum acquired ZeroDev. In each case, a payment infrastructure provider acquired an independent wallet provider.

Collectively, these deals indicate the market has identified a scarce resource layer. Settlement fees become cheap and interchangeable, but programming permissions, budgets, and liability are where the value lies.

Vertical integration across multiple layers also has compounding effects.

Whoever owns this checkpoint sets the spending rules, intercepts funds before they move, decides which merchants, agents, and apps get trusted access, and charges fees to do all this. We see this in the Privy-Stripe distribution moat.

Even Coinbase's stance reflects this dynamic. Every x402 payment generates USDC demand on its Layer-2, Base, producing float yield. This yield funds more agent tools via AgentKit, which features built-in session caps, single-transaction limits, and allowlists restricting transfers to audited contracts. More agents on AgentKit mean more x402 payments. Each layer influences the others.

Incumbent investment activity is significantly more active.

Coinbase Ventures also invested in Catena Labs, Skyfire, and Payman, three of the most prominent independent governance startups currently. Circle's co-founder Sean Neville founded Catena, and Circle invested in Skyfire. a16z led funding rounds for both. Visa backed Payman and partnered with Skyfire.

The same players building payment settlement infrastructure are funding the governance layer. The idea is that incumbents maximize their returns if governance functions remain part of the existing infrastructure, as Privy built in its two models. If governance becomes a separate layer, they profit via their portfolio.

What Does Controlling the Governance Layer Mean?

Payment processing was never the most valuable position because financial systems eventually commoditize. Once that happens, profit margins shift to where the decision is made about whether a transaction is allowed and under what conditions.

Historically, many industries have undergone the same commodification process.

Consider what happened after the internet commodified cable TV. All Internet Service Providers (ISPs) became nearly identical, almost interchangeable. Therefore, telecom companies had to expand vertically to stay competitive.

India's two largest telecom operators, Jio and Airtel, began bundling hundreds of TV channels, six OTT platform subscriptions, unlimited voice calls, set-top boxes, and free routers into a single broadband package. Similarly, AT&T spent $85 billion to acquire Time Warner, becoming a media-telecom giant. The aim was to combine Time Warner's premium content—like HBO, Warner Bros., and CNN—with AT&T's vast distribution network to compete with streaming platforms like Netflix and Amazon.

When broadband connectivity (the underlying infrastructure) became the cheapest part of the package, value shifted to the combination of content, relationships, and offers that best attracted customers.

We see this happening in crypto as well.

Settlement was supposed to happen at the protocol layer. Think of Ethereum as a shared ledger where everyone settles. After Coinbase launched Base as a faster, less congested Layer-2 chain, it began collecting gas fees from every transaction settled on its own chain. Today, Coinbase earns approximately $60 million annually in sequencer revenue by processing transactions on Base.

Players building agent payment systems have learned this lesson.

In "Active Buoyancy," we explained how an economy can be built by controlling the stablecoin balances agents hold between transactions. This allows companies that control the wallet layer of the stack to add a revenue stream.

The governance layer adds another revenue stream, and potentially a much larger one.

Visa processes $14.2 trillion in payment volume annually, earning a roughly 0.28% take rate. This rate includes not just fees but also the implicit management fee Visa earns for the trust built by preventing fraud, resolving disputes, and enforcing network rules.

Applying even a fraction of this rate to agent transactions gives us an idea of the immense value it brings to companies built on the governance layer. McKinsey's $3 trillion projection for agent transactions by 2030 means that even a governance fee of just 0.1% (about 35% of Visa's take) could generate $3 billion in annual revenue. For reference, Coinbase's total subscription and services revenue in 2025 was approximately $2.8 billion. Revenue from just the agent transaction governance layer alone could match what Coinbase currently earns from staking, custody, and Coinbase One combined.

Companies with a presence in the wallet, settlement, and governance layers of the agent finance stack can profit from idle agent balances (float revenue), settlement fees (sequencer revenue per transaction), and compliance fees (enforcing governance).

This is why vertical integration across the entire technology stack will be the only business model that allows companies to remain competitive in the agent era.

Связанные с этим вопросы

QAccording to the article, what was the main reason OpenAI shut down its AI agent shopping feature in March?

AThe main reason was the lack of proper rules to ensure a smooth shopping experience. Issues like what items agents could buy, who collects sales tax, how to identify fraud, and who handles returns were not properly resolved.

QWhy does the article argue that payment processing for AI agents has low economic value, and what solution does it propose for the settlement layer?

AThe economic value is low because the average AI agent payment over the past 12 months was only 31 cents, leaving very little profit after traditional fees (e.g., from Stripe, Visa). The article proposes that cryptocurrencies, specifically Layer-2 stablecoin payment systems, are well-suited for the settlement layer due to their much lower transaction costs (around $0.0001).

QWhat role does the governance layer play in the AI agent economy, and why is it considered a major source of value?

AThe governance layer manages rules like spending controls, identity checks, policy enforcement, audit trails, and liability assignment for AI agent transactions. It's a major source of value because it decides *if* and *under what conditions* transactions can occur, shifting profit potential away from mere payment processing. Controlling this layer allows companies to charge compliance fees for enforcing these rules.

QHow does Stripe's acquisition of Privy position it in the competition for the AI agent payment stack, particularly regarding the governance layer?

AStripe's acquisition of Privy, a company building embedded wallets, gives Stripe control over a key checkpoint (the wallet layer) where governance policies (spending limits, approvals) are enforced before funds move. This, combined with Stripe's other infrastructure (Bridge, Tempo, MPP protocol), allows for vertical integration across the stack and creates a significant distribution advantage through Stripe's existing merchant network.

QWhat are the potential revenue streams for a company that is vertically integrated across the AI agent financial stack, according to the article's conclusion?

AA vertically integrated company can profit from three main streams: 1) Float income (revenue from idle agent balances held in wallets). 2) Settlement/sequencer fees (revenue from processing transactions on its own layer, like Coinbase does on Base). 3) Compliance/governance fees (revenue from enforcing rules and policies on agent spending).

Похожее

Fei-Fei Li's Team Clarifies the Concept of 'World Models', Sora Merely a Renderer

"World Models" has become a widely used yet confusing term in AI. To address this, a team led by Fei-Fei Li and World Labs proposed a functional taxonomy based on the Partially Observable Markov Decision Process framework. This taxonomy categorizes systems called "world models" into three distinct projections: Renderers, Simulators, and Planners. Renderers, like OpenAI's Sora and other video generation models, focus on producing photorealistic visual outputs for human perception. They prioritize visual fidelity over physical accuracy. Simulators, such as NVIDIA Omniverse, aim to compute precise future environmental states for computational tasks like engineering analysis or digital twins. Planners, like Vision-Language-Action models, take in observations and goals to output executable actions for robots or agents. The article clarifies that most current "world models," including Sora, are primarily Renderers. They generate convincing visuals but lack the core ability to simulate state transitions based on actions, a key requirement for a true world model in classic reinforcement learning definitions. This conceptual confusion has practical implications, leading to potential misalignment in technology selection, investment, and public understanding of AI capabilities. Clear categorization is crucial. It helps enterprises avoid costly mistakes (e.g., using a renderer for robot training), allows investors to accurately assess markets, and enables researchers to build comparable benchmarks. While future systems may integrate these functions, recognizing current boundaries is essential for honest assessment and progress.

marsbit1 ч. назад

Fei-Fei Li's Team Clarifies the Concept of 'World Models', Sora Merely a Renderer

marsbit1 ч. назад

Торговля

Спот
Фьючерсы
活动图片