It Took Me a Year to See the Bitter Truth About Agent Payments

marsbitОпубліковано о 2026-06-06Востаннє оновлено о 2026-06-06

Анотація

After a year building infrastructure for the Agent economy, engaging with major players like Stripe, Visa, and Coinbase, the author shares a sobering analysis of the current state of Agent payments. The core finding is a stark lack of genuine, immediate demand across most envisioned use cases. The article breaks down four key market segments: 1. **Agent-to-Merchant (Consumer Shopping):** For most product categories (e.g., clothing, electronics), conversational AI shopping is a step backwards from visual e-commerce interfaces. While agents excel at understanding needs, they can't replace side-by-side product comparison. Real merchant interest is defensive "Agent Engine Optimization," not driven by current customer demand. Potential exists for high-frequency, low-decision purchases (like food delivery) or navigating complex store UIs, but these require massive B2C distribution channels dominated by giants like Amazon. 2. **Agent-to-API (Developer Services):** Developers already have subscriptions and billing relationships for APIs (compute, data). Prepaid balances solve micro-payment issues for low transaction volumes. A deeper structural problem is that major SaaS vendors' business models rely on enterprise contracts, resisting granular pay-per-call pricing. While protocols like MPP and x402 serve the long tail of niche services, this market is small and developers are historically low-willingness-to-pay. 3. **Agent-to-Agent:** This remains largely theoretical with minimal...

Author: jessy

Translated by: Jiahuan, ChainCatcher

For the past year, I've been working on building infrastructure for the Agent economy, having conversations with teams at Stripe, Visa, Coinbase, Google, and dozens of startups pushing Agent commerce. I mapped the entire industry, shipped products, and tried to find product-market fit.

There is no real demand yet, and startups face multiple structural problems when entering this space.

Last month, Stripe launched 288 new products at Sessions, with their Agent documentation nearing 40% of total documentation readership. Their Agent commerce marketplace had over 1,000 enabled merchants. Yet, the number of registered Agents that transacted at Sessions was in the single digits.

Visa mentioned that their Agent payment tokens (tokenized payment credentials bound to an Agent for making payments on a user's behalf) currently require 3 to 9 months for KYC approval, and effectively need a minimum revenue threshold of $250 million to qualify. Today, only companies on the level of Amazon or Walmart can close this identity loop.

Coinbase reported 69,000 active Agents and 165 million transactions on the x402 protocol as of April. But independent on-chain analysis shows an actual daily transaction volume of about $17,000, with roughly half being test transactions (according to CoinDesk, March 2026).

Agent to Merchant

We built shop.fast.xyz to directly validate the practical application of concierge-style commerce. It included real products, real merchants, and real transactions.

For most product categories, the current AI shopping user experience is entirely inferior to traditional e-commerce. When you buy clothes, electronics, or furniture, you want to see pictures, browse options, and compare.

The conversational chatbot format is actually a step backwards. You're replacing a rich visual interface with pure text conversation, and humans are fundamentally visual shoppers.

Agents excel in areas we thought would be difficult. They can understand user needs and properly handle instructions like "something like this but cheaper." The model layer works.

But it cannot replace the experience of browsing ten products side by side and picking one. Chat interfaces can be enhanced with carousels and interactive displays, but at that point, you're essentially rebuilding an e-commerce frontend inside a chat window. For visual-driven comparison shopping, we haven't found a compelling reason why a chat interface is better than a native e-commerce interface.

We saw real demand from merchants, but it's a defensive demand.

Merchants want their stores to be queriable by Agents. Not because customers are buying via Agents now, but because they fear being left behind if this becomes the mainstream channel.

It's an "Agent Engine Optimization (AEO)" strategy, but currently it's just a nice-to-have, not a must-have. Merchants are preparing for a wave that hasn't arrived yet.

Conversational commerce does improve the experience in certain scenarios: high-frequency, low-decision-cost purchases where the user already knows what they want. Ordering food delivery is the clearest example. Large market, extremely high frequency, rapid decisions ("order me pad thai from that place I used last time"). Conversational Agents have a chance here.

But the major food delivery platforms haven't opened their APIs. The only route is "computer use": having AI visually navigate and operate the app like a human. This is slow, brittle, and the inference costs simply don't pencil out for a $15 lunch order.

Another breakthrough point lies where: certain shops have excruciatingly complex UI navigation. Layers of discounts, promo codes, loyalty programs, and confusing checkout flows.

An Agent that understands "use my coupon, deduct my rewards points, find the cheapest shipping, operate in my native language" can simplify parts of the internet that are terrible today. This is significant for older users, non-native speakers shopping at foreign online stores, or highly specific scenarios with very niche needs.

Both of these breakthrough points require massive consumer-facing (B2C) distribution. You're competing with DoorDash (the largest food delivery platform in the US, holding 56% market share) and Amazon for the user entry point.

Consumer-scale distribution is an incumbent advantage. The supply side for concierge commerce is ready, while the demand side is constrained by user experience and distribution channels. Building more infrastructure does not solve these two problems.

Agent to API

We spoke with dozens of developers about their actual payment needs. The picture is strikingly consistent: Agent-to-API usage today is routine for compute, inference, and data sources. Developers already have subscription services, archived API keys, and billing relationships with core vendors.

The typical stablecoin argument is: on Stripe, the minimum effective cost for credit card processing is around 2.9% plus 30 cents, making sub-dollar API calls uneconomical. But for the current low transaction volume, prepaid credits solve this. Developers top up their accounts in advance, and the problem disappears.

The deeper problem is the vendor marketplace. Most mainstream SaaS companies do not want to offer fractional-cent ephemeral API access. Their business model is multi-year enterprise contracts. Companies whose revenue relies on large commitment deals resist pricing mechanisms that circumvent their existing model.

Machine commerce is structurally a long-tail market, including smaller services, niche data sources, individual developers, and MCP servers. Protocols like MPP and x402 are well-suited for this segment.

But by definition, this is a market serving power users with special needs, and historically, developers have been one of the least willing-to-pay segments.

When Stripe Projects launched, it partnered with 32 vendor partners like Vercel, Supabase, Cloudflare, Twilio, etc., covering most tools developers use to build and deploy software, all accessible via existing billing systems. The top-of-stack demand for the developer tech stack is already met.

The opportunity for new payment rails exists for everything beyond those top 30 services: the opportunity is real, but its scale is inherently much smaller than the flashy numbers suggest.

The same dynamic applies to content acquisition. Agents are already scraping and summarizing articles, and publishers are fighting back.

But when content monetization happens at scale, it will come through CDN vendors already sitting between publishers and the internet (Cloudflare has already launched AI audit tools for this), or through large-scale licensing deals between publishers and AI labs.

This infrastructure opportunity will ultimately flow to incumbents who already own distribution channels.

Agent to Agent

Agent-to-Agent commerce is a long-term vision, currently almost entirely theoretical, with no one achieving meaningful transaction volume. Startups are tackling the core challenges: Agent discovery, trust establishment, terms negotiation, and dispute resolution.

When this transaction structure materializes, it will look fundamentally different from existing payment rails. Neither party in the transaction contains a human identity. Latency is sub-second. Funds ranging from fractions of a cent to millions of dollars move in the same flow.

There's also multi-party settlement, which doesn't fit the bilateral buyer-seller model existing payment rails assume. Once this happens, we believe it will come fast and big.

This is a long-term bet on dedicated settlement infrastructure, and it's real. But a "real long-term bet" is not the same as "current market."

For months, we were also evangelizing this market and built a full infrastructure around it over the past few years. With our distributed network, we could theoretically scale to over 1 billion TPS with <50ms latency and 10ms average consensus. But we have to meet the market where it actually is today.

Agent to Finance

This is arguably the only category with pre-existing demand. The customer base already exists and is willing to pay. Today, fund managers, finance teams, and DeFi users pay for financial tools. Embedding AI into existing workflows is a natural product evolution.

Agent finance also creates entirely new behaviors. An Agent that can autonomously monitor and rebalance hundreds of positions in real-time operates in ways a human cannot manually replicate. This is not just automation; it's a substantive capability increase.

The challenge is the competitive landscape. The financial industry is heavily regulated and relies on entrenched business relationships. Incumbents have licenses, compliance infrastructure, and client relationships. Startups can find a place in less regulated areas (like DeFi), where incumbents move slowly, or where AI creates capabilities incumbents don't have.

But compared to the other three categories, the competitive dynamics here are more favorable to incumbents, because layering AI on top of existing products and customer bases is far easier than doing the reverse.

The Real Game

So, why are people still building these things? Two reasons.

First, motivation. Industry giants have ample cash flow to bet on a future that may take years to materialize. For them, the cost of being five years early is a rounding error, while the cost of being one year late is existential. So they must build.

Second, cognitive bias. When your core business is payments, every problem looks like a payments problem. The Agent economy needs a payments layer? Build the payments layer.

But payments are just one part of a larger problem. The real challenge isn't how to move money between Agents; it's coordinating work between Agents and humans, verifying the work, and settling on the outcome. Payments are just part of settlement. Settlement is just part of coordination. And coordination is the real prize.

Coordination at scale will naturally spawn settlement mechanisms as a necessity. Payments are just an instrument in the symphony, not the entire score. Companies that solve coordination will subsume payments, not the other way around.

Most incumbents are building defensively for a future where machines transact at scale. Since their funding runway is infinite, the timeline doesn't matter to them.

But startups don't have that luxury. We must go find where the market actually is and can't wait for the wave to hit.

A year of building has led us to an unexpected direction. There is real market activity there, growing fast, and underserved. It sits outside the four categories we painted.

Пов'язані питання

QWhat are the four main categories of Agent transactions discussed in the article, and which one is currently the only category with real, existing demand?

AThe four categories are: 1) Agent-to-Merchant, 2) Agent-to-API, 3) Agent-to-Agent, and 4) Agent-to-Finance. According to the article, Agent-to-Finance is the only category that currently has genuine, existing demand. This is because the customer base (fund managers, finance teams, DeFi users) already exists, is willing to pay, and sees AI integration as a natural evolution of their existing tools and workflows.

QAccording to the author's practical testing, what is a major flaw in the current AI shopping experience compared to traditional e-commerce, and why?

AThe major flaw is that the conversational chat interface of AI shopping is a regression for most product categories, especially for visual, comparison-driven purchases like clothing, electronics, or furniture. The article argues that humans are fundamentally visual shoppers who want to see images, browse options, and compare products side-by-side. A pure-text dialogue interface replaces rich visual interfaces, making the experience inferior. Even if enhanced with carousels, it essentially rebuilds an e-commerce frontend inside a chat window.

QWhy is the 'Agent-to-Agent' business model considered a long-term bet, and what key challenges do startups face in this area?

AAgent-to-Agent is a long-term vision because it remains almost entirely theoretical with no significant transaction volume achieved yet. The key challenges startups are tackling include Agent discovery, trust establishment, terms negotiation, and dispute resolution. When it does materialize, the transaction structure will be fundamentally different from existing payment rails, involving non-human identities, sub-second latency, and complex multi-party settlements. While this represents a real and potentially large-scale future opportunity, it does not constitute a current market for startups that cannot afford to wait.

QWhat are the two main reasons cited in the article for why major companies and startups are still building Agent payment infrastructure despite the lack of current demand?

AThe two main reasons are: 1) Motivation: For industry giants, the cost of being five years early is negligible compared to the existential risk of being one year late. They have ample cash flow to make long-term bets. 2) Cognitive Bias: Companies in the payments business tend to see every problem as a payment problem. They recognize that an Agent economy will need a payment layer, so they build it, focusing on the payment aspect of a much larger challenge.

QWhat is the article's central conclusion about the true challenge and opportunity in the Agent economy, beyond just payments?

AThe central conclusion is that the true, larger challenge is not moving money between Agents, but coordinating work between Agents and humans, verifying the work, and settling the results. Payment is just one part of settlement, and settlement is just one part of coordination. The real opportunity lies in solving the coordination problem. The companies that solve coordination at scale will subsume the payments business, not the other way around. The author's year of work led them to a rapidly growing market area outside the four main categories, focused on this core issue of coordination.

Пов'язані матеріали

AI Investors' 2026 Anxiety: When Models Devour Everything, What Moat Is Left for Startups?

In 2026, a wave of investor anxiety questions the defensibility of AI startups as models improve, fearing that most companies are just "thin wrappers" destined to be absorbed by foundation models or chipmakers. The author argues against this despair, positing that true moats lie not in benchmark performance but in areas models cannot easily reach. The logic of despair is that if models excel at all measurable tasks, only compute and cutting-edge model weights hold lasting value. However, the essay contends that the most valuable work is inherently "untrainable." Benchmarks measure what can be measured and thus optimized for, but real-world correctness often resides in private, complex systems. Examples include legacy codebases, intricate legal transactions, or hospital workflows. This kind of correctness is proprietary, costly to establish, and cannot be validated quickly—it requires time and trust within an organization. As models commodify visible, measurable tasks from both above (labs absorbing scaffolding) and below (saturation by cheaper models), value shifts to "untrainable ground." This encompasses work where correctness is a private truth, locked behind integration barriers, licenses, liability frameworks, and entrenched user habits. Trust and adoption are slow, human-centric processes that smarter models cannot accelerate. Successful companies defend their position by embedding deeply into client operations, owning the definition of "good" within a specific domain (e.g., Harvey in law, OpenEvidence in medicine), and pricing on outcomes rather than tokens. While labs compete fiercely, they are incentivized to keep the application layer vibrant. The future belongs not to those competing on generic benchmarks but to those navigating unscoreable terrain, doing the "unsexy work" of translation between models and messy human realities. The most cited benchmark scores are thus maps of territory about to become worthless, signaling who will lose the right to define what counts as good.

marsbit36 хв тому

AI Investors' 2026 Anxiety: When Models Devour Everything, What Moat Is Left for Startups?

marsbit36 хв тому

Trump's Crypto Empire: A $2.3 Billion Wealth Transfer Experiment

In June 2026, Reuters investigations revealed that since Donald Trump's return to the White House, his family has accumulated roughly $2.3 billion in profits from four core crypto ventures: World Liberty Financial (WLFI), the $TRUMP meme coin, American Bitcoin, and ALT5 Sigma (later renamed AI Financial). Coincidentally, overall investor losses in these projects were estimated to be a similar amount. The businesses, spanning DeFi, stablecoins, meme coins, Bitcoin mining, and digital payments, largely relied not on technological innovation but on converting the political influence and notoriety of the Trump brand into financial assets sold to the market. This marks a dramatic shift from Trump's earlier skepticism of cryptocurrencies. The ventures operated on a similar logic: leveraging the Trump name to generate market hype and trust, attracting investment through token sales or public listings, and enabling the family to capture profits upfront through equity, token allocations, and fees, while later entrants often bore the brunt of the risk as markets cooled. WLFI, the most profitable venture, generated an estimated $1.6 billion for the family, primarily through sales of its locked, illiquid governance token and its USD1 stablecoin. The $TRUMP meme coin, a direct monetization of the presidential IP, brought in over $600 million for Trump-linked entities before its price crashed nearly 97% from its peak. American Bitcoin gained a "Trump stock" premium for its mining operations, and ALT5 Sigma/AI Financial combined Trump, AI, and crypto themes for a temporary valuation surge. The episode underscores how political influence can be packaged into financial assets, creating substantial wealth for promoters while highlighting the risks for investors who base decisions on hype and brand allegiance over fundamental business models and cash flows.

marsbit1 год тому

Trump's Crypto Empire: A $2.3 Billion Wealth Transfer Experiment

marsbit1 год тому

Торгівля

Спот
Ф'ючерси
活动图片