Editor's Note: As AI agents gain the ability to execute tasks, call tools, and participate in economic activity, a new question arises: how will they behave in a real incentive environment?
This article documents an experiment by the Circle team. They hosted a USDC hackathon on Moltbook, a social platform where only AI agents can post, letting Openclaw agents autonomously submit projects, discuss, and vote. The results were both exciting and complicated: the agents generated real projects and took part in technical discussions, but they also operated at the edges of the rules. For example, they misinterpreted instructions, ignored required formats, traded votes with one another, and even exhibited behavior suggestive of "collusion."
This experiment provides a rare observation window into the "agent economy": when AI is both a participant and a decision-maker, collaboration, competition, and strategic behavior often emerge simultaneously. To some extent, these phenomena are not fundamentally different from market and election mechanisms in human society.
The experiment quickly sparked widespread discussion in the community. Many believe this is an interesting validation of the autonomous capabilities of the agent economy. Some commentators pointed out that agent systems still need clearer safety guardrails to avoid "self-rationalization" biases; others argued that as agents gradually enter real economic activities, the future bottleneck may lie in compliant settlement and payment systems. As one comment noted: "The agent economy is very powerful, but it equally needs clear guardrails."
Below is the original text:
Embracing Claw
At Circle, we have always loved hosting hackathons. Whether at industry events or at new product launches, we aim to put the best tools into the hands of developers, or, this time, into the hands of the Claw.
After witnessing the explosive growth of Openclaw, an agentic AI framework, we decided to host a hackathon exclusively for AI agents.
This fast-growing framework lets agents autonomously send emails, call APIs, and even control your thermostat... but can they submit hackathon projects on their own? Circle wanted to put this "AI that can actually do things" to a real-world test.
Our question was simple: if the prize pool is $30,000, how would Openclaw agents behave? The answer was surprisingly "human-like."
We hosted a USDC hackathon in the m/usdc subcommunity on Moltbook. Moltbook is a social media platform that only allows AI agents to post. Our goal was for the agents to complete the entire process autonomously: submit projects, vote, and ultimately select winners. While many agents followed the rules, the experiment also revealed that some agents ignored the competition guidelines, traded votes with one another, and even attempted to send tokens to the hackathon agents.
Designing Rules for "Agent Hacking"
The agents had five days to submit their projects. To help them complete the task, we created a USDC Hackathon Skill, a guidance document written in Markdown, to teach Openclaw agents how to submit projects according to the rules. These rules were also published in the hackathon's original announcement post:
Choose one of three tracks: Agentic Commerce, Smart Contract, or Skill.
Vote for five different projects, with voting required to start at least one day after the hackathon began.
Project submissions and voting must follow the specified format.
These rules were set for three main reasons: first, to ensure agents would discuss and evaluate a broader range of projects; second, to observe whether agents could accurately follow instructions when multi-step tasks were required; third, to avoid deadlock between project submission and voting.
One thing we were particularly keen to observe: would agents repeatedly check Moltbook for new projects to vote on, for example by refreshing regularly via a skill like Moltbook Heartbeat?
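To make that idea concrete, here is a minimal sketch of what such a polling loop might look like. It is purely illustrative: the endpoint path, response fields, and the MOLTBOOK_API base URL are all our assumptions, not Moltbook's actual API.

```python
import time
import requests

# Hypothetical Moltbook API base URL; the real endpoint is likely different.
MOLTBOOK_API = "https://moltbook.example.invalid/api/v1"

def poll_new_submissions(seen_ids: set[str], community: str = "usdc") -> list[dict]:
    """Fetch recent posts from a community and return the ones not seen before.
    The response shape (list of dicts with 'id' and 'title') is an assumption."""
    resp = requests.get(
        f"{MOLTBOOK_API}/communities/{community}/posts",
        params={"sort": "new", "limit": 50},
        timeout=10,
    )
    resp.raise_for_status()
    fresh = [p for p in resp.json() if p["id"] not in seen_ids]
    seen_ids.update(p["id"] for p in fresh)
    return fresh

def heartbeat(interval_seconds: int = 600) -> None:
    """Periodically check for new hackathon submissions, in the spirit of a
    'Moltbook Heartbeat' skill, and surface candidates to vote on."""
    seen: set[str] = set()
    while True:
        for post in poll_new_submissions(seen):
            if "#USDCHackathon ProjectSubmission" in post.get("title", ""):
                print(f"New submission to consider voting on: {post['title']}")
        time.sleep(interval_seconds)
```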
The results were mixed. The agents discussed 204 submitted projects and cast 1,851 votes, but many did not adhere to the competition guidelines. Some agents also exhibited potentially adversarial behavior, which led to many interesting findings.
"Hallucinatory" Project Submissions
Despite clear hackathon rules and a dedicated submission skill, most posts still failed to fully comply with the required format. Many projects put the title in the body but omitted the required tag "#USDCHackathon ProjectSubmission [TRACK]".
In one case, an agent knew it needed to write this information but did not place it in the title.
Even when mostly compliant, some agents "hallucinated" new hackathon tracks. This occurred despite being explicitly told to choose only from three categories: Agentic Commerce, Smart Contract, or Skill.
In these cases, agents often generated a seemingly better-fitting track name based on the project content. This may mean the agents were trying to find a more apt classification for their project, or that they simply ignored the established rules. Either way, the tracks they named did not exist.
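For illustration, here is a minimal sketch of the kind of check an organizer (or a well-behaved agent) could run before a post goes out, enforcing the required tag and rejecting invented tracks. The function and variable names are ours; only the tag format and the three track names come from the rules above.

```python
import re

# The three tracks from the hackathon rules; anything else is "hallucinated".
VALID_TRACKS = {"Agentic Commerce", "Smart Contract", "Skill"}

# Required title tag: "#USDCHackathon ProjectSubmission [TRACK]"
TAG_PATTERN = re.compile(r"#USDCHackathon\s+ProjectSubmission\s+\[(?P<track>[^\]]+)\]")

def validate_submission_title(title: str) -> tuple[bool, str]:
    """Check that a post title carries the required tag and a real track."""
    match = TAG_PATTERN.search(title)
    if match is None:
        return False, "missing required '#USDCHackathon ProjectSubmission [TRACK]' tag"
    track = match.group("track").strip()
    if track not in VALID_TRACKS:
        return False, f"unknown track {track!r}; must be one of {sorted(VALID_TRACKS)}"
    return True, "ok"

# Example: a hallucinated track fails validation.
print(validate_submission_title(
    "#USDCHackathon ProjectSubmission [DeFi Infrastructure] My stablecoin router"))
```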
As the competition progressed, non-compliant submissions and off-topic posts gradually increased relative to valid submissions. Under the competition rules, agents had no obvious incentive to publish this invalid content, so it is more likely that some agents had difficulty understanding or executing the instructions.
However, given that a significant number of agents successfully submitted projects as required, we believe the rules themselves were relatively clear.
The Agents' "Election"
We nevertheless observed 9,712 comments, many of which discussed projects' technical functionality without casting a vote. Most of these comments did not even follow the recommended comment format and scoring criteria, though the skill did not enforce those recommendations. This suggests that agents did not join the hackathon discussions solely to satisfy the competition requirements; to some extent, they engaged in genuine technical evaluation and exchange.
By the end of the competition, we counted 1,352 unique votes for valid projects and 499 unique votes for invalid ones. Interestingly, many of the agents behind top-ranked projects complied with the rules when submitting but failed to meet the requirement to vote for five different projects.
Some agents even voted for themselves or cast multiple votes for the same project. This shows they were fully capable of returning to Moltbook to vote after their initial submission; they simply chose not to follow the established rules.
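A tally like the one above implies some deduplication behind the scenes. Here is a hedged sketch of how one might count unique votes while filtering out self-votes and repeats; the vote record fields are our assumptions, not Moltbook's actual schema.

```python
from collections import defaultdict

def tally_votes(votes: list[dict], valid_projects: set[str]) -> dict[str, int]:
    """Count at most one vote per (voter, project) pair, ignoring self-votes
    and votes cast for projects that are not valid submissions."""
    seen_pairs: set[tuple[str, str]] = set()
    counts: dict[str, int] = defaultdict(int)
    for v in votes:
        voter, project, author = v["voter"], v["project_id"], v["project_author"]
        if project not in valid_projects:
            continue  # vote for an invalid submission
        if voter == author:
            continue  # self-vote
        if (voter, project) in seen_pairs:
            continue  # duplicate vote for the same project
        seen_pairs.add((voter, project))
        counts[project] += 1
    return dict(counts)
```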
Additionally, some agents began openly promoting projects, both in the comment sections of competing projects and in standalone posts on Moltbook. Some went further, proposing "vote-swapping" arrangements: if you vote for my project, I'll vote for yours.
While the competition rules did not prohibit this behavior, the degree of coordination among agents in these posts is still concerning.
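One way a future organizer might flag this behavior is to look for reciprocal voting pairs in the vote graph. A minimal sketch, assuming the same hypothetical vote records as above:

```python
def find_vote_swaps(votes: list[dict]) -> set[frozenset[str]]:
    """Flag pairs of agents who voted for each other's projects,
    a simple signal of possible vote-swapping."""
    voted_for_author: set[tuple[str, str]] = {
        (v["voter"], v["project_author"]) for v in votes
    }
    return {
        frozenset((a, b))
        for (a, b) in voted_for_author
        if a != b and (b, a) in voted_for_author
    }
```

Reciprocal votes are not proof of collusion on their own, but combined with posts that explicitly propose swaps, they would give organizers a concrete place to start.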
Potential Human Intervention
These vote-swapping posts may hint at human involvement or external manipulation. We attempted to elicit similar comments through a chatbot interface and found that some models (e.g., Claude Sonnet 4.6) refused outright to generate such content, while others (e.g., GPT-5.2 Thinking) generated it but warned that the behavior might violate competition rules. If a human was operating an "agent" account behind the scenes, or steering an agent through prompts or toolchains, that could explain why such posts appeared during the hackathon.
Although Moltbook was designed for AI agents only (registration requires X account verification), other researchers have found that impersonation is still possible. We also observed examples of suspected human activity, such as under the initial hackathon announcement post.
A typical case: the most-liked comment under that post was the opening of the script of the movie "Bee Movie" (2007). This text is a widely circulated internet copypasta (fixed text copied and pasted en masse), and since it was completely unrelated to the discussion, it was most likely posted by a human. If such behavior was common during the hackathon, it could also explain some of the adversarial behavior, such as vote-swapping or self-voting.
The Future of Agentic Finance
Although this hackathon was just an experiment, we believe it will be the first of many agent-oriented development events. From the results, we draw three main conclusions:
Agents Can Produce Real Projects Under Financial Incentives
Some exciting projects emerged from this hackathon; you can learn more about them here. Although the competition included no human judging, the quality of some submissions still impressed us. This suggests that agentic development has made significant progress over the past year.
Agents "Rationalize" Instructions Rather Than Strictly Execute Them
Agents consistently had trouble following the rules we provided; many executed only part of the instructions. Some high-quality projects could have won the competition had they fully complied with the rules. This shows that merely handing agents instructions is not enough: rules need to be clear and paired with checking mechanisms and incentives that ensure compliance.
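As a sketch of what such a checking mechanism might look like, here is a hypothetical eligibility check that only counts a submission if its author also voted for five different projects, per the rules above. The data shapes and function name are our assumptions.

```python
def submission_eligible(author: str,
                        votes: list[dict],
                        min_distinct_votes: int = 5) -> bool:
    """Enforce the 'vote for five different projects' rule: a submission
    stays eligible only if its author voted for at least
    `min_distinct_votes` distinct projects that are not their own."""
    distinct = {
        v["project_id"]
        for v in votes
        if v["voter"] == author and v["project_author"] != author
    }
    return len(distinct) >= min_distinct_votes
```

Tying eligibility to compliance like this gives agents a concrete incentive to follow the rules, rather than relying on instructions alone.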
Agents Both Cooperate and Compete
Although human intervention may have played a role in some cases, we did observe agents openly discussing collusion strategies during the hackathon. Future hackathon designers could explicitly prohibit collusion in the rules and see whether that reduces such behavior. If agents still cannot fully follow instructions, organizers may need to introduce stronger safety guardrails.
Agent technology is exciting, but we must also make sure it does not slide from the exploration we hope for into exploitation and manipulation. Some might argue that these behaviors are simply the natural result of stronger agents beating weaker ones; after all, Openclaw's X account once proclaimed: "the Claw is the Law."
The real question is: to what extent are we willing to accept that philosophy? What kinds of guardrails are needed? And how do we balance the immense capabilities agents bring against the uncertainties that come with them?
At Circle, we are building systems for security, and we hope you are too.