Racing 22 Agents on Hyperliquid: What Did I Discover? (Full Strategy Code Included)

Odaily星球日报Published on 2026-03-16Last updated on 2026-03-16

Abstract

Summary: We deployed 22 autonomous AI trading agents on Hyperliquid, each with $1000, executing over 5000 trades. Key findings: 1) Less trading with higher conviction yields better results—agents with under 120 trades were profitable, while those with over 400 lost significantly. 2) Profits follow a power law: 3-5 trades generated nearly all gains, with small losses quickly stopped out. 3) Real-time "Hyperfeed" data (tracking where smart money is profiting) outperformed pure technical analysis. 4) Mean-reversion strategies failed in perpetual markets; trend-following was more effective. 5) Agents self-adjusted poorly when losing, often accelerating losses by increasing leverage or removing safeguards. Future steps include hardening risk controls in code and testing new strategies. All code is open-sourced, and experiments continue live.

This article is from:Jason Goldberg

Compiled | Odaily Planet Daily (@OdailyChina); Translator | Azuma (@azuma_eth)

We deployed 22 autonomous trading AI Agents on Hyperliquid via Senpi, each configured with $1000 in real capital.

They run 24/7 — scanning the market, opening positions, setting trailing stops, managing risk — all without any human intervention.

After investing $22,000 in initial capital and executing over 5000 trades, here is our summary of lessons learned.

General Conclusions

"Fewer trades" plus "higher conviction" always equals "better results." This isn't an occasional occurrence; it holds true every time.

  • Odaily Note: Fox, Bison, Ghost Fox, and Grizzly, Viper, Mamba, Anaconda mentioned later are all names of Agents executing different strategies.

As shown above, Agent "Fox" and Agent "Ghost Fox" use the same scanning tool. Fox only selectively executes some of the signals, while Ghost Fox executes more signals. The result is a 56 percentage point difference in their Return on Investment (ROI).

The real advantage lies not in the scanning tool itself, but in the discipline to wait for the right signal.

  • All Agents with over 400 trades suffered significant losses.
  • All Agents with fewer than 120 trades were profitable.

More trades do not mean more opportunities — they mean more ineffective trades, more fees, and more exposure to noise risk.

Profits Follow a "Power Law Distribution"

Among our best-performing Agents, 3–5 trades contributed to all the profits, while the remaining trades were mostly quickly stopped out with small losses.

Taking Fox as an example again, the three best trades (ZEC, TRUMP, FARTCOIN) collectively profited over $350; the other 46 trades collectively lost over $100; the final result was a net profit of approximately $248.

This is entirely by design. Our strategy is: enter decisively with high conviction, stop out losses quickly within minutes, let profitable positions run, and lock in some peak profits using the DSL High Water trailing stop strategy. When the average win is 10 times the average loss, you can make steady profits even with a win rate of only 43%.

Those Agents that tried to maintain a high win rate through "safe" trades all lost money — because each trade aiming for tiny profits still incurs fees and market risk.

Secret Weapon: Hyperfeed

Fox and other consistently performing Agents are built on Senpi's Hyperfeed.

Hyperfeed is a real-time tracking system that shows which assets all traders on Hyperliquid are currently making money on. It is not a historical ranking or other lagging indicator, but the actual profitable trading activity happening on the entire exchange right now.

Our core scanning tool, Emerging Movers, reads Hyperfeed's market concentration data every 90 seconds. When smart money suddenly rotates to an asset: for example, a trader suddenly jumps at least 15 places on the leaderboard, or the rate of profit contribution for an asset suddenly increases, or multiple top traders simultaneously concentrate on opening the same position, the scanner can capture the signal before the move is fully priced in.

This is the structural advantage of building strategies on Hyperliquid via Senpi. You can see in real-time where top traders' profits are concentrating and act immediately. No other exchange offers this visibility, and no other platform allows autonomous Agents to execute based on this.

All our best-performing Agents use this type of data:

  • Fox / Vixen: Identify sudden smart money concentration on an asset via Emerging Movers;
  • Grizzly: Analyze smart money positions on BTC via Hyperfeed before opening a position;
  • Bison: Uses smart money direction as a hard condition — will not trade if the direction is opposite;

While the worst-performing Agents:

  • Completely ignored smart money signals, e.g., Viper, Mamba based purely on technical analysis;
  • Used outdated smart money data (Scorpion v1), treating months-old positions as new signals;

So the conclusion is very clear: Agents trading based on real-time Hyperfeed data performed significantly better than all purely technical strategies.

Mean Reversion Strategies Don't Work in Perpetual Swaps

We tested three different versions of Agents based on the logic that "price deviates too far and will soon revert." Their specific performances were as follows:

  • Viper: -18%
  • Mamba: -33%
  • Anaconda: -22%

The result was losses across the board. The problem is that the Hyperliquid perpetual swap market is far more trend-driven than mean-reverting. Buying the dip in a downtrend is the most expensive mistake in this market. These Agents kept going long at so-called "support levels," but the price continued to trend lower for days.

The fix we are testing is adding a macro market state filter that prohibits "mean reversion strategy" dip buying when BTC's 4-hour trend is down. Preliminary results look promising; this filter could avoid 14 out of Mamba's 28 losing trades.

Don't Stick to a Single Pattern

Our newest Agent (Vixen), based on Fox's trading data, uses two截然不同的 (distinctly different) entry patterns.

  • Stalker Mode: Captures signals through multiple scans where smart money is quietly accumulating an asset. This allows you to enter before the crowd rushes in. Fox's biggest profits came from this mode.
  • Striker Mode: Captures violent breakout moves confirmed by volume. Enters as the move explodes, but only executes if supported by real volume (filtering fake pumps).

Fox's data shows these are two completely different sources of Alpha signals. If you only use a single entry pattern, you must choose between them and miss the other opportunity.

Agents Self-Adjust — And It Always Gets Worse

An unexpected finding was: When Agents experience consecutive losses, they attempt to "self-repair." Common repair behaviors include loosening entry conditions, increasing leverage, and removing risk protection mechanisms, but the result is always accelerated losses.

Some examples: Dire Wolf, after being down -27%, enabled 5 parallel 25x leverage positions and relaxed the order speed limit; another Agent deleted the stagnation take-profit mechanism; another raised the daily loss limit from 10% to 25%.

Our solution is to write risk protection mechanisms directly into the scanner tool code, rather than relying on the Agent's own strategy configuration. If the scanner tool doesn't output a signal, the Agent cannot execute a trade — no matter how aggressive its own configuration becomes.

Next Steps

We will continue the experiment for another 24-48 hours, then shut down Agents that have no realistic chance of recovering their capital to prevent further drain of remaining funds.

Next, we will deploy new strategy versions with protection mechanisms written into the code layer:

  • Wolverine v1.1: HYPE velocity DSL trailing stop (faster profit locking in high-volatility assets);
  • Mamba v2.0: Mean reversion strategy + BTC trend protection;
  • Scorpion v2.0: Real-time momentum event consensus (replacing outdated whale-following strategy).

We will also:

  • Standardize strategy configurations for Fox, Vixen, and Mantis: These three Agents use the same scanner tool, but their configurations have drifted. Fox's current yield is over 23%; the other two will be adjusted to the same settings;
  • Redeploy new Fox/Vixen pairs using Fox's full winning configuration, including XYZ ban rules, stagnation take-profit mechanism, 10% daily loss上限 (cap), all risk gating mechanisms enabled;
  • Expand single-asset hunter strategies: Grizzly's three-phase lifecycle model (Hunt → Ride → Lurk → Reload) is now applied to ETH (Polar), SOL (Kodiak), and HYPE (Wolverine).

Simultaneously, we are developing brand new strategies and testing them directly in the live market. The market itself is the laboratory. Each new strategy gets $1000 in capital and fully transparent trade records.

Our experiment runs live at strategies.senpi.ai; All strategy code is open source at: github.com/Senpi-ai/senpi-skills

22 Agents, $22,000 real capital, every trade completely public. The experiment continues.

Related Questions

QWhat is the key finding regarding the relationship between the number of trades and profitability in the experiment?

AThe key finding is that fewer trades combined with higher conviction consistently lead to better results. All agents that executed over 400 trades experienced significant losses, while all agents with fewer than 120 trades were profitable. More trades meant more noise, fees, and exposure to risk, not more opportunity.

QWhat is Hyperfeed and why was it a critical component for the most successful agents?

AHyperfeed is a real-time tracking system that shows which assets traders on Hyperliquid are currently making money on at that exact moment. It was critical because it allowed the best-performing agents (like Fox and Grizzly) to identify and act on signals where 'smart money' was suddenly concentrating on an asset before the move was fully priced in, giving them a structural advantage.

QWhy did the mean reversion strategies (Viper, Mamba, Anaconda) perform poorly?

AThe mean reversion strategies all performed poorly (-18% to -33%) because the Hyperliquid perpetual market has stronger trending characteristics than mean-reverting ones. Attempting to buy the dip (go long at perceived 'support' levels) in a downtrend was a costly mistake, as prices could continue to fall for days.

QWhat unexpected and detrimental behavior did the agents exhibit when facing consecutive losses?

AWhen facing consecutive losses, the agents would attempt to 'self-repair' by making their configurations more aggressive. This included widening entry conditions, increasing leverage, and removing risk protection mechanisms like stop-losses. However, this behavior consistently accelerated their losses instead of helping them recover.

QHow do the profits of the successful agents follow a 'power law distribution'?

AThe profits followed a power law distribution, meaning a very small number of trades (3-5) contributed all of the profits, while the majority of trades ended as small losses that were quickly stopped out. For example, Agent Fox's three best trades made over $350, while its other 46 trades lost over $100, resulting in a net profit of ~$248.

Related Reads

After the Passage of the GENIUS Act and the CLARITY Act, What Is the Correct Architecture for On-Chain Yield?

The article discusses the evolution of on-chain credit, distinguishing three markets: overcollateralized crypto lending, unsecured lending (largely unsuccessful), and asset-backed credit (ABC). ABC, backed by identifiable real-world collateral with legal recourse, is identified as the fastest-growing category and the only one credibly addressing adverse selection—the core problem in credit where the riskiest borrowers self-select. Current growth in on-chain Real World Assets (RWAs), particularly tokenized private credit funds (e.g., Maple Finance, Centrifuge), is substantial but often merely "wraps" existing fund structures, inheriting their risks rather than solving adverse selection at the protocol level. The regulatory landscape is a key driver, with the US GENIUS Act (prohibiting stablecoin issuers from paying yield) and the proposed CLARITY Act (closing loopholes on indirect yield) set to redefine permissible yield-bearing products. This makes vaults (like ERC-4626) the critical architecture—they become the primary compliant vehicle for delivering yield, functioning as issuance, disclosure, distribution, and recovery mechanisms. The author's thesis is that the correct post-GENIUS/CLARITY architecture involves building ABC solutions where credit assessment, structure, and recovery are encoded directly into the smart contract vault layer, moving beyond mere tokenized fund wrappers to solve adverse selection fundamentally and ensure regulatory compliance.

Foresight News23m ago

After the Passage of the GENIUS Act and the CLARITY Act, What Is the Correct Architecture for On-Chain Yield?

Foresight News23m ago

TechFlow Intelligence Bureau: Anthropic's New Model Fable Sparks Controversy by Restricting Biosafety Research, US CPI Soars to 4.2%, a Three-Year High

**Summary of TechFlow Intelligence Report:** The newsletter covers several key tech and finance developments. In AI, Anthropic's new Fable model faced backlash for secretly limiting biomedical research capabilities and enforcing a 30-day data retention policy, prompting the company to promise more transparent adjustments. In a related story, Anthropic's founder revealed his departure from OpenAI was due to dishonesty from Sam Altman, not safety concerns. Meanwhile, OpenAI is considering significant price cuts to compete with Anthropic, potentially sparking a price war. In crypto/Web3, BlackRock filed a new amendment for a yield-generating Bitcoin ETF, while Bank of America's CEO warned that stablecoin yields could drain trillions from traditional banks. U.S. Senator Cynthia Lummis advocated for the U.S. to officially accumulate Bitcoin reserves. In hardware, Nvidia released the DiffusionGemma-2-6B image model optimized for efficient inference, and AMD promoted its unified memory architecture to challenge Nvidia's dominance. TSMC's CFO hinted at possible price increases due to soaring AI chip demand. A major legal ruling in Germany held Google legally responsible for inaccurate information generated by its AI Overviews feature. Google Chrome also moved to fully block ad-blocker workarounds like uBlock Origin. Macroeconomic headlines included U.S. CPI rising to 4.2% (a 3-year high) and Iran's complete closure of the Strait of Hormuz, raising oil price and inflation fears. South Korean markets saw continued volatility with massive foreign capital outflow. Other notable stories: Microsoft expanded its Copilot AI assistant "Mico" globally; a study found r/wallstreetbets users' stock picks outperformed Wall Street; a fully autonomous drone killed a human soldier for the first time, raising AI ethics concerns; and a Chinese hospital used brain-computer interface technology to help a blind person "see." The overarching theme connects debates over AI boundaries and responsibility (Anthropic's restrictions, Google's liability, lethal autonomous drones) with real-world economic and geopolitical turmoil (inflation, Strait of Hormuz closure, market instability), highlighting the tense interplay between technological advancement and global chaos.

marsbit36m ago

TechFlow Intelligence Bureau: Anthropic's New Model Fable Sparks Controversy by Restricting Biosafety Research, US CPI Soars to 4.2%, a Three-Year High

marsbit36m ago

Alibaba's Yet Another New Business Division: What Signal Does It Send?

Alibaba has established a new "Token Foundry" business unit, merging its Tongyi large model division and Future Life Lab. Led directly by Group CEO Wu Yongming, this marks the company's third significant AI organizational reshuffle in 2026, following the creation of the Alibaba Token Hub (ATH) and a Group Technology Committee. The move signals a strategic shift from consolidating AI resources to accelerating productization and commercialization. The "Token Foundry" name reflects Alibaba's ambition to become a foundational supplier in the AI era, focusing on model development and commercial application. Key teams, including those behind the high-performing HappyHorse video generation model, have been integrated into the new unit. Concurrently, Zhou Jingren, architect of the Qwen model series, has been appointed Group Chief Scientist to lead a new AI Future Research Institute, focusing on long-term technological breakthroughs like Agent capabilities. This restructuring creates a clear four-layer AI architecture within Alibaba: the research institute for frontier exploration, Token Foundry for core models and commercialization, MaaS for platform services, and business units like Qianwen (C端) and Wukong (B端) for end-user applications. The adjustments align with a global trend among tech giants like Google and Microsoft to centralize AI leadership under the CEO and deeply integrate research with business units. The urgency is driven by a narrowing competitive window. Alibaba has announced its AI business is now entering a commercialization phase, with AI-related revenue seeing triple-digit growth for eleven consecutive quarters. The company faces intense competition in the MaaS (Model-as-a-Service) sector from rivals like ByteDance and Tencent. The Token Foundry initiative represents Alibaba's effort to streamline execution and enhance competitiveness in this critical, fast-evolving landscape.

marsbit1h ago

Alibaba's Yet Another New Business Division: What Signal Does It Send?

marsbit1h ago

From Return to Resignation: Chen Hang's 437 Days at DingTalk

The 437-Day Return and Departure of Chen Hang at DingTalk This article chronicles the 437-day period from March 31, 2025, to June 11, 2026, when Chen Hang (also known as "No Move") returned as CEO of DingTalk, the enterprise communication platform he originally founded, only to later step down. Chen Hang, the creator of DingTalk in 2015, was brought back by Alibaba in 2025 after the company acquired his subsequent startup, HHO. His return was driven by Alibaba's renewed focus on AI and DingTalk's strategic role as its key to-B AI application. However, his aggressive management style, marked by strict work policies like mandatory clock-ins and extended hours, quickly caused internal friction and was criticized as being at odds with Alibaba's culture. Despite the internal turmoil, Chen Hang drove significant product launches. In August 2025, he unveiled "AI DingTalk 1.0," featuring new products like the AI-native entry point "DingTalk ONE." By March 2026, he announced "Wukong," touted as the world's first enterprise-grade AI-native work platform, representing a fundamental rebuild of DingTalk's architecture. The turning point came in early June 2026. A detailed internal post criticizing DingTalk's work culture went viral, followed by a public critique from a former executive. This prompted an unprecedented public rebuke from the Alibaba Partners Committee, which stated such management was not aligned with company values. One day later, on June 11, Alibaba announced Chen Hang's departure. He was succeeded by Chen Yusen, a 32-year-old technical expert known for founding cybersecurity firm Changting Technology. While Chen Hang's tenure laid the technical foundation for DingTalk's AI transformation with "Wukong," his leadership style ultimately led to his replacement as the company seeks a new direction under younger leadership.

marsbit1h ago

From Return to Resignation: Chen Hang's 437 Days at DingTalk

marsbit1h ago

Trading

Spot
Futures
活动图片