AlphaGo's Creator Puts AI into a 23-Year-Old Artificial Society: All Three Toughest Challenges for AI Agents Are Here

marsbitPublished on 2026-05-25Last updated on 2026-05-25

Abstract

Demis Hassabis, CEO of DeepMind, has embarked on a new AI research venture by partnering with the long-running space MMO, EVE Online. This collaboration, announced in early May, aims to use the game's 23-year-old, player-driven persistent universe as a testbed for tackling three core challenges in AI agent research: long-horizon planning, memory, and continual learning. Unlike previous DeepMind environments like AlphaGo (Go) or AlphaStar (StarCraft II), EVE Online features no fixed end state. Its single-shard universe has fostered complex, emergent player societies with real economies, political alliances, and wars that can span months or years. These conditions naturally demand the very skills—long-term strategic planning, maintaining memories over extended periods, and adapting to constant change—that are hardest for current AI agents to master. The research will initially use an offline version of EVE, providing a controlled, complex sandbox without interfering with the live player server. This move continues DeepMind's trajectory of using increasingly complex and open-ended virtual worlds for AI training, from Atari games and Go to StarCraft II and the SIMA project. The EVE environment represents a significant step towards testing AI in a persistent, socially complex, and continuously evolving world shaped by human behavior over decades.

DeepMind CEO and AlphaGo creator Demis Hassabis has been using games for AI research for over a decade.

This time, he has thrown AI into a "living universe" that has been running for 23 years: the space-themed massively multiplayer online game EVE Online, a game whose new player tutorial alone can deter players.

Chess games have an end, but EVE does not.

In early May, DeepMind officially announced a research collaboration with EVE Online for a simple reason: EVE's complex, player-driven universe is the perfect safe sandbox to test AI memory, continual learning, and long-term planning.

DeepMind's collaboration with EVE is not about pursuing fun gameplay or enhancing game mechanics. Instead, it aims to tackle the three toughest, most widely recognized challenges in current AI agent research. Hassabis is betting on finding answers in a 23-year-old game.

Fenris Creations (formerly CCP Games) announces partnership with DeepMind

On the same day, May 6th, the company behind EVE Online announced four things:

Regained independence from its parent company Pearl Abyss;
Renamed to Fenris Creations;
Completed a $120 million transaction;
As part of this independence, Google acquired a minority stake in Fenris Creations and simultaneously initiated a research partnership with Google DeepMind.

Fenris Creations CEO Hilmar Veigar Pétursson stated in the announcement:

This transition does not involve layoffs or restructuring. The team, products, and development plans remain unchanged. EVE continues.

Looking at operational figures, this company came to the table with "real ammunition" for collaboration, not to sell assets for survival.

EVE Online's revenue in 2025 exceeded $70 million, with November setting a historical revenue record, and Q4 becoming the second-highest revenue quarter in the game's 20-year history.

Fenris Creations' independence means EVE now has a parent company that can autonomously decide on research collaborations, no longer constrained by the strategic goals of a larger game publishing company.

A box of a board game product published by Fenris in 1997. The name "Fenris" predates EVE Online by 6 years. Renaming to Fenris Creations is a look back, not a fresh start.

Why did DeepMind choose EVE?

A 23-Year "Artificial Society"

An AI Benchmark Difficult to Replicate

When many people hear "games + AI research," their first thought is of AlphaGo or AlphaStar. EVE is different from both.

Go and StarCraft share a common characteristic: a match has a beginning, an end, and clear win/lose rules.

AlphaGo's goal was to win a Go game. AlphaStar's goal was to win a StarCraft match. Both represent a "single-game intelligence" research paradigm. But EVE has no endgame.

EVE Online is famous for its "single-shard / single shared universe," where a vast number of players compete, trade, form alliances, and wage war in a persistent world over the long term.

Players here have built real economic systems, political alliances, military coalitions, trade routes, historical grudges, and warfare plans that span years.

Some campaigns take an entire year from preparation to conclusion. The rise and fall of some alliances are studied by later players as real history.

Hilmar stated in the announcement: "EVE is one of the few places where we can explore questions of intelligence in an environment that already operates like the real world."

Hassabis further explained that he has played games since childhood, his career started with designing AI simulation games, and his work on AlphaGo, AlphaStar, and SIMA has been deeply tied to games. EVE is the choice for the next stage:

I'm thrilled to partner with Fenris Creations to safely explore new game experiences and advance AI research within this player-created, uniquely complex universe.

Most AI benchmarks are like medical checkups. EVE is more like throwing AI into an "artificial society" that has been running for 23 years.

The Three Toughest Challenges for Agents

Happen to be Daily Life for EVE Players

The official announcement explicitly lists three research directions: long-horizon planning, memory, and continual learning.

These three directions are widely acknowledged as the three toughest challenges in current AI agent research.

If you know someone who has played EVE Online for over ten years, ask them to open their account and show you their friend list. You'll likely see dozens of groups and hundreds of names, with notes in the remarks field like "Debt owed from the 2018 Delve campaign," "Traitor within Goonswarm, do not cooperate," "This guy is a spy, everyone in the corp knows."

This isn't a context window; it's cross-session long-term memory spanning at least a decade.

EVE players navigate the memory challenge every day. The continual learning challenge is the same.

In January 2014, the B-R5RB battle lasted about 21 hours, involving over 7,500 characters, the destruction of 75 Titans, with losses equivalent to roughly $300,000 in real-world currency. The trigger for the entire battle was a sovereignty bill that failed to auto-pay.

After this battle, the entire game's fleet tactics were rewritten. Alliance fleet compositions and tactical systems for years after revolved around post-battle analysis and iteration. Updates were made monthly, with every failure broken down into actionable strategic updates.

As for long-horizon planning, the standard time unit for EVE alliance warfare isn't hours; it's months. From preparation to execution, a cross-regional war involves shipbuilding, logistics, diplomacy, infiltration, and counter-espionage, with hundreds of players spontaneously collaborating without any task manager to advance a common goal over months.

This collaborative system evolved organically from the players over 23 years.

The three hardest challenges recognized in current AI agent evaluation happen to be the daily life of EVE players.

Twenty-three years of player-driven evolution in EVE have produced an environment that is always changing, always complex, with no shortcuts. This level of complexity cannot be synthetically created in a lab.

DeepMind's SIMA 2, released in November 2025, has evolved from "executing instructions" to "understanding goals, reasoning about processes, and learning while playing."

From a research question perspective, the EVE project shares the same "games as a training ground for agents" path as SIMA 2. The difference is that the venue has been swapped for a real universe that has been running for 23 years.

In-game battle scene from EVE Online. These large-scale, player-organized battles, often lasting for hours, are the core reason DeepMind chose EVE as a research environment for long-horizon planning and continual learning.

DeepMind is Entering an Offline Sandbox

Not the Live Player Universe

DeepMind's collaboration method with Fenris is more conservative than one might imagine. DeepMind does not have direct access to the live player servers.

DeepMind officially stated in the announcement: Initial research will be conducted on an offline version of EVE Online, using local servers in a controlled environment to test and evaluate models, without connecting to EVE Online's live operational servers.

On one hand, the offline version means DeepMind will not consume live player PvP data or disrupt the actual server economy, avoiding any privacy and compliance complexities.

On the other hand, the offline version of EVE can still retain the complex rule systems, ship and economic mechanics, star system structure, and other core design elements.

DeepMind is getting a "complex world pressure-tested by players for 23 years" as the examination hall where its agents must survive.

From Atari to EVE

Where This Path Leads

Looking back at DeepMind's choice of training grounds over the past decade, there's a clear evolutionary line.

2013 to 2015: Atari was the starting point. DQN put agents into games like *Breakout* and *Space Invaders* with clear levels and closed rules. It tested reaction and value estimation.

2016 to 2017: AlphaGo and AlphaZero. Go has neat rules, a huge but closed action space. It tested search and long-chain reasoning.

2019: AlphaStar entered *StarCraft II*. The first entry into a real-time, imperfect-information, multi-threaded博弈 environment. It tested decision-making under partial observability.

2024: SIMA aimed to be a generalist agent across multiple games. It tested transfer and generalization.

2025: SIMA 2 upgraded: not just executing instructions, but also conversing with users, reasoning about goals, and self-improving during gameplay.

DeepMind's SIMA 2, released in 2025, has evolved from "executing instructions" to "understanding goals, reasoning processes, and learning while playing."

Each generation of environment incorporates more aspects of the "real world" than the last: from closed rules to open rules, from perfect information to imperfect information, from single-game对抗 to cross-game migration.

However, these previous environments were still relatively closed, segmentable, and repeatable task fields. For example, Atari has fixed-rule arcade games; AlphaStar faced StarCraft matches that ended one by one; SIMA tested cross-game generalization in multiple 3D virtual environments.

The difference with EVE is that it is a persistent world that has been running long-term, driven by players, with continuously evolving economic and political structures.

It has been organically evolved over 23 years by real players in an open-ruled world: a complete player-driven economy (ISK price fluctuations comparable to real financial markets), political structures across alliances (diplomacy, espionage, ceasefires), and a whole warfare ecosystem from small skirmishes to 21-hour mega-battles.

The consensus within the field on agent evaluation is increasingly clear: running point task benchmarks hasn't produced anything new for a long time, but long-term memory, planning across weeks, and learning from failure still lack decent evaluation arenas.

Therefore, DeepMind's choice this time is: rather than creating another synthetic environment, step into an "artificial society" that has already been pressure-tested by human players for 23 years.

But a bigger question then emerges:

An AI agent that can persist, continually learn, and plan within EVE—what is still missing between it and an autonomous agent operating in the real world?

References:

https://x.com/GoogleDeepMind/status/2052011542707630461

https://www.ccpgames.com/news/2026/studio-behind-eve-online-goes-independent-rebrands-as-fenris-creations-enters-research-partnership-with-google-deepmind

https://deepmind.google/blog/sima-2-an-agent-that-plays-reasons-and-learns-with-you-in-virtual-3d-worlds/

This article is from the WeChat public account "新智元" (New Zhiyuan), author: ASI启示录 (ASI Revelation), editor: 元宇 (Yuanyu).

Trending Cryptos

CitreaCTR

wrapped stUSDTWSTUSDT

Velodrome FinanceVELODROME

BrevisBREV

PancakeSwapCAKE

JUSTJST

Explosive Growth in Trading Volumes of 15 Altcoins Observed in South Korea!

Major South Korean cryptocurrency exchanges Upbit and Bithumb have reported a significant surge in trading volumes for several altcoins. Over the past 24 hours, the total trading volume for the most popular altcoins reached approximately $347.7 million. MetaDAO (META) led the rankings with a trading volume of $65.84 million on Upbit alone, accounting for 12.39% of the exchange's total spot volume. Euler (EUL) followed in second place with a total volume of $47.65 million across both exchanges. XRP, which consistently attracts substantial interest from Korean investors, achieved a total volume of $38.11 million. Other notable altcoins in the top 15 by trading volume include ThunderCore (TT) at $35.64 million, Babylon (BABY) at $25.15 million, and Shiba Inu (SHIB) at $10.55 million.

cryptonews.ru18m ago

Explosive Growth in Trading Volumes of 15 Altcoins Observed in South Korea!

cryptonews.ru18m ago

Donald Trump's Company Sold Another Large Batch of Bitcoins!

Donald Trump's company, Trump Media & Technology Group, reportedly transferred another large batch of Bitcoin to the CryptoCom exchange. Blockchain analysis indicates that addresses linked to Trump Media moved approximately 2,628 BTC (worth around $165 million) to the exchange. Prior reports suggested the company had acquired a total of 11,542 BTC at an average price of $118,500. It is claimed that by 2026, about 7,281 BTC had been withdrawn from these addresses, with approximately 4,261 BTC still held on them. The total realized and unrealized losses from Trump Media's Bitcoin investments are estimated to be roughly $555 million. It is important to note that sending Bitcoin to an exchange does not definitively mean the assets were sold. Such transfers could also be for custody, liquidity management, or other financial operations. However, movements from cold wallets to centralized exchanges are commonly viewed as potential sales activity.

cryptonews.ru2h ago

Donald Trump's Company Sold Another Large Batch of Bitcoins!

cryptonews.ru2h ago

Parker Lewis Explains Why Bitcoin Remains the Best Money

Bitcoin analyst Parker Lewis criticized companies promoting themselves as "crypto treasuries" for selling perpetual preferred stock, calling it a distortion of Bitcoin's essence. He argues Bitcoin has no inherent yield, and promises of dividends from such corporate derivatives are risky, often relying on new investor inflows. Lewis highlighted the vast discrepancy between the $300 trillion global credit market and the $1 trillion perpetual preferred stock market, suggesting these instruments shift indefinite risks to retail investors. He also refuted the notion that Bitcoin is "too volatile," stating volatility is a natural mathematical outcome of a fixed-supply asset gaining mass adoption, as new users must bid higher to acquire it. Instead of buying shares of companies like MicroStrategy, Lewis advises direct Bitcoin ownership as safer. The focus on corporate derivatives distracts from the primary threat of fiat currency devaluation. Citing his informal "Ribeye Index," Lewis notes a steep rise in steak prices, indicating real inflation far exceeding official CPI figures. In conclusion, the most prudent strategy against inflation is direct ownership and self-custody of Bitcoin. Chasing corporate yield through crypto treasury stocks multiplies systemic risks, while understanding decentralized money protects savings from macroeconomic turmoil.

cryptonews.ru2h ago

Parker Lewis Explains Why Bitcoin Remains the Best Money

cryptonews.ru2h ago

Why Bitcoin Holds Above $64,000 After Fed's Hard Pause

**Bitcoin Stabilizes Near $64,000 Following Hawkish Fed Pause** The cryptocurrency market, led by Bitcoin, remained stable around $64,000 despite a volatile reaction to the latest U.S. Federal Reserve meeting. The Fed paused interest rates but signaled a hawkish stance, with three committee members voting for an increase—the highest dissent since 2016. This limits risk appetite but hasn't triggered panic selling. Key market highlights include Bitcoin ETFs seeing a net inflow of $32.1 million, breaking a streak of outflows, while Ethereum ETFs experienced outflows of $18.65 million. Liquidations affected about 90,000 traders. Technically, Bitcoin finds support around $63,000-$63,500, with major resistance near $66,000. While its price is about 49% below its all-time high, institutional demand via ETFs and the absence of mass capitulation support a potential recovery scenario in the second half of the year. Major altcoins showed mixed movements, with Solana attracting capital while Ethereum faced selling pressure despite strong on-chain metrics like a growing staking queue. Regulatory news took a pause as the U.S. Senate delayed the CLARITY Act vote until at least autumn. For the final trading day of July, U.S. inflation and consumer spending data will be crucial. Bitcoin's key levels to watch are $63,000 support and $66,000 resistance. Sustained ETF inflows and Bitcoin holding above $63,000 are seen as positive signs for a potential market recovery later in the year.

cryptonews.ru2h ago

Why Bitcoin Holds Above $64,000 After Fed's Hard Pause

cryptonews.ru2h ago

ARK Invest's Cathie Wood Buys 109,129 Circle Shares Worth $6.83 Million

ARK Invest, led by Cathie Wood, purchased approximately 109,129 shares of Circle for nearly $6.83 million across three of its ETFs: ARK Innovation, ARK Next Generation Internet, and ARK Fintech Innovation. This investment followed Circle's recent receipt of a trust charter license from the New York Department of Financial Services for its subsidiary, Circle New York Trust, which CEO Jeremy Allaire described as a long-term company goal. Despite this regulatory approval, Circle's stock (CRCL) fell 2.54% to $62.61 on July 31, as investors may not have viewed the license as a catalyst for growth. In the same period, ARK Invest also bought shares in Tesla, SpaceX, and Nvidia worth about $40.2 million amid a broader tech sell-off, while reducing its holdings in companies like Shopify, Cloudflare, and CrowdStrike.

cryptonews.ru2h ago

ARK Invest's Cathie Wood Buys 109,129 Circle Shares Worth $6.83 Million

cryptonews.ru2h ago

Trading

Spot

Hot Articles

The Cornerstone of the Autonomous AI Economy: How Talus is Reshaping On-Chain Intelligent Agents

Talus is a decentralized AI Agent framework built on the Sui, designed to solve the structural problems of current AI systems: centralization, opacity, and a lack of native economic identity.

43.4k Total ViewsPublished 2026.03.18Updated 2026.03.18

The Cornerstone of the Autonomous AI Economy: How Talus is Reshaping On-Chain Intelligent Agents

In-depth Analysis of AI and Crypto: The Era of Symbiosis between Algorithms and Ledgers

By 2026, the integration of artificial intelligence and cryptocurrency has advanced from proof-of-concept to a new stage of "system-level integration".

3.0k Total ViewsPublished 2026.03.26Updated 2026.03.26

In-depth Analysis of AI and Crypto: The Era of Symbiosis between Algorithms and Ledgers

U.S. Equity TradFi Assets: Traditional Finance as a Steady Anchor Amid the AI IPO Boom

In 2026, the U.S. IPO market has regained momentum.

36.9k Total ViewsPublished 2026.07.08Updated 2026.07.08

U.S. Equity TradFi Assets: Traditional Finance as a Steady Anchor Amid the AI IPO Boom

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of AI (AI) are presented below.

AlphaGo's Creator Puts AI into a 23-Year-Old Artificial Society: All Three Toughest Challenges for AI Agents Are Here

Abstract

A 23-Year "Artificial Society"

An AI Benchmark Difficult to Replicate

The Three Toughest Challenges for Agents

Happen to be Daily Life for EVE Players

DeepMind is Entering an Offline Sandbox

Not the Live Player Universe

From Atari to EVE

Where This Path Leads

Trending Cryptos

Related Questions

Related Reads

Explosive Growth in Trading Volumes of 15 Altcoins Observed in South Korea!

Donald Trump's Company Sold Another Large Batch of Bitcoins!

Parker Lewis Explains Why Bitcoin Remains the Best Money

Why Bitcoin Holds Above $64,000 After Fed's Hard Pause

ARK Invest's Cathie Wood Buys 109,129 Circle Shares Worth $6.83 Million

Trading

Hot Articles

The Cornerstone of the Autonomous AI Economy: How Talus is Reshaping On-Chain Intelligent Agents

In-depth Analysis of AI and Crypto: The Era of Symbiosis between Algorithms and Ledgers

U.S. Equity TradFi Assets: Traditional Finance as a Steady Anchor Amid the AI IPO Boom

Discussions

Top Questions