OpenAI’s latest paper exposes the risks of AI in smart contracts

ambcrypto · Published on 2026-02-19 · Last updated on 2026-02-19

Abstract

OpenAI's latest research paper highlights the dual role of AI in smart contract security, both as a tool for identifying vulnerabilities and as a potential threat capable of exploiting them. As smart contracts now manage over $400 billion in assets, their immutable nature makes security critical. To evaluate AI's capabilities, researchers developed EVMbench, a benchmark using 120 real vulnerabilities from 40 blockchain projects. The study found that frontier AI agents can successfully discover and exploit vulnerabilities end-to-end, with exploit success rates jumping from 31.9% to 72.2% in just six months. However, a recent incident involving Claude Opus 4.6 demonstrated significant risks when AI-generated code contained critical errors, leading to $1.78 million in losses. EVMbench has limitations, including a limited dataset, false positives, and an inability to fully replicate real-world conditions like cross-chain activity. The paper underscores the need for responsible AI development as smart contracts increasingly become tools for both innovation and cybercrime.

As smart contracts evolve from small experiments into major financial systems managing over $400 billion in assets, security has become increasingly critical.

Unlike traditional software, most blockchain programs cannot be changed after deployment, meaning even minor coding errors can cause permanent financial losses.

To evaluate how artificial intelligence performs in this high-risk environment, researchers from OpenAI, Paradigm, and OtterSec developed EVMbench.

Instead of simple test challenges, it uses 120 real vulnerabilities from 40 blockchain projects, making the evaluation closer to real-world conditions.

Commenting on the results, the OpenAI blog post noted,

“We evaluate a range of frontier agents and find that they are capable of discovering and exploiting vulnerabilities end-to-end against live blockchain instances.”

It further added,

“We release code, tasks, and tooling to support continued measurement of these capabilities and future work on security.”

Is AI actually reshaping smart contract security?

While AI greatly improves auditing and bug fixing, it can also be used to exploit system weaknesses. EVMbench helps researchers measure and track these dual-use risks.

It also guides responsible AI development for high-value financial systems.

EVMbench tests AI agents in three stages: detecting vulnerabilities, patching them, and exploiting them.

Each stage represents a different level of technical difficulty, reflecting growing security responsibility.
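The staged setup can be pictured as a simple per-stage scoring loop. The sketch below is an illustrative assumption about how such a harness might aggregate results; the task IDs, data structure, and scoring are hypothetical, not EVMbench's actual implementation.

```python
# Hypothetical sketch of a three-stage benchmark harness. Stage names
# (detect, patch, exploit) follow the modes the article describes; the
# task structure and scoring below are illustrative assumptions only.

from dataclasses import dataclass

@dataclass
class TaskResult:
    task_id: str
    stage: str      # "detect", "patch", or "exploit"
    success: bool

def success_rate(results: list, stage: str) -> float:
    """Fraction of tasks in a given stage the agent solved."""
    stage_results = [r for r in results if r.stage == stage]
    if not stage_results:
        return 0.0
    return sum(r.success for r in stage_results) / len(stage_results)

# Toy results: the agent finds both bugs but exploits only one.
results = [
    TaskResult("task-001", "detect", True),
    TaskResult("task-001", "exploit", False),
    TaskResult("task-002", "detect", True),
    TaskResult("task-002", "exploit", True),
]
assert success_rate(results, "detect") == 1.0
assert success_rate(results, "exploit") == 0.5
```

Separating detection from exploitation matters because, as the article's figures suggest, an agent can improve on the full attack chain faster than on any single stage.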

The community appreciates this effort

Appreciating the effort, one X user noted,

“This is a watershed moment for smart contract security. The jump from 31.9% to 72.2% exploit success in just 6 months shows AI agents aren’t just getting better at reading code—they’re mastering the full attack chain.”

Echoing similar sentiments, another user added,

“The 6× jump in exploit success is wild progress, but kinda worrying how fast offensive skills are scaling.”

Recent incident that sent shockwaves

Yet, despite such optimism, a sobering incident occurred soon after OpenAI launched EVMbench. An exploit involving Claude Opus 4.6 raised serious concerns about the risks of “vibe-coded” smart contracts.

In this case, the AI helped write vulnerable Solidity code that incorrectly set the price of the cbETH asset at $1.12 instead of its real value of around $2,200, triggering liquidations and causing losses of nearly $1.78 million.

This shows that trusting AI with critical financial logic without careful human review can turn small mistakes into major losses.
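The arithmetic behind such a failure is easy to illustrate. Below is a minimal sketch of a generic collateralized-lending health check; the function name, threshold, and position sizes are hypothetical and are not the exploited contract's actual code.

```python
# Illustrative sketch (not the actual exploited contract): how a
# mispriced oracle feed flips a healthy position into liquidation.

def is_liquidatable(collateral_amount: float, price_usd: float,
                    debt_usd: float, liq_threshold: float = 0.8) -> bool:
    """A position is liquidatable when its debt exceeds the allowed
    fraction of its collateral's market value."""
    collateral_value = collateral_amount * price_usd
    return debt_usd > collateral_value * liq_threshold

# With cbETH correctly priced near $2,200, a 1 cbETH position backing
# $1,000 of debt is comfortably healthy:
assert not is_liquidatable(1.0, 2200.0, 1000.0)

# With the buggy feed reporting $1.12, the identical position looks
# massively undercollateralized and gets liquidated:
assert is_liquidatable(1.0, 1.12, 1000.0)
```

The position itself never changed; only the reported price did, which is why a single mis-set constant in pricing logic can cascade into seven-figure losses.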

Limitations remain

EVMbench has clear limitations. It includes only 120 curated vulnerabilities and cannot evaluate newly discovered issues.

Detect Mode also produces false positives, and the small number of Patch and Exploit tasks reflects the heavy manual effort needed to create them.

In addition, the sandboxed environment fails to fully represent real-world conditions such as cross-chain activity, timing complexities, and long-term network history.

Needless to say, as blockchain adoption accelerates, its misuse is evolving just as quickly.

Recently, research by Group-IB also showed that the DeadLock ransomware is using Polygon smart contracts to conceal server infrastructure and evade detection.

Together, these developments signal a troubling shift where smart contracts, originally designed to enhance transparency and trust, are increasingly being repurposed as tools for cybercrime.


Final Summary

  • Tools like EVMbench help researchers measure AI capabilities in realistic security settings.
  • Limited datasets and controlled environments still fail to capture real-world blockchain complexity.

Related Questions

Q: What is EVMbench and what is its purpose in the context of AI and smart contracts?

A: EVMbench is a tool developed by researchers from OpenAI, Paradigm, and OtterSec to evaluate how artificial intelligence performs in the high-risk environment of smart contracts. It uses 120 real vulnerabilities from 40 blockchain projects to test AI agents, making the evaluation closer to real-world conditions. Its purpose is to help researchers track the risks of AI in smart contract security and guide responsible AI development for high-value financial systems.

Q: According to the article, what are the potential dual roles of AI in smart contract security?

A: The article states that AI can greatly improve auditing and bug fixing in smart contracts, but it can also be used to exploit system weaknesses. This dual capability means AI can be a tool both for enhancing security and for conducting attacks.

Q: What was the concerning incident mentioned that involved the AI model Claude Opus 4.6?

A: An exploit involving Claude Opus 4.6 raised serious concerns. The AI helped write vulnerable Solidity code that incorrectly set the price of the cbETH asset at $1.12 instead of its real value of around $2,200. This error triggered liquidations and caused financial losses of nearly $1.78 million, demonstrating the risks of using AI for critical financial logic without careful human review.

Q: What are some of the limitations of the EVMbench tool as outlined in the article?

A: EVMbench has several limitations: it includes only 120 curated vulnerabilities and cannot evaluate newly discovered issues; its Detect Mode can produce false positives; the small number of Patch and Exploit tasks reflects the heavy manual effort required to create them; and its sandboxed environment fails to fully represent real-world conditions like cross-chain activity, timing complexities, and long-term network history.

Q: How did the community react to the release of EVMbench, as per the social media comments cited?

A: The community reaction, as cited from social media (X), was a mix of appreciation and concern. One user called it a 'watershed moment for smart contract security,' highlighting a jump in AI exploit success rates from 31.9% to 72.2% in six months. Another user expressed that the rapid progress was 'wild' but also 'kinda worrying,' noting how fast offensive AI skills are scaling.
