CertiK Test: How the Vulnerable OpenClaw Skill Bypassed Review and Took Over Computers Without Authorization

marsbit2026-03-22 tarihinde yayınlandı2026-03-22 tarihinde güncellendi

Özet

CertiK's latest research reveals critical security vulnerabilities in OpenClaw's third-party Skill ecosystem. Despite OpenClaw's three-layer review system—including VirusTotal scanning, static code analysis, and AI logic checks—malicious Skills can easily bypass these safeguards. CertiK demonstrated this by developing a seemingly benign "test-web-searcher" Skill that contained a hidden remote code execution vulnerability. It was approved without warnings, allowing unauthorized command execution on the host machine (e.g., launching system calculators via Telegram commands). The core issue is the industry’s overreliance on pre-release scans rather than runtime isolation and strict permission controls. Unlike iOS’s mandatory sandboxing, OpenClaw’s sandbox is optional and often disabled by users for functionality, leaving systems exposed. CertiK urges developers to enforce mandatory sandboxing and granular permissions for Skills, and advises users to deploy OpenClaw on isolated devices away from sensitive data or assets. The study underscores that scanning alone cannot secure high-permission AI agents; runtime isolation and damage containment are essential for safety.

Recently, the open-source self-hosted AI agent platform OpenClaw (commonly known as "Crawfish") has rapidly gained popularity due to its flexible scalability and self-controlled deployment features, becoming a phenomenon in the personal AI agent space. Its core ecosystem, Clawhub, serves as an app marketplace, gathering a vast number of third-party Skill plugins that enable agents to unlock advanced capabilities—from web search and content creation to encrypted wallet operations, on-chain interactions, and system automation—with a single click. The ecosystem's scale and user base have experienced explosive growth.

But for such third-party Skills running in high-privilege environments, where exactly is the platform's real security boundary?

Recently, CertiK, the world's largest Web3 security company, released new research on Skill security. The report points out that the current market has a misplaced perception of the security boundaries of AI agent ecosystems: the industry generally regards "Skill scanning" as the core security boundary, but this mechanism is almost useless against hacker attacks.

If OpenClaw is compared to an operating system for smart devices, Skills are the various APPs installed on the system. Unlike ordinary consumer APPs, some Skills in OpenClaw run in high-privilege environments, directly accessing local files, calling system tools, connecting to external services, executing host environment commands, and even operating users' encrypted digital assets. Once security issues arise, they can directly lead to serious consequences such as sensitive information leakage, remote device takeover, and theft of digital assets.

The current universal security solution for third-party Skills across the industry is "pre-listing scanning and review." OpenClaw's Clawhub has also built a three-layer review and protection system: integrating VirusTotal code scanning, static code detection engines, and AI logic consistency checks. It uses risk grading to push security alerts to users, attempting to safeguard ecosystem security. However, CertiK's research and proof-of-concept attack tests confirm that this detection system has shortcomings in real attack-defense scenarios and cannot bear the core responsibility of security protection.

The research first breaks down the inherent limitations of the existing detection mechanisms:

Static detection rules are easily bypassed. This engine primarily relies on matching code features to identify risks, such as flagging the combination of "reading sensitive environmental information + sending network requests" as high-risk behavior. However, attackers only need to make slight syntactic modifications to the code to completely bypass feature matching while fully retaining malicious logic. It's like rephrasing dangerous content with synonymous expressions, rendering the security scanner completely ineffective.

AI review has inherent detection blind spots. Clawhub's AI review is primarily positioned as a "logic consistency detector," which can only catch obvious malicious code where "declared functionality does not match actual behavior." However, it is helpless against exploitable vulnerabilities hidden within normal business logic, much like how it is difficult to find fatal traps buried deep in the clauses of a seemingly compliant contract.

More critically, the review process has underlying design flaws: even when VirusTotal's scan results are still "pending" and the full "health check" process is incomplete, Skills can still be directly listed publicly. Users can install them without any warnings, leaving an opening for attackers.

To verify the real危害性 of the risks, the CertiK research team completed full testing. The team developed a Skill named "test-web-searcher," which表面上 appears to be a fully compliant web search tool with code logic that完全符合常规开发规范. However, it actually implants a remote code execution vulnerability within the normal functional flow.

This Skill bypassed the detection of both the static engine and the AI review. While the VirusTotal scan was still pending, it was installed normally without any security warnings. Ultimately, by sending a remote command via Telegram, the vulnerability was successfully triggered, achieving arbitrary command execution on the host device (in the demo, it directly controlled the system to launch the calculator).

CertiK clearly stated in the research that these issues are not unique product bugs of OpenClaw but rather a common cognitive误区 across the AI agent industry: the industry普遍 regards "review scanning" as the core security防线, while neglecting the true security根基, which is runtime mandatory isolation and fine-grained permission control. This is similar to how the security core of Apple's iOS ecosystem has never been the strict review of the App Store, but rather the system's enforced sandbox mechanism and fine-grained permission management, ensuring each APP runs in its dedicated "isolation pod" without随意获取系统权限. OpenClaw's existing sandbox mechanism is optional而非强制的 and highly reliant on manual user configuration. Most users, to ensure Skill functionality, choose to disable the sandbox, ultimately leaving the agent in a "naked" state. Once a Skill with vulnerabilities or malicious code is installed, it can directly lead to catastrophic consequences.

Regarding the issues discovered, CertiK also provided security guidance:

● For developers of AI agents like OpenClaw, sandbox isolation must be set as the default mandatory configuration for third-party Skills, with a fine-grained permission control model. Third-party code must never默认继承 the host machine's high privileges.

● For ordinary users, Skills labeled "safe" in the marketplace merely indicate that no risks were detected; it does not equate to absolute safety. Before官方 makes底层强隔离机制 the default configuration, it is recommended to deploy OpenClaw on non-critical idle devices or virtual machines. Never let it near sensitive files, password credentials, or high-value加密资产.

The AI agent赛道 is currently on the eve of explosion. The speed of ecosystem expansion must not outpace the pace of security construction. Review scanning can only block初级恶意攻击 but can never become the security boundary for high-privilege agents. Only by shifting from "pursuing perfect detection" to "assuming risk exists and focusing on damage containment," and by establishing隔离边界强制 at the runtime底层, can the security底线 of AI agents truly be safeguarded, allowing this technological transformation to proceed steadily and go the distance.

Original Research: https://x.com/hhj4ck/status/2033527312042315816?s=20

https://mp.weixin.qq.com/s/Wxrzt7bAo86h3bOKkx6 UoA

İlgili Okumalar

The End of the Crypto Premium? Market Logic Shift Seen Through Gemini's Post-IPO Struggles

The article "The End of the Crypto Premium? Market Logic Shifts as Gemini Struggles Post-IPO" examines the dramatic downturn of cryptocurrency exchange Gemini following its public listing in September 2025. Initially part of a wave of crypto IPOs, including Bullish, which saw soaring valuations and massive investor interest, Gemini's stock price has since collapsed by over 80%, falling from $28 to around $5. The company has cut 30% of its workforce, exited international markets, and faces significant financial strain, including $330 million in Bitcoin-denominated debt. The core argument is that Gemini's struggles reflect a broader market shift where the "excess premium" once associated with crypto assets is disappearing. Two key factors are identified: the erosion of regulatory arbitrage, as compliance costs rise for all players (up 22.5% for small firms in 2026), and the decline of liquidity scarcity premiums, as institutional investors now access crypto via low-friction ETFs and stocks rather than volatile altcoins. The approval of Bitcoin and other crypto ETPs, which now manage $1.8 trillion globally, has diverted institutional capital away from altcoins, causing their liquidity to dry up and volatility to increase. For Gemini, its strategy of being "the most compliant exchange" became a liability in a bear market, as fixed compliance costs remained high while trading revenue fell. The article concludes that the era of narrative-driven crypto valuations is ending, giving way to a market logic focused on fundamentals like actual usage, liquidity depth, and sustainable institutional adoption.

marsbit2 saat önce

The End of the Crypto Premium? Market Logic Shift Seen Through Gemini's Post-IPO Struggles

marsbit2 saat önce

Utexo Partners with x402 to Provide Near-Instant USDT Settlement for the Agent Economy

Utexo, a Bitcoin-native stablecoin payment execution and settlement layer, has partnered with x402 to integrate USDT compatibility into the x402 payment protocol. This collaboration enables near-instant settlement for agent-to-agent transactions, with speeds as fast as 50 milliseconds. x402 is an open protocol that uses the HTTP 402 "Payment Required" status code to embed payment functionality directly into HTTP requests. This allows applications, APIs, and autonomous systems to pay for services in real-time without requiring pre-funded accounts. The integration expands x402’s initial USDC support to include USDT, one of the most widely used stablecoins globally. Utexo’s infrastructure is designed for high-frequency, low-latency transactions, making it well-suited for machine-driven payments. According to Utexo CEO Viktor Ihnatiuk, supporting USDT within the x402 framework significantly broadens access and provides developers the performance needed for real-time agent-based systems. Kevin Leffew of x402 at Coinbase added that expanding stablecoin access improves performance and accelerates developer adoption. This partnership supports growing use cases where software systems autonomously conduct transactions—such as paying for API calls, accessing data on-demand, and coordinating services across platforms without human intervention. By combining x402’s protocol with Utexo’s settlement infrastructure, the collaboration enables a payment model where transactions are as fast and efficient as the requests that trigger them.

marsbit5 saat önce

Utexo Partners with x402 to Provide Near-Instant USDT Settlement for the Agent Economy

marsbit5 saat önce

İşlemler

Spot
Futures
活动图片