Claude Code Leak: Unveiling the Five-Layer Architecture and Survival Philosophy of a Top AI Agent

marsbit · Published 2026-04-02 · Updated 2026-04-02

Article Summary

A configuration error in the Bun build tool led to the leak of Claude Code's source code, revealing the architecture and internal mechanisms of Anthropic's AI coding agent. The exposed system consists of five core layers: Entrypoints (routing inputs), Runtime (the TAOR loop), Engine (dynamic prompt assembly), Tools & Capabilities (about 40 tools with strict permission isolation), and Infrastructure (prompt caching and remote control, including a kill switch). Key innovations include a biologically inspired memory system with three layers (long-term, episodic, and working memory) and an "Auto-Dream" process that consolidates knowledge. Anthropic's security measures are extensive, featuring an undercover mode that strips AI identifiers from contributions, an anti-distillation mechanism that injects fake tool definitions into prompts, and hardware-level authentication. Future development points to "KAIROS mode", an always-on background agent capable of autonomous action via webhooks and cron jobs. While the leak offers a rare look into a production-scale AI agent, it also highlights Anthropic's challenge in balancing transparency and security ahead of its planned IPO.

A packaging error has set off a "butterfly effect" in the AI community, and it is turning into a very public lesson for the tech industry.

According to media reports, due to a configuration oversight in the Bun build tool, 1,900 TypeScript files containing a total of 512,000 lines of source code for Anthropic's programming agent Claude Code were accidentally leaked. This incident not only allowed outsiders a glimpse into the technical foundation of a top Agent but also exposed Anthropic's deeper logic regarding information control and product evolution.

Five-Layer Architecture Overview: This is More Than Just a "Shell" Interface

The leaked code reveals an extremely complex production-grade system, with its architecture clearly divided into five layers:

Entrypoint Layer: Unifies routing for CLI, desktop client, and SDK, standardizing multi-endpoint input.

Runtime Layer: Its core is the TAOR loop (Think-Act-Observe-Repeat), which sets the Agent's behavioral rhythm.

Engine Layer: The heart of the system, responsible for dynamic prompt assembly. Depending on the mode, it injects hundreds of prompt fragments, with safety rules alone amounting to a hefty 5,677 tokens.

Tools & Capabilities Layer: Includes about 40 independent tools, each with strict permission isolation.

Infrastructure Layer: Manages prompt caching and remote control, even including a remotely activatable "kill switch".
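The runtime and tool layers described above can be illustrated with a minimal sketch of a Think-Act-Observe-Repeat loop that checks a per-tool permission before acting. This is not the leaked implementation: the names `Tool`, `Step`, `taor_loop`, and the `allowed` flag are hypothetical, and the "think" step is a scripted stand-in for a model call.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class Tool:
    name: str
    run: Callable[[str], str]
    allowed: bool = True  # hypothetical per-tool permission flag


@dataclass
class Step:
    tool: Optional[str]  # None means the agent considers the task done
    arg: str = ""


def taor_loop(think: Callable[[List[str]], Step],
              tools: Dict[str, Tool],
              max_turns: int = 10) -> List[str]:
    """Think, act, observe, repeat until the agent stops or turns run out."""
    observations: List[str] = []
    for _ in range(max_turns):
        step = think(observations)                 # Think
        if step.tool is None:
            break
        tool = tools.get(step.tool)
        if tool is None or not tool.allowed:       # permission gate
            observations.append(f"denied: {step.tool}")
            continue
        observations.append(tool.run(step.arg))    # Act, then Observe
    return observations


# Scripted stand-in for the model: one step per observation seen so far.
PLAN = [Step("read", "a.txt"), Step("shell", "rm -rf /"), Step(None)]


def scripted_think(observations: List[str]) -> Step:
    return PLAN[len(observations)]


tools = {
    "read": Tool("read", lambda path: f"contents of {path}"),
    "shell": Tool("shell", lambda cmd: "ran", allowed=False),
}
result = taor_loop(scripted_think, tools)
print(result)
```

In this sketch a denied tool call is fed back as an observation instead of raising, so the next "think" step can react to the refusal.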

Bionic Design: Layered Memory and a "REM Sleep" Mechanism

Claude Code's memory system is highly aligned with cognitive science:

Three-Layer Memory: Divided into long-term semantic memory (RAG retrieval), episodic memory (conversation sequence), and working memory (current context). The core idea is "fetch on demand, never overload".

Auto-Dream Mechanism: The infrastructure layer includes a background process named "dreaming". Every 24 hours or after 5 sessions, the system initiates a sub-agent to consolidate memories, clean up noise, and solidify vague expressions into definitive knowledge.
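The trigger rule described above (every 24 hours or after 5 sessions) can be written as a small predicate. The sketch below is only an illustration: `should_dream` and `consolidate` are hypothetical names, and the de-duplication in `consolidate` is a simple stand-in for the sub-agent the article describes, which would presumably call a model instead.

```python
from datetime import datetime, timedelta
from typing import List

DREAM_INTERVAL = timedelta(hours=24)  # from the article: every 24 hours
SESSIONS_PER_DREAM = 5                # from the article: or after 5 sessions


def should_dream(last_dream: datetime, sessions_since: int, now: datetime) -> bool:
    """Return True when either trigger condition is met."""
    return (now - last_dream >= DREAM_INTERVAL
            or sessions_since >= SESSIONS_PER_DREAM)


def consolidate(notes: List[str]) -> List[str]:
    """Stand-in for the consolidation pass: drop empty and duplicate notes."""
    seen = set()
    kept = []
    for note in notes:
        key = " ".join(note.split()).lower()  # normalize whitespace and case
        if key and key not in seen:
            seen.add(key)
            kept.append(note.strip())
    return kept


t0 = datetime(2026, 4, 1, 9, 0)
print(should_dream(t0, sessions_since=5, now=t0 + timedelta(hours=1)))
print(consolidate(["Uses Bun", " uses bun ", "", "Tests in tests/"]))
```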

Information Control Triad: Undercover Mode and Anti-Distillation

The "defense lines" exposed in the source code reflect Anthropic's rigorous information control mindset:

Undercover Mode: Automatically activates when operating on non-internal repositories, stripping all AI identifiers for "covert contributions".

Anti-Distillation Mechanism (ANTI_DISTILLATION): When enabled, it injects fake tool definitions into prompts to prevent competitors from training their own models using API traffic.

Native Authentication: Employs hardware-level authentication at the Bun/Zig layer to prevent third-party tampering or spoofing of the official client.
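The anti-distillation idea above (mixing fake tool definitions into the prompt when a flag is enabled) can be sketched as follows. This is an assumption-laden illustration, not the leaked code: `build_tool_manifest`, the dictionary layout, and the decoy name are all made up for the example.

```python
import random
from typing import Dict, List, Optional

ToolDef = Dict[str, str]


def build_tool_manifest(real_tools: List[ToolDef],
                        decoy_tools: List[ToolDef],
                        anti_distillation: bool,
                        rng: Optional[random.Random] = None) -> List[ToolDef]:
    """Return the tool list to place in the prompt.

    With the flag off, only real tools are listed. With it on, decoys are
    mixed in and shuffled so position does not reveal which entries are fake.
    """
    manifest = [dict(tool) for tool in real_tools]
    if anti_distillation:
        manifest.extend(dict(tool) for tool in decoy_tools)
        (rng or random.Random()).shuffle(manifest)
    return manifest


REAL = [{"name": "read_file"}, {"name": "run_tests"}]
DECOYS = [{"name": "sync_quantum_cache"}]  # made-up decoy for illustration

print(build_tool_manifest(REAL, DECOYS, anti_distillation=False))
print(len(build_tool_manifest(REAL, DECOYS, anti_distillation=True)))
```

A client built this way would also need to ignore any call to a decoy tool; that part is omitted here.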

Future Roadmap: KAIROS and the "Never-Sleeping" Assistant

Leaked feature flags hint at next-generation functionality: KAIROS mode. This is a continuously running background agent that supports GitHub webhook subscriptions and cron-scheduled refreshes. It signals a shift for AI from a tool that "moves only when poked" to a 24/7 online collaborator capable of autonomous observation and proactive action.
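To make the idea of a background agent driven by webhooks and scheduled refreshes concrete, here is a minimal simulation that merges both event sources into one time-ordered queue. Everything in it is hypothetical (`Event`, `run_background_agent`, the integer timestamps); it shows the general pattern and does not reflect how KAIROS is actually built.

```python
import heapq
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass(order=True)
class Event:
    at: int                                   # seconds since start (simulated)
    kind: str = field(compare=False)          # "webhook" or "cron"
    payload: str = field(compare=False, default="")


def run_background_agent(webhooks: List[Event],
                         cron_every: int,
                         horizon: int) -> List[Tuple[int, str]]:
    """Process webhook events and periodic cron ticks in time order."""
    queue = list(webhooks)
    # Scheduled refreshes: one tick every `cron_every` seconds up to `horizon`.
    queue += [Event(t, "cron") for t in range(cron_every, horizon + 1, cron_every)]
    heapq.heapify(queue)

    handled = []
    while queue:
        event = heapq.heappop(queue)          # earliest event first
        handled.append((event.at, event.kind))
    return handled


incoming = [Event(45, "webhook", "push"), Event(10, "webhook", "pull_request")]
timeline = run_background_agent(incoming, cron_every=30, horizon=60)
print(timeline)
```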

Conclusion: Leaked Code, Unreplicable Accumulation

Although Anthropic has urgently taken down the relevant version and issued DMCA notices, the architectural ideas behind Claude Code are already proliferating wildly within the community. For the industry, this might be the Agent field's first large-scale, production-validated "best practice". For Anthropic, however, finding a renewed balance between high transparency and security will be a critical challenge on its path to an IPO in 2026.
