You Use Claude and Codex Every Day, but Meta Has Restricted Internal Use

Published by marsbit on 2026-06-30; updated 2026-06-30

Article Summary

In May, Meta imposed internal restrictions on its engineers regarding the use of Claude Code and Codex, two widely used AI programming tools. Despite being a major client, Meta's guidelines, still in effect, prohibit these external models from being used for specific tasks to prevent potential "escalations with partners." The core concern is "distillation"—the risk that outputs from Claude or Codex could inadvertently contaminate the training data and evaluation processes for Meta's in-house AI coding assistant, MetaCode. If MetaCode is trained or evaluated using data generated by these external models, it risks learning their capabilities rather than developing its own, blurring the line of intellectual origin. The restrictions are precise: engineers cannot use the external models to generate test questions, debug source code, or suggest test cases. AI-generated content is also barred from environments accessible to MetaCode. However, AI can still assist with peripheral tasks like workflow setup and code organization, provided all outputs are manually reviewed. This caution reflects a broader industry dilemma. While distillation is a common technique, using a competitor's model output for training raises legal and ethical questions about the ownership of derived capabilities. Contractual terms from companies like OpenAI and Anthropic explicitly forbid using their outputs to build competing products, putting enforcement power in the hands of rivals. The move is also financ...

In May of this year, Meta drew a clear line for its own engineers.

People in the Applied AI Engineering department can no longer freely use Claude Code and Codex.

According to an internal guide obtained by The Information, a memo even directly called for a pause on certain tasks involving these two models. The wording was severe, stating this could trigger a "serious escalation with partners."

But this is where it gets strange.

Meta is one of Claude Code's largest global customers. Its total internal AI bill this year is heading towards tens of billions of dollars.

A tool used daily, purchased by the company at great cost, is now being restricted internally. And the reason for the restriction is probably something you wouldn't expect.

It's not that they aren't useful. On the contrary, it's because they are *too* useful.

This Red Line Is Still in Effect

According to The Information report, these restrictions were set in May and are still in effect today.

To understand why Meta is so tense, we need to start with an internal AI coding assistant project.

This year, Meta formed an Applied AI Engineering team focused on its self-developed AI coding assistant, MetaCode (formerly DevMate).

The goal is for Meta to train its own coding model instead of continuing to spend huge sums on other companies' models.

Claude Code and OpenAI's Codex have together become the de facto standard for professional developers doing agentic programming.

But training a model that can write code is not simple.

You need to feed it massive amounts of high-quality data, and you also need to generate a large enough set of sufficiently tricky programming problems for it to practice on and be graded on. This set of problems and evaluations largely determines how powerful a coding model ultimately becomes.

And this is exactly where the problem lies.

The difficulty Meta ran into is how to keep employees from becoming too reliant on these external tools while they build the internal replacement.

Its worry is that outputs from these external models will seep into the training data, so that the model it builds quietly learns the competitor's capabilities.

To understand this concern, you need to know how a model "learns": whatever kind of data you feed it, that is the kind of model it becomes.

MetaCode is meant to grow stronger on the training data and programming problem sets that engineers have accumulated.

But once these problems, answers, and even grading criteria come from Claude or Codex, what MetaCode learns is no longer "skills trained by human engineers," but "Claude's skills."

It's copying answers from the competitor's test paper, becoming more and more like the competitor.

The evaluation side is even harder to spot.

Every time a model answers a question, something must tell it if it answered well so it knows where to improve.

If both problem creation and grading are handed to Codex, then MetaCode is evolving towards what "Codex thinks is correct," essentially copying the competitor's judgment standards bit by bit into its own mind.
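The pull of a grader can be illustrated with a toy sketch. Everything here is an assumption made for illustration: the "answer" is a single number, each grader simply prefers answers close to its own hidden value, and `make_grader` and `train_against` are hypothetical names. Real coding-model evaluation is far richer, but the direction of the effect is the same: the trainee ends up wherever its grader points.

```python
from typing import Callable


def make_grader(preferred: float) -> Callable[[float], float]:
    """Return a grader that scores an answer by closeness to its own preference."""
    def grade(answer: float) -> float:
        return -abs(answer - preferred)
    return grade


def train_against(grade: Callable[[float], float], start: float,
                  step: float = 0.5, rounds: int = 40) -> float:
    """Repeatedly keep whichever nearby answer the grader scores highest."""
    current = start
    for _ in range(rounds):
        current = max((current - step, current, current + step), key=grade)
    return current


# Two graders with different (hypothetical) ideas of a "correct" answer.
human_grader = make_grader(preferred=-3.0)
external_grader = make_grader(preferred=7.0)

# The same starting model, trained against each grader in turn.
print(train_against(human_grader, start=0.0))     # -3.0
print(train_against(external_grader, start=0.0))  # 7.0
```

The trainee never sees the grader's preferred value directly, yet it converges on it, which is the sense in which grading by an external model copies that model's standards.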

This is why Meta's guide prohibits AI from being the problem creator or grader, and even governs whether "AI-generated materials can enter the environment accessible to the model under test."

As long as the competitor's output seeps into the training or evaluation chain in any way, the line of "who taught whom" becomes blurred.

Ultimately, Meta's pause on certain tasks is about isolating the training data.

It fears that the AI writes so well that it becomes unclear which skills were trained internally and which were learned from Claude and Codex.

And the latter set of capabilities is rented, not its own.

Surprisingly Detailed Restrictions

It must be clarified first that Meta's internal documents show no actual records of employees violating rules.

A Meta spokesperson also responded that the company has "clear policies" governing the use of AI tools. So this document is more like an internal early warning.

Which tasks are off-limits for AI? Mainly the following three categories:

First, you cannot use Claude or Codex outputs to create test questions for your own model. In the guide's words, this "clearly falls into the category where engineers are not in the driver's seat," and "We do not want tasks derived from models."

Second, AI cannot be used to find bugs in source code, nor to decide "what to test" based on code analysis.

Third, anything generated by AI cannot be placed anywhere accessible by the model under test.

Simply put, as long as AI participates in the judgment of "what to test" or "whether the answer is correct," the competitor's skills might mix in. The three rules block this opening.
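The three rules above, together with the human-review requirement the guide attaches to permitted work, amount to a provenance check. A minimal sketch of such a check follows; the `Artifact` fields, the role labels, and `is_allowed` are all hypothetical, since the guide's actual taxonomy and tooling are not public.

```python
from dataclasses import dataclass

# Hypothetical labels for work that decides "what to test" or "is it correct".
JUDGMENT_ROLES = {"test_task", "bug_finding", "test_selection", "grading"}


@dataclass(frozen=True)
class Artifact:
    name: str
    origin: str             # "human" or "external_model" (assumed labels)
    role: str               # e.g. "test_task" or "scaffolding"
    human_reviewed: bool
    visible_to_model: bool  # can the model under test read this artifact?


def is_allowed(artifact: Artifact) -> bool:
    """Apply the article's three rules plus the human-review requirement."""
    if artifact.origin == "human":
        return True
    if artifact.role in JUDGMENT_ROLES:
        return False   # rules 1 and 2: no model-derived tasks or test decisions
    if artifact.visible_to_model:
        return False   # rule 3: keep AI output away from the model under test
    return artifact.human_reviewed  # peripheral work still needs human review


task = Artifact("problem_017", "external_model", "test_task", True, False)
scaffold = Artifact("ci_setup", "external_model", "scaffolding", True, False)
print(is_allowed(task), is_allowed(scaffold))  # False True
```

Note that human review does not rescue a model-derived test task in this sketch: the article describes review as a condition on the permitted chores, not as a way around the three prohibitions.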

What tasks can AI still do?

Setting up workflows, organizing code and files, building test frameworks for internal tools—these daily chores are allowed. The guide calls this type of work "test scaffolding" and "solution calibration," essentially assisting and building frameworks.

Even for these tasks, there is one ironclad rule: Every line of AI output must be reviewed by a human first.

In Meta's view, once you let a competitor's model create the test and grade it, it becomes unclear whose test this is.

What it truly wants to protect is that line of "who taught whom."

The Unavoidable "Distillation Trap"

What Meta worries about has a specific name in the industry: distillation.

The meaning is easy to understand: Use a stronger model to continuously answer questions, then use these answers to train a weaker model.

It's a bit like having the top student redo the entire exam paper, and the struggling student copies it, catching up to years of effort in months.

You get the massive investment others put into data, computing power, and research almost for free.

Training a cutting-edge model from scratch costs astronomical sums of money and time. Distillation, however, might only require a batch of outputs from the other model, reducing costs and timelines to a fraction.
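A toy sketch makes the mechanism concrete. It assumes, purely for illustration, that the "teacher" is a one-variable linear function and the "student" is fitted by ordinary least squares; distilling a real language model means training on its generated text or output probabilities instead. The names `teacher` and `distill` are hypothetical.

```python
from typing import Callable, List, Tuple


def teacher(x: float) -> float:
    # Stand-in for the stronger model. The student never sees 3.0 and 2.0,
    # only the answers the teacher gives.
    return 3.0 * x + 2.0


def distill(prompts: List[float],
            teacher_fn: Callable[[float], float]) -> Tuple[float, float]:
    """Fit a student (slope, intercept) using only the teacher's answers."""
    answers = [teacher_fn(x) for x in prompts]
    n = len(prompts)
    mean_x = sum(prompts) / n
    mean_y = sum(answers) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(prompts, answers))
    var = sum((x - mean_x) ** 2 for x in prompts)
    slope = cov / var
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Five queries to the teacher are enough to recover its behavior exactly.
slope, intercept = distill([0.0, 1.0, 2.0, 3.0, 4.0], teacher)
print(slope, intercept)  # 3.0 2.0
```

The student reproduces the teacher from a handful of answers, with no access to whatever data or effort produced the teacher in the first place.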

Distillation itself is standard industry practice; big companies also often distill their own large models to create smaller, cheaper versions for users.

The trouble arises only once you are copying someone else's model: are the capabilities you train your own, or borrowed? It's unclear.

Some call this the "distillation trap": The more you rely on the strongest model to build your own foundation, the harder it is to prove where your intelligence actually came from.

In the United States, the law does not explicitly prohibit distillation, and AI-generated content is not protected by copyright. Using another company's outputs to train your own model largely clears the legal hurdle.

The only barrier is the contract.

Both OpenAI's and Anthropic's terms of service contain similar restrictions: You cannot use the model's outputs to create something that competes with them.

Moreover, the enforcement power for this barrier lies entirely with the competitor.

Last year, Anthropic directly cut off OpenAI's API access to Claude, even though OpenAI claimed it was only for evaluating capabilities and safety, a "standard industry" practice.

Even Musk was forced to admit in a court hearing this past April that his xAI "partially" distilled OpenAI's models.

On April 30, 2026, on the witness stand in a California federal court, Musk was asked whether xAI used distillation techniques on OpenAI models to train Grok. He first said this was common practice for AI companies.

When pressed if this amounted to a "yes," he replied "partially."

The rules are fuzzy, and "enforcement power" is held by competitors. Who dares to bet their billions in investment that a competitor won't turn hostile?

From this perspective, Meta's tension is not at all excessive.

There is also the matter of saving money.

According to internal memos, Meta will burn tens of billions of dollars this year just on internal AI use. It has even started setting token usage limits for employees. Even a cash-rich giant like Meta is starting to find AI too expensive and is calculating carefully.

If development work can be shifted from expensive external tools to its own MetaCode, it saves money while avoiding the minefield of distillation—killing two birds with one stone.

A Tightrope-Walking Map

Regarding Meta's internal documents, tech law scholar and legal advisor Mark Leiser has a vivid phrase: This is "almost like a map for walking a tightrope."

On one side, you need to gain the benefits of external models; on the other, you must prevent their capabilities from slipping into your own system.

Of course, Meta isn't the only company walking this tightrope; it touches a vital point for the entire industry.

When you use a sufficiently smart AI to build another equally smart AI, in the end, you might find it hard to say clearly: Is this intelligence something you trained yourself, or did you secretly learn it from someone else's AI?

And this issue isn't that far from ordinary people either.

The code you write with AI, the plans you revise, the materials you compile: fed back, they all become nourishment for the next generation of models.

In this cycle, who is standing on whose shoulders? That line has become increasingly blurred.

When AI starts helping us build AI, can we still tell whose capabilities are whose?

References:

https://x.com/kimmonismus/status/2071591755351224344

https://www.theinformation.com/articles/internal-docs-show-meta-putting-limits-claude-codex-fearing-distillation

This article is from the WeChat public account "New Zhiyuan", author: ASI Apocalypse

Related Q&A

Q: Why did Meta restrict its engineers from using Claude Code and Codex internally?

A: Meta restricted their internal use to prevent knowledge distillation, where the outputs from these powerful external AI models could inadvertently influence and shape the training of Meta's own in-house AI coding assistant, MetaCode. The concern is that if MetaCode is trained on data or evaluations generated by Claude or Codex, its capabilities would be learned from the competitor's model rather than developed independently.

Q: What specific tasks does Meta's policy forbid using Claude or Codex for?

A: The policy forbids three main tasks: 1) Using their outputs to create test cases or benchmarks for Meta's own models. 2) Using them to find bugs in source code or to suggest what should be tested. 3) Placing any AI-generated content in an environment accessible by the model being trained (MetaCode).

Q: What is 'distillation' in the context of AI model training, and what is the 'distillation trap' mentioned in the article?

A: In AI, distillation refers to using a larger, more powerful model's outputs to train a smaller or weaker model. The 'distillation trap' is the dilemma where a company heavily relies on a competitor's model outputs to build its own. This makes it difficult to prove that the resulting model's intelligence and capabilities were developed independently rather than being copied or derived from the competitor.

Q: What are the potential consequences for a company if it's found to have distilled a competitor's AI model?

A: While not explicitly illegal under current US law, using a competitor's model outputs for training likely violates their Terms of Service. The primary consequence is that the competitor can take action, such as cutting off API access (as Anthropic did to OpenAI). This creates significant business and legal risk, potentially jeopardizing a company's multi-billion dollar AI investments.

Q: Besides avoiding 'distillation,' what is another key reason for Meta to develop and push its own MetaCode assistant?

A: Another key reason is cost reduction. Meta's internal AI usage is projected to cost tens of billions of dollars this year. By shifting development work from expensive external tools like Claude and Codex to its own MetaCode, the company can save significant money while also mitigating the legal and strategic risks associated with knowledge distillation.
