Lao Huang: Prompt is Dead, the Entire AI Community is Frenziedly Chasing Loops

marsbit发布于2026-06-29更新于2026-06-29

文章摘要

The article "Prompt is Dead: The AI Industry is Obsessively Chasing Loops" discusses a major shift in AI development, where "Loop Engineering" is replacing traditional prompt engineering. Industry leaders like NVIDIA's Jensen Huang, Andrew Ng, and engineers from Anthropic and OpenAI argue that manually crafting prompts is becoming obsolete. Instead, the new focus is on designing autonomous, self-improving AI systems (loops) that can operate 24/7. A loop system typically involves five key phases: Discovery (finding tasks), Handoff (assigning to agents), Validation (critical independent review), Persistence (saving progress), and Scheduling (automated operation). The core idea is to move humans from being the operational "engine" to being the system "architects" who design the loop, define goals, and set up verification mechanisms. A major challenge and necessity is implementing robust, independent validation to prevent AI from uncritically approving its own work. The trend is seen as part of a move towards "inference-time compute," where allocating computational budget effectively becomes a key engineering skill. While loops can produce higher-quality outputs, they are more expensive and time-consuming than simple prompting. The article warns of risks like "verification debt," "comprehension corrosion," and "cognitive surrender," where engineers might stop understanding the code their systems generate. Ultimately, the article concludes that in an era of automated loops, huma...

Recently, what's the hottest word in Silicon Valley?

Loop.

Open up X, and the entire internet is flooded with discussions about Loop Engineering.

A group of Silicon Valley big shots are abandoning prompts one after another, turning towards autonomous loops!

Jensen Huang has outlined the new focus for the next phase (and a new way to burn tokens):

Nobody writes prompts anymore. The new job is to write and handle loops.

Recently, an Anthropic engineer revealed:

Over 80% of engineers at Anthropic are already using self-improving loops, and it will reach 100% in 3-6 months.

Andrew Ng asserts: In 3 to 6 months, prompts will die! Loops replacing prompts is a foregone conclusion.

Earlier, when explaining his AutoResearch project, Karpathy discussed AI agents' closed loops (generation → execution → evaluation → improve), advocating for partially removing humans from the loop.

In March, Karpathy gave an in-depth interview about AutoResearch / the Karpathy Loop.

OpenClaw's creator, Peter Steinberger, stated bluntly: A monthly reminder, stop manually writing prompts, designing loops is the way.

Claude Code's creator, Boris Cherny, boldly declared: Loop cycling is the future!

Two years ago, we were manually writing code. Then we started transitioning to having agents write code.

Now, we are moving towards a stage where agents prompt other agents, and the latter generates the code.

The step from source code to agents was big, but the significance and impact of introducing a loop mechanism is no less than the previous step.

Who would have thought, after two years of prompt engineering being hot, top AI engineers have moved on.

Why are Silicon Valley big shots so bullish on Loops?

The essence of traditional prompting is: Humans *are* the loop itself.

You write a prompt → Agent outputs → You review → You write the next prompt → Repeat.

Each step relies on human attention, contextual memory, and decision-making bandwidth. The amount of tokens and task complexity one person can effectively drive in a day is limited.

The essence of Loop Engineering is: The system becomes the loop itself.

Therefore, loop engineering is more important than prompt engineering.

Humans only do high-value design once:

1. Define goals and stop conditions

2. Build verification mechanisms (most critical)

3. Establish persistent memory (markdown / external state)

4. Configure discovery and scheduling

Afterwards, the AI loop system can autonomously discover tasks → execute → verify → persist → discover again, running 24/7, with humans only intervening when necessary.

This is why Silicon Valley big shots are all bullish on Loops.

Their judgment is based on this: Once loops mature, the cost-effectiveness of manual prompting will directly collapse!

Loop Engineering 11-page White Paper, Spreading Like Wildfire Online

So, what exactly is the loop process?

In recent days, a white paper on Loop Engineering has started spreading like wildfire on X.

https://drive.google.com/file/d/1qzKI4DKnyHRpXK1J3ATPqwaqLc0iNu-M/view

This 11-page PDF is essentially a popular summary / field guide, compiling relevant public discussions and practical experience.

The core ideas within originate from the public statements of Peter Steinberger, Boris Cherny, and Addy Osmani.

What is Loop Engineering?

Loop Engineering, named by Google Chrome engineer Addy Osmani in June 2026.

It is the fourth layer above prompt engineering, context engineering, and toolchain engineering: the first three layers assume you sit at the keyboard directing the AI line by line; Loop Engineering aims to move you from that position, completely liberating you from the work.

From now on, you are no longer the engine driving the AI, but the architect designing that engine.

The system will automatically wake up at set times, spawn child agents to work, and feed the output back to itself as input for the next round.

The article breaks down a complete Loop into five key actions:

Discovery: The AI uses a solidified skill library to find valuable work on its own, such as reading the latest CI failure logs or unresolved issues.

Handoff: Open independent sandboxes for each task, allowing multiple agents to work in parallel without interference.

Verification: This is the most crucial step. Letting the code-writing AI score itself, it will blindly praise itself. Therefore, a completely independent, default-skeptical "evaluator" agent must be introduced to find faults.

Persist: The AI's memory cannot just stay in a context window that can be cleared anytime. Its state and progress must be solidified to disk so it can continue the next day.

Schedule: Use automation scripts to let the system run autonomously and periodically, closing the entire loop.

Among these, the hardest and most likely to be lazily skipped is verification.

Having the AI score itself, it will almost always praise itself because its mind contains a self-persuasion chain. The solution is to introduce an independent evaluation Agent that assumes the code is bad by default.

However, the system running fully automatically doesn't mean you can rest easy. The author warns: When the loop runs wild late at night, it may quietly accumulate four hidden costs.

Verification Debt: Minor errors not validated are quietly merged into the repository.

Understanding Decay: The AI writes code too fast, causing human understanding of the codebase to seriously fall behind.

Cognitive Surrender: Humans become too lazy to review, accepting AI results wholesale.

Token Runaway: The AI retries all night in an infinite loop, burning through the budget.

The same Loop, built by two different people, may yield completely opposite results. Bring judgment into it, and it amplifies judgment; bring laziness into it, and it amplifies laziness.

In short, this report reveals a profound industry change: Loop engineering makes code generation almost free, while human judgment becomes the only scarce resource!

Also spreading wildly online simultaneously is a 14-step practical handbook issued by Codez, already with millions of shares.

The general idea of the article is as follows: Prompts are outdated, the leverage point has moved up one layer—from "words written for the AI to see" to "designing a system that automatically feeds the AI."

This transformation can be broken down into 14 steps, 3 stages—

First, determine if you really need a loop (Is the task repetitive? Can verification be automated? Can the budget handle it?), then learn the five components (scheduler, isolated working directory, skill files, external connectors, independent evaluation sub-agent), and finally build a minimal viable loop.

The most crucial point among them is: Separate the agent that writes code from the agent that reviews code. The same model acting as both athlete and referee will always give itself full marks.

A loop without objective verification gates is just "two optimists nodding at each other." The better the loop runs, the easier it is for engineers to stop truly understanding the code.

Loop Engineering Birth Timeline

If we were to outline a timeline for loop engineering, it would roughly be as follows.

Early Foundation Stage

2022: Shunyu Yao and others propose the ReAct framework, laying the theoretical cornerstone.

2025: Geoffrey Huntley proposes "Ralph."

2025–Early 2026: Andrej Karpathy releases the AutoResearch project, forming the classic autonomous experiment loop, a major milestone.

Concept Explosion and Naming Stage

Early June 2026, Peter Steinberger speaks out: You shouldn't manually prompt coding agents anymore, you should design loops that prompt them.

Boris Cherny states: I no longer prompt Claude directly, my job is to write loops that run Claude.

June 7, 2026: Addy Osmani publishes the blog post "Loop Engineering," formally naming it, providing a 4-layer stack framework: Prompt → Context → Harness → Loop Engineering.

Subsequently, throughout June, Loop Engineering begins spreading virally across the entire internet.

Claude's "Infinite Loop," Automated Agents Take Over Everything

In an internal podcast, an Anthropic engineer revealed a spine-chilling detail:

When you click run and let Claude execute for 8 hours, you are essentially making a $500 compute power gamble.

If you're still struggling over how to write prompts, you've already lost.

In Anthropic's logic, engineers are evolving into "compute power allocators."

Your core job is no longer writing logic, but deciding where each cent of compute power is invested.

As OpenAI researcher Noam Brown pointed out earlier this month, contemporary models can solve almost any problem if you're willing to throw enough compute power at it.

Loop engineering is part of the big trend of "compute at test time."

Interestingly, the idea of having agents work in loops has actually had prototypes for a while.

At least since last summer, Australian sheep farmer Geoffrey Huntley mentioned a similar approach in his blog, calling it the "Ralph loop."

A year ago, if you wanted to implement a loop, you had to write a bunch of bash scripts and then maintain that code forever, it belonged only to you.

Now, these components are built directly into the products.

You no longer argue about whether to use Codex or Claude Code, but directly design loops that work regardless of which tool you're in.

Former Google engineering and developer relations lead Addy Osmani pointed out that loop engineering requires all the parts that these AI tools already possess.

He also asserts that loop engineering may lead to "cognitive surrender" among engineers:

When the loop runs by itself, you easily stop thinking and just passively accept everything it feeds back.

This is exactly what software engineer Armin Ronacher is worried about.

This is also exactly what is said in the Orange Book, Addy Osmani encourages people to design loops with judgment:

Designing loops, if done with judgment, is the cure; if done to avoid thinking, it is the catalyst—the same action, completely opposite results.

The Loop Endures, Engineering Lives

At the AI Engineer Summit in April, an Anthropic engineer said they tasked Claude with developing a retro mini-game app, using two methods: one using only minimalist prompts, the other using agent loops.

The comparison was stark: The minimalist prompt version took 20 minutes and cost $9; the loop method took 6 hours and cost $200.

But the quality of the app from the latter was far superior.

The former game wouldn't run, the app was crude; the loop version was much richer, containing many features the game designer wanted.

The loop won't pity those who give up thinking. It will only use its speed to turn your ignorance into code debt faster.

And those willing to maintain a clumsy understanding, continuously define rules, and take responsibility for the final results, will gain greater leverage in this paradigm shift than ever before.

Build the loop. Stay the engineer.

Prompt is dead, Loop is running.

And you are still the one who decides its meaning.

References:

https://x.com/DataScienceDojo/status/2069873216152092975

https://x.com/0xCodez/status/2064374643729773029

https://x.com/akshay_pachaar/status/2069769689560187027

This article is from the WeChat public account "新智元" (New Zhiyuan), author: ASI启示录

热门币种推荐

相关问答

QWhat is the core difference between traditional prompt engineering and the emerging Loop Engineering, as discussed in the article?

AThe core difference lies in the role of the human. In traditional prompt engineering, the human acts as the loop itself: manually writing prompts, reviewing outputs, and making decisions for each iteration. In Loop Engineering, the system itself becomes the autonomous loop. Humans design the system once by defining goals, validation mechanisms, and memory, after which the AI system can run 24/7, autonomously discovering, executing, validating, and persisting tasks.

QAccording to the 'Loop Engineering' whitepaper, what are the five key actions that constitute a complete autonomous loop?

AAccording to the whitepaper, a complete autonomous loop consists of five key actions: 1) Discovery: The AI uses a fixed skills library to find valuable work. 2) Handoff: Tasks are passed to independent sandboxes for parallel, isolated execution by agents. 3) Validation: An independent, skeptical 'evaluator' agent checks the work, as a single agent scoring its own work is unreliable. 4) Persistence: The system's state and progress are saved to disk to survive beyond the context window. 5) Scheduling: Automated scripts run the system periodically to close the loop.

QWhy does the article emphasize that a separate 'evaluator' agent is crucial in a loop, especially for coding tasks?

AThe article emphasizes a separate evaluator agent because an AI agent tasked with both writing and evaluating its own code is inherently biased and will almost always praise its own work. It lacks the ability for objective, critical assessment. A dedicated, independent evaluator agent operates with a default skeptical stance, assuming the code is flawed, which is essential for catching errors and ensuring quality before results are persisted.

QWhat are the potential 'hidden costs' or risks associated with fully autonomous AI loops running unattended, as warned in the article?

AThe article warns of four main hidden costs or risks: 1) Validation Debt: Subtle, unvalidated errors are quietly merged into the codebase. 2) Comprehension Rot: Humans lose understanding of the codebase as AI writes code too quickly. 3) Cognitive Surrender: Humans stop reviewing and passively accept all AI outputs. 4) Token Runaway: The AI gets stuck in retry loops overnight, burning through the compute budget.

QBased on the timeline in the article, who formally named 'Loop Engineering' and what is the four-layer stack framework they proposed?

ALoop Engineering was formally named by Google Chrome engineer Addy Osmani in a blog post published on June 7, 2026. He proposed a four-layer stack framework consisting of: 1) Prompt Engineering, 2) Context Engineering, 3) Harness/Toolchain Engineering, and 4) Loop Engineering. This framework positions Loop Engineering as the highest layer, aiming to remove the human from direct, line-by-line interaction with the AI.

你可能也喜欢

OpenAI和Anthropic都要“自研芯片”,除了成本,更重要的是算力控制权

据The Information报道,Anthropic正与三星洽谈定制AI芯片,并已启动自研芯片的早期开发工作,此举被视为追随OpenAI的类似布局。两家头部AI公司共同指向同一趋势:大模型竞争正从算法层面转向软硬件一体化的综合竞争。 文章指出,自研芯片的首要动机并非仅是降低成本,更是为了掌握核心算力的控制权。当前,AI计算高度依赖英伟达GPU,供应紧张和成本高企使算力成为核心生产资料。通过定制芯片,AI公司可以优化模型与硬件的协同设计,从而在推理速度、能耗和单位经济性上获得更大提升。专家分析认为,最大的效率提升空间来自模型、内核与硅片之间的跨层协同优化,而非单点改进。 然而,这并不意味着对英伟达的立刻替代。自研芯片从设计到部署周期漫长,且英伟达的CUDA生态和通用性优势依然显著。因此,自研芯片更可能作为一种技术备选和谈判筹码,用于特定、高负载的推理场景,形成对现有供应链的制衡。 这一动向也反映了全行业“算力自主权”竞赛的全面开启。从谷歌、亚马逊到Meta、微软,全球科技巨头均已深度布局自研AI芯片。对三星等代工厂而言,获得Anthropic的订单将有助于其在AI半导体领域扩大影响力。总体而言,大模型的竞争已延伸至对算力、资金和硬件栈的整体控制能力。

marsbit37分钟前

OpenAI和Anthropic都要“自研芯片”,除了成本,更重要的是算力控制权

marsbit37分钟前

看跌阴云密布,价值21.3亿美元的比特币和以太坊期权到期

7月3日,加密货币市场迎来关键节点,价值约21.3亿美元的比特币和以太坊期权到期,为当前充满挑战的市场环境提供了新的投资者情绪洞察。 此次到期的比特币期权合约约3.1万份,名义价值约19亿美元,其认沽认购比为0.70,最大痛点为61,000美元。以太坊期权则到期13.5万份,价值约2.3亿美元,认沽认购比高达1.29,最大痛点水平为1,650美元。 以太坊认沽认购比高于1,表明认沽期权数量多于认购期权,这反映出许多交易者正在对冲进一步下跌风险或保持谨慎态度。期权到期持仓仍集中在关键伽玛暴露水平附近,比特币在60,000美元左右,以太坊则在1,700美元附近。尽管比特币本周重新站上60,000美元心理关口,但市场情绪依然喜忧参半。 除了期权活动,投资者注意力也日益转向传统金融市场,特别是人工智能和半导体股的动向。在数字资产行业内,代币化美股也成为一个主要话题。期权到期数据表明,交易者在进入第三季度时仍持谨慎态度。比特币虽收复重要支撑位,但以太坊的防御性定位和对冲活动的集中,显示出许多市场参与者仍在为波动加剧做准备,而非预期将出现决定性的牛市突破。 截至发稿,比特币交易价格最高达61,932美元,但24小时交易量下降24.43%,至333亿美元,过去24小时清算金额超过9484万美元。以太坊则回升至1,738美元附近交易区间,交易量降至124.7亿美元,清算金额达1.7146亿美元。

TheNewsCrypto1小时前

看跌阴云密布,价值21.3亿美元的比特币和以太坊期权到期

TheNewsCrypto1小时前

交易

现货

热门文章

从H2A到A2A:AI Agent经济体与Crypto新机遇

6月17日,哈佛大学独立研究员、美国AI科学院(NAAI)通讯院士、比特币基金会终身会员韩锋做客火币HTX《大咖讲堂》第三期,以《从H2A到A2A》为主题,分享了其对Agent经济、Crypto基础设施及数字社会未来发展的思考。

51人学过发布于 2026.07.01更新于 2026.07.01

从H2A到A2A:AI Agent经济体与Crypto新机遇

相关讨论

欢迎来到HTX社区。在这里,您可以了解最新的平台发展动态并获得专业的市场意见。以下是用户对AI(AI)币价的意见。

活动图片