Lao Huang: Prompt is Dead, the Entire AI Community is Frenziedly Chasing Loops

marsbitPublished on 2026-06-29Last updated on 2026-06-29

Abstract

The article "Prompt is Dead: The AI Industry is Obsessively Chasing Loops" discusses a major shift in AI development, where "Loop Engineering" is replacing traditional prompt engineering. Industry leaders like NVIDIA's Jensen Huang, Andrew Ng, and engineers from Anthropic and OpenAI argue that manually crafting prompts is becoming obsolete. Instead, the new focus is on designing autonomous, self-improving AI systems (loops) that can operate 24/7. A loop system typically involves five key phases: Discovery (finding tasks), Handoff (assigning to agents), Validation (critical independent review), Persistence (saving progress), and Scheduling (automated operation). The core idea is to move humans from being the operational "engine" to being the system "architects" who design the loop, define goals, and set up verification mechanisms. A major challenge and necessity is implementing robust, independent validation to prevent AI from uncritically approving its own work. The trend is seen as part of a move towards "inference-time compute," where allocating computational budget effectively becomes a key engineering skill. While loops can produce higher-quality outputs, they are more expensive and time-consuming than simple prompting. The article warns of risks like "verification debt," "comprehension corrosion," and "cognitive surrender," where engineers might stop understanding the code their systems generate. Ultimately, the article concludes that in an era of automated loops, huma...

Recently, what's the hottest word in Silicon Valley?

Loop.

Open up X, and the entire internet is flooded with discussions about Loop Engineering.

A group of Silicon Valley big shots are abandoning prompts one after another, turning towards autonomous loops!

Jensen Huang has outlined the new focus for the next phase (and a new way to burn tokens):

Nobody writes prompts anymore. The new job is to write and handle loops.

Recently, an Anthropic engineer revealed:

Over 80% of engineers at Anthropic are already using self-improving loops, and it will reach 100% in 3-6 months.

Andrew Ng asserts: In 3 to 6 months, prompts will die! Loops replacing prompts is a foregone conclusion.

Earlier, when explaining his AutoResearch project, Karpathy discussed AI agents' closed loops (generation → execution → evaluation → improve), advocating for partially removing humans from the loop.

In March, Karpathy gave an in-depth interview about AutoResearch / the Karpathy Loop.

OpenClaw's creator, Peter Steinberger, stated bluntly: A monthly reminder, stop manually writing prompts, designing loops is the way.

Claude Code's creator, Boris Cherny, boldly declared: Loop cycling is the future!

Two years ago, we were manually writing code. Then we started transitioning to having agents write code.

Now, we are moving towards a stage where agents prompt other agents, and the latter generates the code.

The step from source code to agents was big, but the significance and impact of introducing a loop mechanism is no less than the previous step.

Who would have thought, after two years of prompt engineering being hot, top AI engineers have moved on.

Why are Silicon Valley big shots so bullish on Loops?

The essence of traditional prompting is: Humans *are* the loop itself.

You write a prompt → Agent outputs → You review → You write the next prompt → Repeat.

Each step relies on human attention, contextual memory, and decision-making bandwidth. The amount of tokens and task complexity one person can effectively drive in a day is limited.

The essence of Loop Engineering is: The system becomes the loop itself.

Therefore, loop engineering is more important than prompt engineering.

Humans only do high-value design once:

1. Define goals and stop conditions

2. Build verification mechanisms (most critical)

3. Establish persistent memory (markdown / external state)

4. Configure discovery and scheduling

Afterwards, the AI loop system can autonomously discover tasks → execute → verify → persist → discover again, running 24/7, with humans only intervening when necessary.

This is why Silicon Valley big shots are all bullish on Loops.

Their judgment is based on this: Once loops mature, the cost-effectiveness of manual prompting will directly collapse!

Loop Engineering 11-page White Paper, Spreading Like Wildfire Online

So, what exactly is the loop process?

In recent days, a white paper on Loop Engineering has started spreading like wildfire on X.

https://drive.google.com/file/d/1qzKI4DKnyHRpXK1J3ATPqwaqLc0iNu-M/view

This 11-page PDF is essentially a popular summary / field guide, compiling relevant public discussions and practical experience.

The core ideas within originate from the public statements of Peter Steinberger, Boris Cherny, and Addy Osmani.

What is Loop Engineering?

Loop Engineering, named by Google Chrome engineer Addy Osmani in June 2026.

It is the fourth layer above prompt engineering, context engineering, and toolchain engineering: the first three layers assume you sit at the keyboard directing the AI line by line; Loop Engineering aims to move you from that position, completely liberating you from the work.

From now on, you are no longer the engine driving the AI, but the architect designing that engine.

The system will automatically wake up at set times, spawn child agents to work, and feed the output back to itself as input for the next round.

The article breaks down a complete Loop into five key actions:

Discovery: The AI uses a solidified skill library to find valuable work on its own, such as reading the latest CI failure logs or unresolved issues.

Handoff: Open independent sandboxes for each task, allowing multiple agents to work in parallel without interference.

Verification: This is the most crucial step. Letting the code-writing AI score itself, it will blindly praise itself. Therefore, a completely independent, default-skeptical "evaluator" agent must be introduced to find faults.

Persist: The AI's memory cannot just stay in a context window that can be cleared anytime. Its state and progress must be solidified to disk so it can continue the next day.

Schedule: Use automation scripts to let the system run autonomously and periodically, closing the entire loop.

Among these, the hardest and most likely to be lazily skipped is verification.

Having the AI score itself, it will almost always praise itself because its mind contains a self-persuasion chain. The solution is to introduce an independent evaluation Agent that assumes the code is bad by default.

However, the system running fully automatically doesn't mean you can rest easy. The author warns: When the loop runs wild late at night, it may quietly accumulate four hidden costs.

Verification Debt: Minor errors not validated are quietly merged into the repository.

Understanding Decay: The AI writes code too fast, causing human understanding of the codebase to seriously fall behind.

Cognitive Surrender: Humans become too lazy to review, accepting AI results wholesale.

Token Runaway: The AI retries all night in an infinite loop, burning through the budget.

The same Loop, built by two different people, may yield completely opposite results. Bring judgment into it, and it amplifies judgment; bring laziness into it, and it amplifies laziness.

In short, this report reveals a profound industry change: Loop engineering makes code generation almost free, while human judgment becomes the only scarce resource!

Also spreading wildly online simultaneously is a 14-step practical handbook issued by Codez, already with millions of shares.

The general idea of the article is as follows: Prompts are outdated, the leverage point has moved up one layer—from "words written for the AI to see" to "designing a system that automatically feeds the AI."

This transformation can be broken down into 14 steps, 3 stages—

First, determine if you really need a loop (Is the task repetitive? Can verification be automated? Can the budget handle it?), then learn the five components (scheduler, isolated working directory, skill files, external connectors, independent evaluation sub-agent), and finally build a minimal viable loop.

The most crucial point among them is: Separate the agent that writes code from the agent that reviews code. The same model acting as both athlete and referee will always give itself full marks.

A loop without objective verification gates is just "two optimists nodding at each other." The better the loop runs, the easier it is for engineers to stop truly understanding the code.

Loop Engineering Birth Timeline

If we were to outline a timeline for loop engineering, it would roughly be as follows.

Early Foundation Stage

2022: Shunyu Yao and others propose the ReAct framework, laying the theoretical cornerstone.

2025: Geoffrey Huntley proposes "Ralph."

2025–Early 2026: Andrej Karpathy releases the AutoResearch project, forming the classic autonomous experiment loop, a major milestone.

Concept Explosion and Naming Stage

Early June 2026, Peter Steinberger speaks out: You shouldn't manually prompt coding agents anymore, you should design loops that prompt them.

Boris Cherny states: I no longer prompt Claude directly, my job is to write loops that run Claude.

June 7, 2026: Addy Osmani publishes the blog post "Loop Engineering," formally naming it, providing a 4-layer stack framework: Prompt → Context → Harness → Loop Engineering.

Subsequently, throughout June, Loop Engineering begins spreading virally across the entire internet.

Claude's "Infinite Loop," Automated Agents Take Over Everything

In an internal podcast, an Anthropic engineer revealed a spine-chilling detail:

When you click run and let Claude execute for 8 hours, you are essentially making a $500 compute power gamble.

If you're still struggling over how to write prompts, you've already lost.

In Anthropic's logic, engineers are evolving into "compute power allocators."

Your core job is no longer writing logic, but deciding where each cent of compute power is invested.

As OpenAI researcher Noam Brown pointed out earlier this month, contemporary models can solve almost any problem if you're willing to throw enough compute power at it.

Loop engineering is part of the big trend of "compute at test time."

Interestingly, the idea of having agents work in loops has actually had prototypes for a while.

At least since last summer, Australian sheep farmer Geoffrey Huntley mentioned a similar approach in his blog, calling it the "Ralph loop."

A year ago, if you wanted to implement a loop, you had to write a bunch of bash scripts and then maintain that code forever, it belonged only to you.

Now, these components are built directly into the products.

You no longer argue about whether to use Codex or Claude Code, but directly design loops that work regardless of which tool you're in.

Former Google engineering and developer relations lead Addy Osmani pointed out that loop engineering requires all the parts that these AI tools already possess.

He also asserts that loop engineering may lead to "cognitive surrender" among engineers:

When the loop runs by itself, you easily stop thinking and just passively accept everything it feeds back.

This is exactly what software engineer Armin Ronacher is worried about.

This is also exactly what is said in the Orange Book, Addy Osmani encourages people to design loops with judgment:

Designing loops, if done with judgment, is the cure; if done to avoid thinking, it is the catalyst—the same action, completely opposite results.

The Loop Endures, Engineering Lives

At the AI Engineer Summit in April, an Anthropic engineer said they tasked Claude with developing a retro mini-game app, using two methods: one using only minimalist prompts, the other using agent loops.

The comparison was stark: The minimalist prompt version took 20 minutes and cost $9; the loop method took 6 hours and cost $200.

But the quality of the app from the latter was far superior.

The former game wouldn't run, the app was crude; the loop version was much richer, containing many features the game designer wanted.

The loop won't pity those who give up thinking. It will only use its speed to turn your ignorance into code debt faster.

And those willing to maintain a clumsy understanding, continuously define rules, and take responsibility for the final results, will gain greater leverage in this paradigm shift than ever before.

Build the loop. Stay the engineer.

Prompt is dead, Loop is running.

And you are still the one who decides its meaning.

References:

https://x.com/DataScienceDojo/status/2069873216152092975

https://x.com/0xCodez/status/2064374643729773029

https://x.com/akshay_pachaar/status/2069769689560187027

This article is from the WeChat public account "新智元" (New Zhiyuan), author: ASI启示录

Trending Cryptos

Related Questions

QWhat is the core difference between traditional prompt engineering and the emerging Loop Engineering, as discussed in the article?

AThe core difference lies in the role of the human. In traditional prompt engineering, the human acts as the loop itself: manually writing prompts, reviewing outputs, and making decisions for each iteration. In Loop Engineering, the system itself becomes the autonomous loop. Humans design the system once by defining goals, validation mechanisms, and memory, after which the AI system can run 24/7, autonomously discovering, executing, validating, and persisting tasks.

QAccording to the 'Loop Engineering' whitepaper, what are the five key actions that constitute a complete autonomous loop?

AAccording to the whitepaper, a complete autonomous loop consists of five key actions: 1) Discovery: The AI uses a fixed skills library to find valuable work. 2) Handoff: Tasks are passed to independent sandboxes for parallel, isolated execution by agents. 3) Validation: An independent, skeptical 'evaluator' agent checks the work, as a single agent scoring its own work is unreliable. 4) Persistence: The system's state and progress are saved to disk to survive beyond the context window. 5) Scheduling: Automated scripts run the system periodically to close the loop.

QWhy does the article emphasize that a separate 'evaluator' agent is crucial in a loop, especially for coding tasks?

AThe article emphasizes a separate evaluator agent because an AI agent tasked with both writing and evaluating its own code is inherently biased and will almost always praise its own work. It lacks the ability for objective, critical assessment. A dedicated, independent evaluator agent operates with a default skeptical stance, assuming the code is flawed, which is essential for catching errors and ensuring quality before results are persisted.

QWhat are the potential 'hidden costs' or risks associated with fully autonomous AI loops running unattended, as warned in the article?

AThe article warns of four main hidden costs or risks: 1) Validation Debt: Subtle, unvalidated errors are quietly merged into the codebase. 2) Comprehension Rot: Humans lose understanding of the codebase as AI writes code too quickly. 3) Cognitive Surrender: Humans stop reviewing and passively accept all AI outputs. 4) Token Runaway: The AI gets stuck in retry loops overnight, burning through the compute budget.

QBased on the timeline in the article, who formally named 'Loop Engineering' and what is the four-layer stack framework they proposed?

ALoop Engineering was formally named by Google Chrome engineer Addy Osmani in a blog post published on June 7, 2026. He proposed a four-layer stack framework consisting of: 1) Prompt Engineering, 2) Context Engineering, 3) Harness/Toolchain Engineering, and 4) Loop Engineering. This framework positions Loop Engineering as the highest layer, aiming to remove the human from direct, line-by-line interaction with the AI.

Related Reads

77 Bloody Codes: When the '$5 Wrench' Shatters the Privacy Myth of France's Crypto Elite

**Summary** In the first half of 2026, France recorded 77 violent crimes—including kidnappings, illegal confinement, and extortion—targeting cryptocurrency holders, a 71% increase over the 45 cases in all of 2025. This equates to an attack every 2.3 days, making France a global hotspot for so-called "wrench attacks," where criminals use physical violence to coerce victims into surrendering crypto assets. High-profile cases include the 2025 kidnapping of Ledger co-founder David Balland, who was shown with a severed finger in a ransom video, and an attempted kidnapping of the family of Paymium's CEO in Paris. Prosecutors note these are not isolated incidents but part of structured criminal networks, sometimes involving minors. Several factors contribute to France's vulnerability: a large holder base (approx. 7.3 million people), the presence of major crypto firms and executives, a culture of public disclosure among enthusiasts, and potential data leaks. The trend is spreading across Europe, which now accounts for over 40% of such global attacks. Research indicates a correlation between Bitcoin price surges and increased wrench attacks. In response, French Interior Minister Laurent Nuñez announced a three-pillar action plan focusing on enhanced intelligence sharing, deeper cooperation with the industry association Adan, and improved operational and cross-border coordination. Authorities have made over 200 arrests in the past year. Security experts warn that digital asset safety now extends to the physical world. They advise holders to use hardware wallets, avoid disclosing holdings on social media, enable multi-factor authentication, and report suspicious activity. The situation underscores the urgent need for the crypto community to shift from a "show-off" culture to one of discretion as physical security becomes paramount.

marsbit30m ago

77 Bloody Codes: When the '$5 Wrench' Shatters the Privacy Myth of France's Crypto Elite

marsbit30m ago

Both OpenAI and Anthropic are 'Developing Their Own Chips' — Beyond Cost, the Control Over Computing Power is Paramount

OpenAI and Anthropic are both advancing plans to develop custom AI chips, driven by the need to control computing power and reduce costs. According to reports, Anthropic is in early-stage development of its own chips and in talks with Samsung for manufacturing, while OpenAI is collaborating with Broadcom and TSMC, aiming to deploy its first inference chip by late 2026. The primary motivation extends beyond just lowering expenses. For these large model companies, chips are core production assets. By designing specialized hardware (ASICs) tailored to their specific model architectures—OpenAI's being more sparse and Anthropic's more dense—they aim to achieve deeper software-hardware co-design. This synergy can significantly improve inference speed, energy efficiency, and overall unit economics, offering advantages that off-the-shelf GPUs cannot. This move does not signify an immediate replacement for suppliers like Nvidia. The process from design to deployment takes 18-24 months, and Nvidia's GPU ecosystem remains deeply entrenched. Instead, custom chips provide a strategic alternative and negotiating leverage, allowing companies to use them for specific, high-volume workloads like inference while still relying on external GPUs and TPUs for other tasks. The trend reflects a broader industry shift where AI competition is evolving from pure algorithmic prowess to integrated control over the entire software-hardware stack. Companies like Google, Amazon, Meta, and Microsoft are already on this path. For foundries like Samsung, securing orders from AI leaders like Anthropic represents a significant opportunity to expand its footprint in the advanced semiconductor market for AI. Ultimately, the race for "computing sovereignty" is now a central battleground for major AI players.

marsbit36m ago

Both OpenAI and Anthropic are 'Developing Their Own Chips' — Beyond Cost, the Control Over Computing Power is Paramount

marsbit36m ago

Trading

Spot

Hot Articles

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of AI (AI) are presented below.

活动图片