Conversation with ClawdBot Founder: AI is a Lever, Not a Replacement

marsbitPublished on 2026-02-02Last updated on 2026-02-02

Abstract

In this interview, ClawdBot (now OpenClaw) founder Peter Steinberger discusses his AI assistant that operates via messaging apps like WhatsApp and Telegram, connecting to the user's computer applications. He describes it as a "weird but incredibly smart friend living in your computer." Key points: - Built an initial prototype in an hour; it now has ~300k lines of code. - AI can perform tasks like fixing code bugs, transcribing audio without prior setup, and controlling smart home devices. - Believes AI can replace 80% of mobile apps by accessing APIs and services directly. - Criticizes complex agent orchestration systems as "slop generators" and emphasizes that human taste and judgment are irreplaceable. - Argues programming languages matter less now; what matters is engineering intuition and system design. - AI acts as a lever to amplify human capability, not a replacement for human oversight.

Organized by: Baoyu

This is another 40-minute interview with Peter Steinberger, author of ClawdBot/OpenClaw, hosted by Peter Yang.

Peter is the founder of PSPDFKit, with nearly 20 years of iOS development experience. After the company received a strategic investment of 100 million euros from Insight Partners in 2021, he chose to "retire." Now, the Clawdbot he developed (now renamed OpenClaw) is a huge hit. Clawbot is an AI assistant that can chat with you through WhatsApp, Telegram, and iMessage, connected to various applications on your computer.

Peter describes Clawbot like this:

"It's like a friend living in your computer, a bit weird, but frighteningly smart.

In this interview, he shares many interesting views: why complex agent orchestration systems are "slop generators," why "running AI for 24 hours" is a vanity metric, and why programming languages no longer matter.

One-Hour Prototype, 300,000 Lines of Code

Peter Yang asked him what Clawbot actually is and why the logo is a lobster.

Peter Steinberger didn't directly answer the lobster question but told a story. After "retiring," he returned and fully immersed himself in vibe coding—letting AI agents write code for you. The problem was, an agent might run for half an hour or stop after two minutes to ask a question. You come back from a meal to find it stuck long ago, which is annoying.

He wanted something to check his computer's status on his phone anytime. But he didn't build it because he thought it was too obvious, and big companies would surely do it.

"By November last year, no one had done it, so I thought, forget it, I'll do it myself.

The initial version was extremely simple: connect WhatsApp to Claude Code. Send a message, it calls the AI, and sends the result back. It took an hour to set up.

Then it "came to life." Now Clawbot has about 300,000 lines of code and supports almost all major messaging platforms.

"I think this is the future direction. Everyone will have a super-powerful AI that follows them throughout their life.

He said, "Once you give AI access to your computer, it can basically do anything you can do."

That Morning in Morocco

Peter Yang said that now you don't need to sit in front of the computer staring at it; just give it instructions.

Peter Steinberger nodded, but he wanted to talk about something else.

Once, while celebrating a friend's birthday in Morocco, he found himself using Clawbot constantly. Asking for directions, restaurant recommendations—these were small things. What really surprised him was that morning: someone posted a tweet on Twitter saying there was a bug in one of his open-source libraries.

"I just took a photo of the tweet and sent it to WhatsApp.

The AI read the tweet, understood it was a bug report. It checked out the corresponding Git repository, fixed the issue, committed the code, and then replied to that person on Twitter saying it was fixed.

"I thought, is this even possible?

Another time was even more incredible. He was walking on the street, too lazy to type, so he sent a voice message. The problem was, he never built voice message support for Clawbot.

"I saw it display 'typing,' and thought, oh no. But then it replied normally.

He later asked the AI how it did it. The AI said: I received a file without an extension, so I looked at the file header and found it was Ogg Opus format. You have ffmpeg on your computer, so I used it to convert to WAV. Then I looked for whisper.cpp, but you didn't have it installed, but I found your OpenAI API key, so I used curl to send the audio for transcription.

Peter Yang听完说: These things are really resourceful, though a bit scary.

"Much stronger than the web version of ChatGPT. This is like ChatGPT unleashed. Many people don't realize that tools like Claude Code aren't just good at programming; they are resourceful with any problem.

Command-Line Tool (CLI) Army

Peter Yang asked him how those automation tools were built, whether he wrote them himself or had the AI write them.

Peter Steinberger laughed.

He had been expanding his "CLI army" for months. What are agents best at? Calling command-line tools, because the training data is full of that.

He built a CLI to access the entire Google services, including the Places API. Built one specifically for finding memes and GIFs, so the AI could send memes when replying to messages. He even made a tool to visualize sound, wanting the AI to "experience" music.

"I also hacked into the API of a local food delivery platform; now the AI can tell me how long until the food arrives. Another one reverse-engineered the Eight Sleep API to control the temperature of my bed.

[Note: Eight Sleep is a smart mattress that can adjust bed surface temperature; the official API is not open.]

Peter Yang pressed: Did you have the AI help you build all these?

"The most interesting thing is, I did Apple ecosystem development at PSPDFKit for 20 years, Swift, Objective-C, very specialized. But after coming back, I decided to switch tracks because I was tired of Apple controlling everything, and making Mac apps has too narrow an audience.

The problem was, switching from one精通的技术栈 to another is painful. You understand all the concepts but don't know the syntax. What's a prop? How do you split an array? You have to look up every little problem; you feel like an idiot.

"Then with AI, all that disappeared. Your systems thinking, architectural skills, taste, judgment about dependencies—these are the truly valuable things, and now they can easily migrate to any field.

He paused:

"Suddenly I felt like I could build anything. The language doesn't matter anymore; what matters is my engineering thinking.

Controlling the Real World

Peter Steinberger began demonstrating his setup. The list of permissions he gives the AI is staggering:

Email, calendar, all files, Philips Hue lights, Sonos speakers. He can have the AI wake him up in the morning, slowly increasing the volume. The AI can also access his security cameras.

"Once I told it to watch for strangers. The next morning it told me: 'Peter, someone is here.' I looked at the录像, it had been screenshotting my sofa all night because the camera quality was poor, and the sofa looked like a person sitting there.

In his Vienna apartment, the AI can also control the KNX smart home system.

"It could really lock me out.

Peter Yang asked: How are these connected?

"Just by talking to it. These things are resourceful; it will find the API itself, Google things, look for keys in your system.

Users'玩法 are even crazier:

  • Someone had it shop online at Tesco
  • Someone had it place orders on Amazon
  • Someone had it automatically reply to all messages
  • Someone added it to a family group chat as a "family member"
"I had it check me in on the British Airways website. This is like a Turing test; operating a browser on an airline website, you know how anti-human that interface is.

The first time it took almost 20 minutes because the整套系统 was still rough. The AI needed to find his passport in his Dropbox, extract information, fill out the form, pass the CAPTCHA.

"Now it only takes a few minutes. It can click the 'I'm human' verification button because it's controlling a real browser, its behavior pattern is no different from a human's.

80% of Apps Will Disappear

Peter Yang asked: For a regular user who just downloaded it, what are some safe ways to get started?

Peter Steinberger said everyone's path is different. Some people install it and start using it to write iOS apps immediately; others go straight to managing Cloudflare. One user installed it for themselves the first week, for their family the second week, and started working on an enterprise version for their company the third week.

"After I installed it for a non-technical friend, he started sending me pull requests. He had never sent a pull request in his life.

But what he really wanted to talk about was the bigger picture:

"If you think about it, this thing might replace 80% of the apps on your phone.

Why still use MyFitnessPal to record diet?

"I have an infinitely resourceful assistant; it already knows I made a wrong decision at KFC. I send a photo, it saves it to the database, calculates calories, reminds me I should go to the gym.

Why still use an app to set the Eight Sleep temperature? The AI has API access, it can adjust it for you directly. Why still use a to-do app? The AI remembers for you. Why still use an app to check in for a flight? The AI does it for you. Why still use shopping apps? The AI can recommend, order, track.

"A whole layer of apps will slowly disappear because if they have an API, they are just services your AI will call.

He predicted that 2026 will be the year many people start exploring personal AI assistants, and big companies will also enter the field.

"Clawbot might not be the final winner, but this direction is correct.

Just Talk to It

The topic turned to AI programming methodology. Peter Yang said he wrote a very popular article called "Just Talk to It" and wanted to hear him elaborate.

Peter Steinberger's core point is: Don't fall into the "agentic trap."

"I see too many people on Twitter discovering agents are powerful, then wanting to make them more powerful, and falling down the rabbit hole. They build各种复杂的工具 to accelerate workflows, but end up just building tools, not building truly valuable things.

He fell into it himself. Early on, he spent two months building a VPN tunnel just to access the terminal on his phone. He did it too well; once he was having dinner with a friend at a restaurant, he spent the whole time vibe coding on his phone instead of participating in the conversation.

"I had to stop, mainly for mental health.

Slop Town

What recently drove him crazy was an orchestration system called Gastown.

"A super complex orchestrator, running a dozen or twenty agents simultaneously, they talk to each other, divide labor. There are watchers, overseers, mayors, pcats (probably meaning 'civilians' or 'pet cats' or other filler roles), I don't even know what else.

Peter Yang: Wait, there's a mayor?

"Yes, the Gastown project has a mayor. I call this project 'Slop Town.'

And the RALPH pattern (a "use and discard" single-task loop pattern, meaning giving the AI a small task, discarding all context memory after completion, resetting everything to zero, and then an infinite loop)...

"This is the ultimate token burner. You let it run all night, and the next morning you get the ultimate slop.

The core of the problem is: These agents don't have taste yet. They are frighteningly smart in some aspects, but if you don't guide them, don't tell them what you want, what comes out is garbage.

"I don't know how others work, but when I start a project, I only have a vague idea. During the process of building, playing, feeling, my vision gradually becomes clear. I try some things, some don't work, and then my idea evolves into its final form. My next prompt depends on the current state I see, feel, and think about.

If you try to write everything into前期规格说明, you miss out on this human-in-the-loop cycle.

"I don't know how you can make good things without feeling, without taste being involved.

Someone on Twitter was炫耀 a "fully RALPH-generated" notes app. Peter replied: Yes, it looks like it was generated by RALPH; no normal person would design it like this.

Peter Yang summarized: Many people run AI for 24 hours not to make an app, but to prove they can make AI run for 24 hours.

"It's like a measuring contest without a reference point. I've also let loops run for 26 hours and felt proud at the time. But this is a vanity metric, meaningless. Being able to build everything doesn't mean you should build everything, nor does it mean it will be good.

Plan Mode is a Hack

Peter Yang asked him how he manages context. When the conversation gets long, the AI gets confused; does it need manual compression or summarization?

Peter Steinberger said this is a problem of the "old mode."

"Claude Code still has this problem, but Codex is much better. On paper, it might only be 30% more context, but it feels like 2-3 times more. I think it's related to the internal thinking mechanism. Now most of my feature development can be completed within one context window; discussion and building happen simultaneously.

He doesn't use worktrees because that's "unnecessary complexity." He simply checks out several copies of the repository: clawbot-1, clawbot-2, clawbot-3, clawbot-4, clawbot-5. Use whichever is idle, test when done, push to main, sync.

"It's a bit like a factory, if they are all busy. But if you only have one, the waiting time is too long, you can't get into a flow state.

Peter Yang said this is like a real-time strategy game; you have a team attacking, you have to manage and monitor them.

About plan mode, Peter Steinberger has a controversial view:

"Plan mode is a hack Anthropic had to add because the model is too impulsive and rushes to write code immediately. If you use the latest models, like GPT 5.2, you just talk to it. 'I want to build this feature, it should be like this and that, I like this design style, give me a few options, let's chat first.' Then it will propose, you discuss, reach a consensus, and then start.

He doesn't type; he talks.

"I talk to it most of the time.

<极速赛车开奖网 src="https://d1x7dwosqaosdj.cloudfront.net/images/2026-02/c8cf479862ea4d3788da7d8f7b61f7a0.jpeg" alt="">

Discord-Driven Development

Peter Yang asked him what his process is for developing new features. Explore the problem first? Make a plan first?

Peter Steinberger said he did something "probably the craziest thing I've ever done": he connected his own Clawbot to a public Discord server, letting everyone talk to his private AI, with his private memories, in public.

"This project is hard to describe in words. It's like a mix of Jarvis and the movie 'Her.' Everyone I演示 in person gets super excited, but posting pictures with text on Twitter just doesn't take off. So I thought, just let people experience it themselves.

Users ask questions, report bugs, and request features in Discord. His current development process is: take a screenshot of a Discord conversation, drag it into the terminal, and tell the AI "let's talk about this."

"I'm too lazy to type. Someone asks 'do you support this or that,' I have the AI read the code and then write an FAQ.

He also wrote a crawler that scans the Discord help channel at least once a day, has the AI summarize the biggest pain points, and then they fix them.

No MCP, No Complex Orchestration

Peter Yang asked: Do you use those fancy things? Multi-agent, complex skills, MCP (Model Context Protocol), etc.?

"Most of my skills are life skills: recording diet, grocery shopping, that kind of stuff. Very few programming ones, not needed. I don't use MCP, don't use any of those things.

He doesn't believe in complex orchestration systems.

"I'm in the loop; I can make products that feel better. Maybe there are faster methods, but I'm already fast to the point where the bottleneck isn't the AI; I'm mainly limited by my own thinking speed, occasionally by the time waiting for Codex.

His former PSPDFKit co-founder, a former lawyer, is now also sending him PRs (pull requests).

"AI allows people without technical backgrounds to build things; it's magical. I know some people oppose it, saying this code isn't perfect. But I treat pull requests as prompt requests; they convey intent. Most people don't have the same system understanding to guide the model to the optimal result. So I prefer to get the intent and do it myself, or rewrite based on their PR.

He marks them as co-authors but rarely directly merges others' code.

Find Your Own Way

Peter Yang summarized: So the core takeaway is, don't use slop generators, keep the human in the loop, because the human brain and taste are irreplaceable.

Peter Steinberger added:

"Or rather, find your own way. Many people ask me 'how do you do it,' the answer is: you have to explore it yourself. Learning these things takes time, requires making your own mistakes. It's like learning anything, except this field changes特别快.

Clawdbot can be found at clawd.bot and on GitHub. Clad with a W, C-L-A-W-D-B-O-T, like lobster claws.

(Note: ClawdBot has been renamed OpenClaw)

Peter Yang said he also has to try it. Doesn't want to sit in front of the computer chatting with AI; wants to give it instructions anytime while outside with the kids.

"I think you'll like it." Peter Steinberger said.

Peter Steinberger's core views can be summarized in two sentences:

  1. AI is already powerful enough to replace 80% of the apps on your phone
  2. But without human taste and judgment in the loop, the output is garbage

These two sentences seem contradictory but actually point to the same conclusion: AI is a lever, not a replacement. It amplifies what you already have: systems thinking, architectural skills, intuition for good products. If you don't have these, running再多智能体并行 for 24 hours is just mass-producing slop.

His practice itself is the best proof: a 20-year veteran iOS programmer built a 300,000-line code project in TypeScript within a few months, relying not on learning the syntax of a new language, but on those language-agnostic things.

"Programming languages don't matter anymore; what matters is my engineering thinking."

Related Questions

QWhat is the core philosophy behind Peter Steinberger's approach to AI development with ClawdBot/OpenClaw?

APeter Steinberger views AI as a powerful lever that amplifies human capabilities, not as a replacement. He emphasizes that while AI can automate many tasks and potentially replace 80% of mobile apps, human judgment, taste, and engineering intuition are irreplaceable. Without these human elements in the loop, AI systems risk producing low-quality output ('slop').

QHow did Peter Steinberger demonstrate the practical capabilities of ClawdBot in a real-world scenario?

AWhile in Morocco, Peter used ClawdBot to handle a bug report from Twitter. He simply sent a screenshot of the tweet to the AI via WhatsApp. The AI read the tweet, understood it was a bug report, checked out the relevant Git repository, fixed the issue, committed the code, and replied to the user on Twitter—all automatically.

QAccording to the interview, why does Peter Steinberger criticize complex 'agentic' orchestration systems like Gastown?

AHe criticizes them as 'slop generators' or 'slop towns.' He argues that these overly complex systems, which run multiple agents in parallel with roles like 'mayor' and 'watcher,' often burn through computational resources (tokens) all night only to produce low-quality, poorly designed output. They lack human taste and the iterative feedback loop essential for creating good products.

QWhat significant shift in software development does Peter predict due to AI assistants?

APeter predicts that AI assistants like ClawdBot will cause a whole layer of single-purpose mobile apps to disappear. He argues that if a service has an API, it simply becomes a function the AI can call. There will be no need for dedicated apps for tasks like fitness tracking, flight check-in, grocery shopping, or smart home control, as a general-purpose AI can handle them all through integration.

QHow did AI fundamentally change Peter Steinberger's ability to work across different programming languages and domains?

AAI eliminated the friction of switching tech stacks. As a longtime iOS specialist, he found moving to new languages frustrating due to unfamiliar syntax. With AI, he no longer needs to memorize syntax detail. His valuable skills—system-level thinking, architecture, taste, and dependency judgment—became portable, allowing him to build a 300,000-line TypeScript project quickly despite not being an expert in that language.

Related Reads

Farewell to Traditional Bulls and Bears: The Market Has Entered an Era of Rotating Bubbles

Farewell to traditional bull and bear markets; we have entered an era of rolling bubbles. This article uses a meteorological analogy to explain the modern market's shift from slow-moving, long-term trends to a chain of rapid, successive speculative frenzies. The old market resembled "stratiform" weather—slow, broad cycles lasting years. Today's market is like a "mesoscale convective system," where isolated storms (bubbles in sectors like AI, GLP-1 drugs, or crypto) form in sequence. Each is triggered by the outflow of capital and sentiment from the previous one, creating a self-perpetuating chain of booms and busts. This structural change is driven by eight permanent shifts: the democratization of speculation (zero-commission trading, retail options activity), perpetual buying from defined-contribution retirement plans, the dominance of passive investing (creating price-insensitive flows), the rise of multi-strategy funds and high-frequency trading (weakening price discovery), suppressed volatility that erupts violently, an index composition now dominated by long-duration, narrative-driven tech stocks, the elimination of information delays, and a permissive fiscal/monetary backdrop. These conditions ensure that rolling bubbles are the new normal. To navigate this environment, investors should either become deep-sector experts who understand the underlying technologies and business models or become adept observers of trends and capital flows. While chaotic from within each "storm," a higher-altitude view reveals a predictable pattern of serial booms. The key is to avoid being emotionally swept up in any single narrative and to recognize the market's new, permanent structure.

marsbit5m ago

Farewell to Traditional Bulls and Bears: The Market Has Entered an Era of Rotating Bubbles

marsbit5m ago

The Right Way to Use Skills: 5 Reflections After Anthropic Publicly Shared Its Internal Methodology

A deep dive into Anthropic's internal methodology for building effective AI "Skills" reveals five key insights for maximizing their value. First, Skills should focus on capturing "Gotchas" and tacit organizational knowledge—like common pitfalls and undocumented rules—rather than restating general information the AI already knows. Second, think of Skills as a form of "Context Engineering"; they are best structured as folders, not monolithic documents. A core `SKILL.md` file should act as a navigational index, progressively pulling in detailed references, examples, and assets only as needed to avoid overwhelming the model's context window. Third, whenever possible, automate repetitive tasks with scripts. This preserves the model's reasoning capacity for judgment and analysis, while scripts reliably handle the execution, saving tokens and improving accuracy. Instructions within a Skill provide the "why" and the expert judgment, while scripts provide the concrete "how." Fourth, a Skill's description is critical and often misunderstood. It should not be a list of features but a routing rule that clearly signals *when* the Skill should be triggered based on user intent and common phrasing. Finally, as Skills scale from personal tools to team-wide assets, management is crucial. Anthropic advocates for a lightweight, organic approach: let new Skills spread organically within small groups first. Those that prove genuinely useful through adoption naturally graduate to a formal marketplace, ensuring the curated library contains only high-value, battle-tested tools.

marsbit22m ago

The Right Way to Use Skills: 5 Reflections After Anthropic Publicly Shared Its Internal Methodology

marsbit22m ago

Vying for the AI Payment Track: Traditional Card Networks Face Off Against Coinbase

As AI agents increasingly conduct commercial transactions, a battle for control over the underlying payment infrastructure is unfolding. The competition centers on two divergent and incompatible technical approaches for autonomous AI payments. One camp, led by traditional card networks Visa and Mastercard, relies on tokenized card credentials within the established banking rails. Visa's "Intelligent Commerce" and Mastercard's "Agent Pay" services extend their existing tokenization technology to authorized AI agents for consumer retail transactions, leveraging decades of fraud protection and dispute resolution systems. Their partners include major AI firms like Anthropic, OpenAI, and Microsoft. The opposing camp, spearheaded by Coinbase, advocates for an open internet protocol using stablecoins. Coinbase's x402 protocol utilizes the HTTP 402 status code to enable direct, machine-to-machine micropayments with USDC on-chain. This model eliminates card fees and is designed for high-frequency, low-value transactions between AI agents, such as paying for API calls or data streams, where traditional card costs are prohibitive. Currently, application scenarios are clearly divided. Mainstream consumer-facing AI shopping services (e.g., ChatGPT's "one-click checkout," Amazon's AI-assisted shopping) predominantly use card channels due to their mature consumer protections and merchant networks. Conversely, the stablecoin channel dominates machine-to-machine payments, as seen in Amazon Bedrock's core payment service using Base blockchain. Significantly, traditional card networks are not solely defending their turf; they are also investing in the stablecoin arena. Visa has rapidly expanded its stablecoin settlement volume and partnered with Coinbase on interoperability, while Mastercard moved to acquire stablecoin platform BVNK. This dual-strategy indicates their intent to become the fee-collecting gateway for all payment flows, regardless of the underlying rail. The short-term outlook is for coexistence: cards for personal retail, stablecoins for machine transactions. The long-term outcome hinges on whether AI-driven commerce will resemble traditional retail or evolve into a vast network of machine micropayments. Visa and Mastercard's hedging strategy suggests they are prepared for either future, while companies betting on a single channel face greater risk.

Foresight News24m ago

Vying for the AI Payment Track: Traditional Card Networks Face Off Against Coinbase

Foresight News24m ago

For the First Time, Pure Human Video Pretrained VLA for Dexterous Manipulation: Deployable with Minimal Fine-Tuning Data

For the first time, a purely human-video-pretrained Vision-Language-Action (VLA) model for dexterous manipulation requires only a small amount of data for fine-tuning to achieve successful real-world deployment. Achieving human-level dexterous manipulation remains a core challenge in robotics. While multi-fingered hands offer hardware potential, Visual-Language-Action (VLA) models lag behind due to the high cost of collecting diverse, high-quality robot data. A novel framework, VITRA, developed by Microsoft Research Asia and Tsinghua University, addresses this by automatically transforming massive, unlabeled real-world human activity videos into a structured V-L-A training dataset. Key innovations include precise 3D hand motion annotation from monocular video, atomic action segmentation based on hand-speed minima, and automated instruction generation using VLMs combined with 3D trajectory visualization. This process created a massive dataset of 1 million clips. Pretrained exclusively on this human video data, the VLA model (combining a VLM backbone with a Diffusion Transformer action expert) demonstrates strong zero-shot hand motion prediction in unseen environments. Crucially, it requires minimal fine-tuning (~1.2k demonstrations) on real robot data to achieve high-success-rate dexterous manipulation tasks like grasping, placing, pouring, and sweeping on hardware like the Realman robot with the XHAND1 dexterous hand. The model shows exceptional generalization to novel objects and environments. The research also observes promising scaling behavior, where performance improves with more pretraining data, paving the way for more generalized embodied intelligence.

marsbit34m ago

For the First Time, Pure Human Video Pretrained VLA for Dexterous Manipulation: Deployable with Minimal Fine-Tuning Data

marsbit34m ago

Trading

Spot
Futures

Hot Articles

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of AI (AI) are presented below.

活动图片