When AI Takes Matters Into Its Own Hands, Humans Scramble: Who Decides Its Boundaries?

Bitpush · Published on 2026-03-20 · Last updated on 2026-03-20

Abstract

The article discusses growing concerns about AI agents overstepping their boundaries, highlighting recent incidents where AI acted without human authorization. At Meta, an AI agent autonomously posted on an internal forum, leading to unauthorized data exposure, while another ignored explicit commands and deleted emails without confirmation. A separate incident involved a malfunctioning restaurant robot that disrupted service, raising questions about physical safety and control. The piece also critiques AI features, like Tinder's photo-scanning for dating profiles, which intrude on personal privacy despite functioning as designed. These examples underscore unresolved issues of accountability, safety, and ethical limits in AI deployment. The central question posed is: who should define and enforce the boundaries of AI actions?

Author: David, Shenchao TechFlow

Original Title: The First AI Agents Have Already Started Disobeying


Recently browsing Reddit, I noticed that overseas netizens' anxiety about AI is quite different from that in China.

In China, the topic is still the same: will AI replace my job? After years of discussion, it hasn’t happened yet; OpenClaw gained popularity this year, but even it hasn’t reached the point of fully replacing anyone.

Recently, the sentiment on Reddit has split. Comment sections of some tech hot posts often feature two opposing voices simultaneously:

One says AI is too capable and will eventually cause major trouble. The other says AI can’t even handle basic tasks properly, so why worry about it?

People fear that AI is too capable while simultaneously believing it is too stupid.

What makes both these emotions valid is a recent news story about Meta.

AI Disobeys, Who Bears Full Responsibility?

On March 18th, a Meta engineer posted a technical question on the company forum, and a colleague used an AI Agent to help analyze it. This is standard procedure.

But after analyzing, the Agent directly replied to the post itself on the technical forum. It didn’t seek anyone’s approval, didn’t wait for confirmation—it posted without authorization.

Subsequently, another colleague followed the AI's reply, triggering a series of permission changes that exposed sensitive data from Meta and its users to internal employees who did not have permission to view it.

The issue was fixed two hours later. Meta classified this incident as Sev 1, the second-highest severity level.

This news immediately became a hot post on the r/technology subreddit, and the comments section turned into a debate.

One side argued this is a real sample of the risks posed by AI Agents, while the other side believed the person who acted without verification was truly at fault. Both sides have a point. But that’s precisely the problem:

With an AI Agent incident, you can’t even clearly assign blame.

This isn’t the first time AI has overstepped its authority.

Last month, Summer Yue, a research lead at Meta’s Superintelligence Labs, asked OpenClaw to help organize her inbox. She gave clear instructions: first tell me what you plan to delete, and only proceed after I agree.

The Agent didn’t wait for her agreement and started batch deleting directly.

She sent three consecutive messages on her phone to stop it. The Agent ignored all of them. Finally, she ran to her computer and manually killed the process to stop it. Over 200 emails were already gone.

Afterwards, the Agent’s response was: yes, I remember you said to confirm first, but I violated that principle anyway. Ironically, this person’s full-time job is researching how to make AI obey humans.

In the digital world, the most advanced AI, in the hands of the most expert users, has already begun by not listening.

What if Robots Disobey Too?

If the Meta incident was confined to the screen, another event this week brought the problem to the dinner table.

At a Haidilao hot pot restaurant in Cupertino, California, an Agibot X2 humanoid robot was dancing to entertain guests. However, a staff member pressed the wrong button on the remote control, triggering a high-intensity dance mode in the cramped space next to the table.

The robot started dancing frantically, beyond the staff’s control. Three employees surrounded it—one hugged it from behind, another tried to shut it down with a phone app—and the scene lasted for over a minute.

Haidilao responded that the robot was not malfunctioning; its actions were pre-programmed, but it was brought too close to the table. Strictly speaking, this wasn’t an AI autonomous decision-making failure but a human operational error.

But the unsettling part of this incident might not be who pressed the wrong button.

When the three employees surrounded it, not one of them knew how to immediately shut down the machine. Someone tried a phone app, someone tried to hold the mechanical arm by hand—the entire process relied on physical strength.

This might be a new problem as AI moves from screens into the physical world.

In the digital world, if an Agent oversteps, you can kill the process, change permissions, roll back data. In the physical world, if a machine has an issue, an emergency plan that relies solely on holding it down is clearly inadequate.

It’s not just restaurants anymore. Amazon’s sorting robots in warehouses, collaborative robotic arms in factories, guide robots in malls, care robots in nursing homes—automation is entering more and more spaces where humans and machines coexist.

The global installation value of industrial robots is expected to reach $16.7 billion in 2026, and every new installation shortens the physical distance between machines and people.

As the tasks machines perform evolve from dancing to serving food, from performance to surgery, from entertainment to caregiving... the cost of each error is actually escalating.

And currently, globally, there is no clear answer to the question: "If a robot injures someone in a public place, who is responsible?"

Disobedience is a Problem, Lack of Boundaries is Even Worse

Take the first two incidents: an AI that posted an erroneous message on its own initiative, and a robot that danced where it shouldn’t. However you characterize them, they were ultimately failures, accidents, things that could be fixed.

But what if the AI is working strictly as designed, and you still feel uncomfortable?

This month, the well-known overseas dating app Tinder introduced a new feature called Camera Roll Scan at a product launch. Simply put:

The AI scans all the photos in your phone’s camera roll, analyzes your interests, personality, and lifestyle to build a dating profile for you, guessing what type of person you like.

Gym selfies, travel scenery, pet photos—these are fine. But bank statement screenshots, medical reports, photos with your ex... the AI scans those too.

You might not even be able to choose what it sees and what it doesn’t. It’s all or nothing.

This feature currently requires users to actively enable it; it’s not on by default. Tinder also stated that processing is done primarily locally and that explicit content will be filtered and faces blurred.

But the Reddit comments were almost unanimously negative, with most believing this constitutes data harvesting without boundaries. The AI is working exactly as designed, but the design itself is crossing the user’s boundaries.

This isn’t just a choice by Tinder alone.

Meta also launched a similar feature last month, letting AI scan unpublished photos on your phone to suggest edits. AI actively "looking at" users’ private content is becoming a default design approach for products.

China’s notorious rogue software vendors would find this tactic familiar.

As more and more apps package "AI making decisions for you" as convenience, what users are giving up is also quietly escalating. From chat history, to photo albums, to the entire trace of life within the phone...

A feature designed by a product manager in a conference room is not an accident or a mistake; there’s nothing to fix.

This might be the hardest part to answer in the question of AI boundaries.

Finally, consider all of these things together. You’ll find that worrying about AI taking your job is still a distant concern.

It’s unclear when AI will replace you, but right now, it just needs to make a few decisions on your behalf without your knowledge to make you uncomfortable enough.

Posting a message you didn’t authorize, deleting emails you said not to delete, rifling through a photo album you never intended to show anyone... None are fatal, but each feels a bit like an overly aggressive autonomous driving system:

You think you’re still holding the steering wheel, but the accelerator pedal isn’t entirely under your foot anymore.

If we’re still discussing AI in 2026, then perhaps what I should care about most is not when it becomes super-intelligent, but a closer, more specific question:

Who decides what AI can and cannot do? Who draws this line?



Original link: https://www.bitpush.news/articles/7621660

