The First to Bring an AI OS to 1.4 Billion People Might Actually Be WeChat?

marsbit發佈於 2026-06-10更新於 2026-06-10

文章摘要

WeChat has introduced a significant AI update, allowing mini-program developers to integrate their services with WeChat AI. Developers can choose an "automatic mode," where WeChat AI autonomously analyzes and operates mini-programs without additional coding, or a "development mode" for creating customized skills. This move effectively transforms WeChat's vast ecosystem—including millions of mini-programs, WeChat Pay, and official accounts—into an execution layer for AI. The technical documentation reveals that WeChat's approach aligns with industry standards like MCP (Model Context Protocol) and incorporates practical lessons from AI-agent development. Key design principles include a clear "attention weight" system for API calls and a "fact + action" response structure to ensure reliable operations. Unlike Apple's Siri, which struggles with third-party app integration, WeChat's centralized control over mini-program code provides a "God's-eye view," enabling seamless AI orchestration across services. This development revives the concept of "WeChat OS," where the app could function as a natural-language-operated platform for daily tasks—from booking flights to ordering food—all within a chat interface. While challenges remain in areas like payment security and user trust, WeChat's existing service network and massive user base position it uniquely to advance AI agents from conversation to actionable assistance, potentially making complex tasks feel effortless for its 1.432 bi...

WeChat's AI has finally moved.

On the very same day as Apple's WWDC, WeChat did something potentially more significant than Apple, releasing an unassuming announcement: "Guidance for Developers to Access the WeChat AI Ecosystem."

Starting today, Mini Program developers can grant authorization, allowing WeChat AI to read, operate, and invoke the functionalities of Mini Programs.

WeChat offers two access modes. The first is "Automatic Mode," with a nearly zero barrier to entry. Developers simply toggle a switch, and the platform itself reads the source code, analyzes pages, figures out what the Mini Program can do, and then the AI can directly operate it, all without writing a single line of code.

The other mode is called "Developer Mode," where developers create customized Skills, which are called by the AI after passing review. Both modes can be enabled simultaneously. Meituan has already announced its integration.

This should not be understood merely as the launch of another new feature. Instead, it signals that WeChat is turning its entire ecosystem—millions of Mini Programs, WeChat Pay, service notifications, Official Accounts—into the execution layer for AI.

Examining the Skill Documentation: How WeChat AI Invokes Mini Programs

The WeChat open documentation publicly shares the technical specifications for Mini Programs to integrate AI Skills. A close look reveals many design details hidden within.

Official skill documentation guide 👇🏻:

https://developers.weixin.qq.com/miniprogram/dev/ai/best-practices.html

From an architectural perspective, those familiar with AI development will immediately recognize it: it's essentially MCP (Model Context Protocol). The `mcp.json` declares the function and parameters of each atomic interface, and `SKILL.md` describes how the entire business process runs. This is almost identical to the MCP+Skills architecture found in Claude, Cursor, or VS Code. WeChat didn't reinvent the wheel; it directly adopted the industry-standard that is currently converging.

In the guidance, WeChat provides a clear "attention weight" system. When the AI decides which interface to call and what parameters to generate, it gives the highest priority to the content returned by the interface (five stars), followed by the interface description (four stars) and parameter descriptions (four stars) in `mcp.json`. `SKILL.md` ranks last (three stars). This means where developers write something matters more than what they write—the weight the AI gives to the same rule is completely different if it's written in the interface return versus in `SKILL.md`.

At the interface return level, there is a core specification: a two-stage "Fact + Action" format. First, tell the AI "what happened," then tell it "what to do next." If only the action is written without the fact, the AI might interpret "display card" as "prepare to call the next interface" and skip user confirmation. This is a rule learned after stepping into many pitfalls.

Fourth, parameter passing prioritizes using IDs over natural language. Taking the "Coffee Ordering" scenario in the diagram as an example, after the user states a need, the AI understands the vague intent and handles selection, modification, specification changes, and payment processing, all without leaving the chat dialog.

This design reveals a signal: WeChat has already run enough practical cases, knows where the pitfalls lie in AI calling external services, and has solidified these experiences into developer norms.

In fact, comparing WeChat Mini Programs, which are also known for their "ecosystem," to Apple's apps, WeChat possesses a kind of "God's-eye view" over its own ecosystem. This is the prerequisite for all this implementation.

Why It Might Be More Important Than Apple's AI

This year at WWDC, Apple released the new Siri AI. Despite having Google Gemini integrated at the underlying level and supporting natural language creation for Shortcuts, it didn't spark much discussion.

A closer look reveals the gap: Apple is making AI coordinate some native functions within the iOS system. Once it involves third-party applications—those apps installed on your phone—it becomes strained.

Take Ele.me, for example. Its code runs on Ele.me's own servers, which Apple cannot read. For Siri to call Ele.me, Ele.me's engineers must proactively connect to the App Intents API, negotiating and integrating one by one, a time-consuming and labor-intensive process.

What WeChat is doing is enabling AI to directly operate millions of third-party services, because Mini Programs are different. The code for every Mini Program, from developer submission, through WeChat's review, to finally running on the user's phone, remains entirely within WeChat's technical system throughout the entire process. During the review phase, WeChat can scan the code, automatically analyzing "what pages this Mini Program has, what it can do, what its inputs and outputs are."

This is why "Automatic Mode" is possible—developers don't need to write a single line of code. They just flip a switch, and WeChat itself can translate your Mini Program into a tool the AI can invoke. WeChat's foundational architecture naturally supports this. It possesses a "God's-eye view," enabling scheduling based on centralization.

Apple does not have this architectural advantage, and neither does Google.

Also noteworthy is the recent rumor that WeChat is collaborating with Huawei, Honor, Xiaomi, OPPO, and vivo to launch A2A (Agent-to-Agent) assistant capabilities, allowing users to directly initiate WeChat audio/video calls or send messages via their phone's voice assistant.

Internally, WeChat AI can invoke millions of Mini Programs. Externally, smartphone manufacturers' AI assistants can invoke WeChat. WeChat is becoming the super connector of the AI era, a service hub that all AIs can access.

The Old Prophecy of "WeChat OS"

When Mini Programs were launched, many joked that WeChat wanted to become "WeChat OS." Back then, it was more of a figure of speech—Mini Programs replaced some app functionalities but were essentially a "light application platform."

More coincidentally, the centralized review mechanism designed at the time was for quality and security control. Nine years later, this design, initially criticized as "excessive control," has unexpectedly become an infrastructure advantage in the AI era. The distributed App ecosystem (Apple/Android) seemed more "free" at the time, but now it has become an obstacle to AI integration.

An old prophecy, due to the emergence of new-era technology—AI—has taken on a transformative change.

Previously, when writing about OpenClaw and Feishu, I mentioned a judgment: IM (Instant Messaging) is the most natural entry point for AI Agents because dialogue itself is the most natural interaction between humans and AI, and the service ecosystem (bots, payments, mini-programs) inherent to IM allows AI not only to "chat" but also to "do." Feishu is already moving in this direction, launching enhanced Bot APIs and AI Agent nodes.

However, Feishu is an enterprise collaboration tool, covering work scenarios. WeChat has a distinctly different breadth—1.432 billion monthly active users, hundreds of Mini Programs across niche sectors, covering almost all service needs of a person's daily life, from ordering takeout to hospital registration to buying flight tickets to paying utility bills.

If WeChat AI can indeed smoothly invoke these Mini Programs to complete tasks, then, as the prophecy said, it becomes an operating system operated with natural language.

A user says, "Help me book a high-speed rail ticket from Beijing to Shanghai for tomorrow at 3 pm." The AI deconstructs the intent, invokes the 12306 Mini Program to check tickets, select seats, completes the order via WeChat Pay, all without leaving WeChat. This path can theoretically be executed today.

Of course, there is distance between theory and reality. AI invocation involving payment scenarios requires a near-zero error tolerance—ordering the wrong coffee is a minor issue, but buying the wrong flight ticket is a major one. The accuracy requirements for the underlying model are far higher than for conversational scenarios. This is also a common bottleneck facing AI Agent implementation globally: the gap between "able to chat" and "able to get things done" is not measured by technical metrics but by trust.

But WeChat has at least done one thing right: it didn't build a service network from scratch. Over the years, what ChatGPT has been doing is first having a smart brain, then connecting one by one to Shopify, DoorDash, Stripe, each connection built from the ground up. To this day, transaction-related queries still account for less than 3%.

The real change that is about to happen might be imperceptible to most users. One day, you type into WeChat, "Help me book a ticket to Shanghai for 9 pm tonight," and then it's done. You might not even know which Mini Program was called in the background or what payment process was followed.

This "imperceptible completion" is the true mark of a mature AI Agent. WeChat is closer to this step than anyone else.

This article is from the WeChat Official Account "APPSO", author: APPSO discovering tomorrow's products

你可能也喜歡

Kraken计划为美国专业交易者推出CFTC监管的永续期货

加密货币交易所Kraken计划通过其收购的衍生品交易所Bitnomial，为符合条件的美国专业交易者推出受美国商品期货交易委员会（CFTC）监管的永续期货合约。此举旨在为美国交易者提供一个受监管的国内渠道，以获取在海外加密货币市场中占据主导地位的衍生品。永续期货是全球加密货币交易中的重要产品，允许交易者在没有到期日的情况下持有多头或空头头寸，并通过资金费率机制使合约价格贴近现货市场。由于监管限制，美国交易者此前难以通过本土受监管平台获得此类产品。 Kraken计划将这款产品整合至其Kraken Pro平台，为交易者提供统一的现货、保证金和期货交易界面。该合约设计将包含连续定价、无到期日以及每八小时结算的资金费率。然而，该产品目前仅面向符合条件的美国专业交易者，并非对所有零售用户开放。此举若成功推行，可能吸引更多流动性，并推动其他竞争对手寻求类似的受监管途径，从而将永续期货进一步纳入美国受监管的市场基础设施。不过，产品的实际影响将取决于用户准入范围、流动性深度以及执行质量等因素。

bitcoinist5 小時前

bitcoinist5 小時前

沃什首秀：史上最懂Crypto的FED主席会为市场带来惊喜还是惊吓？

2026年6月16日，新任美联储主席凯文·沃什将迎来首次货币政策记者会。他面临通胀升温、国债被抛售与白宫降息压力的复杂局面。沃什因个人资产中持有Solana等多个加密资产而备受关注，是首位间接投资加密领域的联储主席。沃什的政策立场呈现双重性：一方面，他以通胀鹰派著称，可能倾向紧缩货币政策；另一方面，他长期关注加密资产，视其为宏观经济的“监测器”和美国竞争力的组成部分，有望推动更友善的监管框架。这种“鹰派利率+友善监管”的组合可能成为影响加密市场的关键。其上任可能从三方面影响加密市场：一是监管范式从“防御”转向“整合”，利好稳定币和DeFi发展；二是其清晰的沟通可能降低市场政策不确定性；三是其背景可能加速全球机构资金配置加密资产。发布会结果可能呈现两种情境：若释放鸽派基调并认可数字资产创新，市场或迎惊喜；若鹰派超预期，强调加息，风险资产可能普遍承压。尽管沃什已承诺出售加密持仓，但其对技术的深层理解，长期看将为加密资产主流化提供更稳固的基础。市场关注点在于能否从其信号中窥见一个更具连贯性的新时代轮廓。

marsbit5 小時前

marsbit5 小時前

XRP Ledger 发布 3.2.0 版本升级并启用 XRPLd 新品牌名

XRP Ledger发布了3.2.0版本，这是对其底层区块链基础设施的一次重要升级。本次更新的核心是将运行网络的软件名称从“rippled”更名为“xrpld”，以更好地反映整个项目生态。与此前侧重于前端功能的版本不同，3.2.0版本优先进行了后端升级和效率提升，旨在增强网络性能并为未来的扩展做准备。关键改进包括内存优化措施，预计可节省高达40%的服务器内存使用。此次升级引入了名为“fixCleanup3_2_0”的修改，为单资产金库、借贷协议、权限系统、去中心化交易所、多用途代币和权限域等多个模块带来了安全性增强。开发团队还新增了不变性检查，以确保已删除账户不会在账本上留下不一致的数据，从而加强整个网络的完整性和可靠性。对于开发者而言，新版本增加了一项重要功能：应用程序无需连接服务器即可检索XRP Ledger协议和服务器定义信息，这将极大便利钱包、区块链浏览器和API等的开发工作。在可扩展性和稳定性方面，更新包括可配置的区块大小、通过nuDB实现的高效数据库存储，以及将gRPC服务器的TLS/双向TLS支持改为可选，以提升企业用户的性能和连接性。此外，默认对等端口从51235更改为2459，并修复了涉及自动做市商、支付、代币托管、多用途代币、订单簿和RPC等多个方面的问题。出于性能考虑，3.2.0版本暂时禁用了交易不变性检查，但开发团队表示这不会构成安全威胁。

TheNewsCrypto6 小時前

TheNewsCrypto6 小時前

AGI不是终点，DeepMind新论文：迈向ASI，真正的AI进步才刚开始

DeepMind团队最新研究报告提出，通用人工智能（AGI）很可能不是AI发展的终点。AI将继续超越人类水平，走向超级人工智能（ASI）。报告区分了AGI（达到人类中位认知水平）、ASI（在所有重要领域整体超越人类专家集体）和UAI（理论智能上界）三个概念，并探讨了从AGI迈向ASI的四条潜在路径：持续扩展计算、模型与数据规模；算法持续演化乃至范式转变；系统通过递归自我改进形成正反馈；以及通过多智能体协作形成超越单体的集体智能。报告同时指出了发展过程中可能面临的六大关键瓶颈：高质量人类数据可能面临枯竭的“数据墙”；能源、芯片等经济和自然资源压力；现有神经网络范式在持续学习、稳定推理等方面的根本性局限；研究难度随领域成熟而增加；AI可能难以自主提炼新概念原语的“抽象壁垒”；以及监管、治理与社会反弹带来的影响。报告最后强调，一旦AI能力超越人类，现有以人类水平为参照的评估体系将失效，需要建立面向后AGI时代的新评估机制。ASI的发展仍受物理规律、资源等现实约束，其具体路径与速度存在高度不确定性，未来需持续开展跨学科研究以应对这一前景。

marsbit7 小時前

marsbit7 小時前

Kraken推出OpenAI和Anthropic的Pre-IPO永续合约，提供高达5倍杠杆

加密货币交易所Kraken推出了针对OpenAI和Anthropic的“IPO前永续合约”（pre-IPO perps），允许符合条件的交易者在这两家备受关注的私人人工智能公司公开上市前，进行高达5倍杠杆的多空交易。此举标志着加密货币衍生品平台正超越数字资产范畴，尝试对接私人公司股权等链下资产，为散户投资者提供原本难以触及的热门投资主题（如AI）的敞口。然而，这类产品与普通的加密货币永续合约存在显著差异。其定价基于不透明、非连续的私人市场估值（受融资轮次、二级交易、IPO预期等因素影响），而非流动的现货市场价格，因此风险管理更为复杂。高杠杆进一步放大了与估值波动、流动性及上市时间表相关的独特风险。总之，Kraken的推出显示了加密交易基础设施向更广泛投机标的的扩张，为投资者提供了新的表达观点的工具，但产品本身风险较高，投资者需充分了解其复杂性和风险后再谨慎参与。

bitcoinist7 小時前

Kraken推出OpenAI和Anthropic的Pre-IPO永续合约，提供高达5倍杠杆

bitcoinist7 小時前

交易

現貨

合約

The First to Bring an AI OS to 1.4 Billion People Might Actually Be WeChat?

文章摘要

Why It Might Be More Important Than Apple's AI

The Old Prophecy of "WeChat OS"

相關問答

你可能也喜歡

Kraken计划为美国专业交易者推出CFTC监管的永续期货

沃什首秀：史上最懂Crypto的FED主席会为市场带来惊喜还是惊吓？

XRP Ledger 发布 3.2.0 版本升级并启用 XRPLd 新品牌名

AGI不是终点，DeepMind新论文：迈向ASI，真正的AI进步才刚开始

Kraken推出OpenAI和Anthropic的Pre-IPO永续合约，提供高达5倍杠杆

交易

熱門文章

如何購買PEOPLE

相關討論

熱門問答

熱門分類

熱門標籤