The First to Bring an AI OS to 1.4 Billion People Might Actually Be WeChat?

marsbit发布于2026-06-10更新于2026-06-10

文章摘要

WeChat has introduced a significant AI update, allowing mini-program developers to integrate their services with WeChat AI. Developers can choose an "automatic mode," where WeChat AI autonomously analyzes and operates mini-programs without additional coding, or a "development mode" for creating customized skills. This move effectively transforms WeChat's vast ecosystem—including millions of mini-programs, WeChat Pay, and official accounts—into an execution layer for AI. The technical documentation reveals that WeChat's approach aligns with industry standards like MCP (Model Context Protocol) and incorporates practical lessons from AI-agent development. Key design principles include a clear "attention weight" system for API calls and a "fact + action" response structure to ensure reliable operations. Unlike Apple's Siri, which struggles with third-party app integration, WeChat's centralized control over mini-program code provides a "God's-eye view," enabling seamless AI orchestration across services. This development revives the concept of "WeChat OS," where the app could function as a natural-language-operated platform for daily tasks—from booking flights to ordering food—all within a chat interface. While challenges remain in areas like payment security and user trust, WeChat's existing service network and massive user base position it uniquely to advance AI agents from conversation to actionable assistance, potentially making complex tasks feel effortless for its 1.432 bi...

WeChat's AI has finally moved.

On the very same day as Apple's WWDC, WeChat did something potentially more significant than Apple, releasing an unassuming announcement: "Guidance for Developers to Access the WeChat AI Ecosystem."

Starting today, Mini Program developers can grant authorization, allowing WeChat AI to read, operate, and invoke the functionalities of Mini Programs.

WeChat offers two access modes. The first is "Automatic Mode," with a nearly zero barrier to entry. Developers simply toggle a switch, and the platform itself reads the source code, analyzes pages, figures out what the Mini Program can do, and then the AI can directly operate it, all without writing a single line of code.

The other mode is called "Developer Mode," where developers create customized Skills, which are called by the AI after passing review. Both modes can be enabled simultaneously. Meituan has already announced its integration.

This should not be understood merely as the launch of another new feature. Instead, it signals that WeChat is turning its entire ecosystem—millions of Mini Programs, WeChat Pay, service notifications, Official Accounts—into the execution layer for AI.

Examining the Skill Documentation: How WeChat AI Invokes Mini Programs

The WeChat open documentation publicly shares the technical specifications for Mini Programs to integrate AI Skills. A close look reveals many design details hidden within.

Official skill documentation guide 👇🏻:

https://developers.weixin.qq.com/miniprogram/dev/ai/best-practices.html

From an architectural perspective, those familiar with AI development will immediately recognize it: it's essentially MCP (Model Context Protocol). The `mcp.json` declares the function and parameters of each atomic interface, and `SKILL.md` describes how the entire business process runs. This is almost identical to the MCP+Skills architecture found in Claude, Cursor, or VS Code. WeChat didn't reinvent the wheel; it directly adopted the industry-standard that is currently converging.

In the guidance, WeChat provides a clear "attention weight" system. When the AI decides which interface to call and what parameters to generate, it gives the highest priority to the content returned by the interface (five stars), followed by the interface description (four stars) and parameter descriptions (four stars) in `mcp.json`. `SKILL.md` ranks last (three stars). This means where developers write something matters more than what they write—the weight the AI gives to the same rule is completely different if it's written in the interface return versus in `SKILL.md`.

At the interface return level, there is a core specification: a two-stage "Fact + Action" format. First, tell the AI "what happened," then tell it "what to do next." If only the action is written without the fact, the AI might interpret "display card" as "prepare to call the next interface" and skip user confirmation. This is a rule learned after stepping into many pitfalls.

Fourth, parameter passing prioritizes using IDs over natural language. Taking the "Coffee Ordering" scenario in the diagram as an example, after the user states a need, the AI understands the vague intent and handles selection, modification, specification changes, and payment processing, all without leaving the chat dialog.

This design reveals a signal: WeChat has already run enough practical cases, knows where the pitfalls lie in AI calling external services, and has solidified these experiences into developer norms.

In fact, comparing WeChat Mini Programs, which are also known for their "ecosystem," to Apple's apps, WeChat possesses a kind of "God's-eye view" over its own ecosystem. This is the prerequisite for all this implementation.

Why It Might Be More Important Than Apple's AI

This year at WWDC, Apple released the new Siri AI. Despite having Google Gemini integrated at the underlying level and supporting natural language creation for Shortcuts, it didn't spark much discussion.

A closer look reveals the gap: Apple is making AI coordinate some native functions within the iOS system. Once it involves third-party applications—those apps installed on your phone—it becomes strained.

Take Ele.me, for example. Its code runs on Ele.me's own servers, which Apple cannot read. For Siri to call Ele.me, Ele.me's engineers must proactively connect to the App Intents API, negotiating and integrating one by one, a time-consuming and labor-intensive process.

What WeChat is doing is enabling AI to directly operate millions of third-party services, because Mini Programs are different. The code for every Mini Program, from developer submission, through WeChat's review, to finally running on the user's phone, remains entirely within WeChat's technical system throughout the entire process. During the review phase, WeChat can scan the code, automatically analyzing "what pages this Mini Program has, what it can do, what its inputs and outputs are."

This is why "Automatic Mode" is possible—developers don't need to write a single line of code. They just flip a switch, and WeChat itself can translate your Mini Program into a tool the AI can invoke. WeChat's foundational architecture naturally supports this. It possesses a "God's-eye view," enabling scheduling based on centralization.

Apple does not have this architectural advantage, and neither does Google.

Also noteworthy is the recent rumor that WeChat is collaborating with Huawei, Honor, Xiaomi, OPPO, and vivo to launch A2A (Agent-to-Agent) assistant capabilities, allowing users to directly initiate WeChat audio/video calls or send messages via their phone's voice assistant.

Internally, WeChat AI can invoke millions of Mini Programs. Externally, smartphone manufacturers' AI assistants can invoke WeChat. WeChat is becoming the super connector of the AI era, a service hub that all AIs can access.

The Old Prophecy of "WeChat OS"

When Mini Programs were launched, many joked that WeChat wanted to become "WeChat OS." Back then, it was more of a figure of speech—Mini Programs replaced some app functionalities but were essentially a "light application platform."

More coincidentally, the centralized review mechanism designed at the time was for quality and security control. Nine years later, this design, initially criticized as "excessive control," has unexpectedly become an infrastructure advantage in the AI era. The distributed App ecosystem (Apple/Android) seemed more "free" at the time, but now it has become an obstacle to AI integration.

An old prophecy, due to the emergence of new-era technology—AI—has taken on a transformative change.

Previously, when writing about OpenClaw and Feishu, I mentioned a judgment: IM (Instant Messaging) is the most natural entry point for AI Agents because dialogue itself is the most natural interaction between humans and AI, and the service ecosystem (bots, payments, mini-programs) inherent to IM allows AI not only to "chat" but also to "do." Feishu is already moving in this direction, launching enhanced Bot APIs and AI Agent nodes.

However, Feishu is an enterprise collaboration tool, covering work scenarios. WeChat has a distinctly different breadth—1.432 billion monthly active users, hundreds of Mini Programs across niche sectors, covering almost all service needs of a person's daily life, from ordering takeout to hospital registration to buying flight tickets to paying utility bills.

If WeChat AI can indeed smoothly invoke these Mini Programs to complete tasks, then, as the prophecy said, it becomes an operating system operated with natural language.

A user says, "Help me book a high-speed rail ticket from Beijing to Shanghai for tomorrow at 3 pm." The AI deconstructs the intent, invokes the 12306 Mini Program to check tickets, select seats, completes the order via WeChat Pay, all without leaving WeChat. This path can theoretically be executed today.

Of course, there is distance between theory and reality. AI invocation involving payment scenarios requires a near-zero error tolerance—ordering the wrong coffee is a minor issue, but buying the wrong flight ticket is a major one. The accuracy requirements for the underlying model are far higher than for conversational scenarios. This is also a common bottleneck facing AI Agent implementation globally: the gap between "able to chat" and "able to get things done" is not measured by technical metrics but by trust.

But WeChat has at least done one thing right: it didn't build a service network from scratch. Over the years, what ChatGPT has been doing is first having a smart brain, then connecting one by one to Shopify, DoorDash, Stripe, each connection built from the ground up. To this day, transaction-related queries still account for less than 3%.

The real change that is about to happen might be imperceptible to most users. One day, you type into WeChat, "Help me book a ticket to Shanghai for 9 pm tonight," and then it's done. You might not even know which Mini Program was called in the background or what payment process was followed.

This "imperceptible completion" is the true mark of a mature AI Agent. WeChat is closer to this step than anyone else.

This article is from the WeChat Official Account "APPSO", author: APPSO discovering tomorrow's products

相关问答

QWhat is the core change that WeChat announced regarding its AI and Mini Programs?

AWeChat announced that Mini Program developers can now grant permission for its AI to read, operate, and call upon their Mini Programs. This allows AI to directly execute tasks within these programs.

QWhat is the key architectural advantage WeChat has over Apple's iOS in integrating AI with third-party services?

AWeChat has a 'God's-eye view' due to its centralized ecosystem. Every Mini Program's code runs within WeChat's technical framework, from submission to review and execution. This allows WeChat to automatically analyze and translate a program's functions for AI use without requiring developers to write new code for the AI integration.

QWhat industry-standard architecture did WeChat adopt for its AI Skill system, and what does it allow?

AWeChat adopted the MCP (Model Context Protocol) architecture for its AI Skill system. This industry-standard approach allows its AI to understand the functions and parameters of Mini Program interfaces and the overall business workflow, enabling seamless AI-agent interaction.

QAccording to the article, what is the 'final sign of true maturity' for an AI Agent like WeChat's?

AThe final sign of true maturity is 'unconscious completion.' This is when a user makes a request in natural language (e.g., 'Book me a ticket to Shanghai for 9 PM tonight'), and the AI completes the task using the necessary services and payment processes in the background without the user needing to know which specific Mini Program or flow was used.

QWhy does the article suggest WeChat's move might be more significant than Apple's new Siri AI announced at WWDC?

AIt's more significant because Apple's Siri AI is largely limited to coordinating native iOS functions. To integrate with third-party apps, each developer must manually adapt their app using Apple's App Intents framework. In contrast, WeChat's architecture gives it direct, centralized access to millions of third-party Mini Programs, allowing for broader and more immediate AI-powered service automation for its vast user base.

你可能也喜欢

历史底部信号再现?估值3亿的Messari以1000万贱卖

加密数据平台Messari曾估值3亿美元,近期以约1000万美元被竞争对手Blockworks收购,标志其八年创业历程结束。该公司衰落部分源于AI技术冲击——传统需耗时数周的研究报告如今可借AI工具快速生成,导致其核心业务价值锐减。 Messari的处境并非个例。2025年至2026年间,加密行业众多不发币、依赖产品服务营收的公司陷入困境:数据平台DappRadar、Parsec相继关停,CoinGecko寻求出售;媒体CoinDesk、Bankless大幅裁员或低价被购;链上数据公司Dune也进行了裁员。行业收缩浪潮明显。 风险投资(VC)领域同样遇冷。加密基金数量减半,新基金募资额骤降至峰值期的12%,投资额在半年内暴跌超80%。资本与人才大量流向AI领域,连Multicoin Capital等知名加密基金创始人也转向AI。有投资人形容当前环境为“大灭绝”。 然而,极端悲观信号集聚或暗示底部临近。比特币自高点跌近50%,恐慌贪婪指数长期处于“极度恐惧”区间;比特币长期持有者占比逼近80%,历史上类似情况常对应市场底部。VC交易活跃度回落至2020年水平,而当时正是新一轮牛市前夜。部分机构如Dragonfly Capital已逆势募资,Blockworks也正低价整合行业资产。历史显示,当多个底部信号共振后,往往孕育着下一轮周期起点。

marsbit31分钟前

历史底部信号再现?估值3亿的Messari以1000万贱卖

marsbit31分钟前

谷歌TPU出货量,上修50%

近期,多家海外机构上调了谷歌TPU的出货预期,将2027年需求预测从1000万颗上修至1500万颗,增幅达50%。这一变化扭转了市场对算力硬件的保守看法,并带动整条配套产业链需求同步提升。 谷歌TPU采用标准化全光互联架构,硬件配套关系固定。其中,NPO光引擎与TPU芯片按1:1匹配,光模块、OCS光交换、服务器电源、光纤及液冷等环节的需求均随芯片规模增长而确定增加。 液冷成为核心受益方向。因新一代TPU功耗大幅提升,风冷已达物理极限,谷歌集群已全面转向液冷方案。预计2026年为放量元年,下半年开始大规模交付。同时,海外厂商面临技术迭代慢、产能不足的瓶颈,为国产液冷厂商让出替代窗口。凭借快速迭代和稳定交付能力,国内企业正切入谷歌供应链,行业迎来“业绩提速+格局洗牌”的双击行情。预计伴随TPU出货量从2027年的1500万颗增长至2028年的3000-3500万颗,专属液冷市场规模将从千亿级突破至3000亿级。 光纤赛道逻辑亦被重塑。AI算力中心建设催生海量光纤需求,但光纤预制棒扩产周期长,导致供需缺口持续扩大。全球云厂商为锁定货源纷纷签订长期协议,使光纤价格与出货趋稳,摆脱周期性波动。国产光纤凭借产能与成本优势,预计2026年出口量将达2-3亿芯公里,占据全球AIDC需求的半壁江山。 此外,1.6T光模块、OCS光交换、服务器电源等配套环节均将受益于TPU放量,需求持续扩容。投资重心正从芯片算力博弈转向基础设施配套的确定性增量,产业链未来两年业绩确定性进一步增强。

marsbit1小时前

谷歌TPU出货量,上修50%

marsbit1小时前

币圈故事退潮后,华尔街真正想要的是什么

币圈故事退潮后,华尔街正将传统金融的核心资产与业务系统性地迁移至区块链上,其目标并非投机或去中心化叙事,而是构建一套可控、生息且合规的链上金融基础设施。 核心动向包括: 1. **资产代币化**:以贝莱德的BUIDL基金为例,它将短期美国国债等低风险资产代币化,提供链上即时结算与每日复投,成为链上金融的基石资产。过户代理机构Securitize即将上市,并与纽交所合作,旨在建立全天候的链上股票清算系统。 2. **波动率变现**:针对比特币等波动资产,贝莱德、高盛等机构推出备兑看涨期权ETF(如BITA),通过系统性卖出期权将波动转化为稳定的月度现金收益,将其包装为标准化的生息产品,以吸引传统大型资金。 3. **稳定币支付与清算**:稳定币正被定位为高效的支付与结算工具。Stripe支持商户用稳定币收款,万事达卡升级系统支持稳定币进行跨时区清算,连SWIFT也计划推出基于分布式账本的跨境清算方案,旨在释放被冻结的巨额结算准备金,提升效率。 4. **监管与合规驱动**:美国《GENIUS法案》等监管框架将合规稳定币明确定义为“支付工具”(禁止派息)并纳入强监管,使其成为美元金融体系的可编程延伸。 总之,华尔街正利用区块链技术的可编程性与效率,在链上复制并优化国债、期权、清算网络等传统金融产品与服务,每一步都紧密依托美元信用与现有监管体系,旨在打造一个更高效且由其主导的新金融管道。

marsbit1小时前

币圈故事退潮后,华尔街真正想要的是什么

marsbit1小时前

交易

现货
合约

热门文章

如何购买PEOPLE

欢迎来到HTX.com!我们已经让购买ConstitutionDAO(PEOPLE)变得简单而便捷。跟随我们的逐步指南,放心开始您的加密货币之旅。第一步:创建您的HTX账户使用您的电子邮件、手机号码注册一个免费账户在HTX上。体验无忧的注册过程并解锁所有平台功能。立即注册第二步:前往买币页面,选择您的支付方式信用卡/借记卡购买:使用您的Visa或Mastercard即时购买ConstitutionDAO(PEOPLE)。余额购买:使用您HTX账户余额中的资金进行无缝交易。第三方购买:探索诸如Google Pay或Apple Pay等流行支付方法以增加便利性。C2C购买:在HTX平台上直接与其他用户交易。HTX场外交易台(OTC)购买:为大量交易者提供个性化服务和竞争性汇率。第三步:存储您的ConstitutionDAO(PEOPLE)购买完您的ConstitutionDAO(PEOPLE)后,将其存储在您的HTX账户钱包中。您也可以通过区块链转账将其发送到其他地方或者用于交易其他加密货币。第四步:交易ConstitutionDAO(PEOPLE)在HTX的现货市场轻松交易ConstitutionDAO(PEOPLE)。访问您的账户,选择您的交易对,执行您的交易,并实时监控。HTX为初学者和经验丰富的交易者提供了友好的用户体验。

1.3k人学过发布于 2024.03.29更新于 2026.06.02

如何购买PEOPLE

相关讨论

欢迎来到HTX社区。在这里,您可以了解最新的平台发展动态并获得专业的市场意见。以下是用户对PEOPLE(PEOPLE)币价的意见。

活动图片