More and More 'Model Supermarkets' Are Opening: ByteDance, Alibaba, and Tencent Compete to Integrate

marsbit发布于2026-04-24更新于2026-04-24

文章摘要

Chinese tech giants like ByteDance, Alibaba, and Tencent are accelerating the rollout of integrated AI model subscription services—dubbed “model supermarkets”—to provide developers with bundled access to multiple leading domestic large language models (LLMs). ByteDance’s Volcengine recently upgraded its "Coding Plan" by adding newer models like GLM-5.1, Minimax M2.7, and Kimi k2.6, allowing subscribers to use various top models under a single monthly fee starting at ¥40. However, user feedback reveals significant issues, including rapid consumption of usage limits (e.g., hitting caps within hours), frequent server errors (like HTTP 429), and slow response times during peak hours. Complaints about misleading deduction rates—where calls to advanced models consume more quota—are also common. The trend is industry-wide: Alibaba, Tencent, and Baidu have all launched similar multi-model coding plans. While these platforms reduce trial costs for developers, they also expose challenges in balancing affordability with service quality and computational stability. Amid this shift, independent AI companies like Zhipu, MiniMax, and Moonlight Face (Kimi) are developing strategies to avoid becoming mere “pipes” in this ecosystem—focusing on vertical applications, autonomous agents, and long-context models to retain competitiveness. Analysts suggest that, while platform aggregation may pressure model firms in the short term, specialized and vertical AI capabilities will remain differentia...

ByteDance's Volcano Engine recently officially launched GLM-5.1 in its Coding Plan, with the official statement claiming "aligned with the original full capabilities, no purchase limits." Prior to this, Volcano's Coding Plan had long only offered older models like GLM-4.7. This update not only introduced GLM-5.1 but also integrated multiple latest domestic large models such as Minimax M2.7, Kimi k2.6, and DeepSeek-V3.2.

This means developers can call upon multiple leading models simultaneously with just one subscription fee. Market feedback indicates that this "bundled model" significantly reduces developers' trial-and-error costs. Currently, the Lite plan is priced at 40 yuan per month, and the Pro plan at 200 yuan per month, making many developers willing to "buy a spot first."

Zhipu's GLM-5.1 itself demonstrated impressive engineering capabilities in an update in early April 2026. In two official videos released by Zhipu, "Building a Linux Desktop from Scratch in 8 Hours" and "655 Iterations, Increasing Query Throughput of the Vector Database to 6.9 Times the Initial Official Version," it redefined public imagination regarding large models' "8-hour effective execution."

Journalist's On-the-Ground Visit to Developer Community: Majority of Users Report "Not Durable"

Upon entering a Volcano Coding developer exchange group, the journalist found that alongside posts sharing experience feedback, a large number of users reported a gap between expectations and actual experience. Scrolling through a few pages of the exchange community revealed numerous posts complaining and requesting refunds, with many netizens exclaiming "feel cheated."

The controversies mainly focus on two points:

One is the issue of usage limits being consumed too quickly. A user named "Hakimi" posted saying "a few rounds of dialogue in one task and the 5-hour limit is almost used up." Another netizen shared that the reason their "5-hour limit was triggered" was because the account had a continuous sliding window over 5 hours, with the actual number of requests exceeding 6004, surpassing the system limit.

The second is the decline in experience due to computational resource scheduling pressure. Many users reported encountering 429 errors (too many requests) and "first-character delays of over one minute during peak hours being the norm." One user bluntly stated: "The 5-hour limit triggers too frequently, making it unusable for serious development."

Simultaneously, behind the low price of 40 yuan per month for the Coding Plan, there is also a hidden "undercurrent" regarding different deduction coefficients for "a single call request" within the plan. For example, a user posted an image in the developer exchange group showing the "differences in deduction coefficients for calling different models." For instance, the Doubao series and Qwen series have a deduction coefficient of 1, the DeepSeek series is 2, and the MiniMax-M2.7, Kimi-K2.6, and GLM-5.1 series are 5.

This also reflects that building a "model supermarket" is not as easy as imagined. Developers are attracted by the "cost-effectiveness," but the shortcomings exposed initially in areas like computational resource scheduling have caused many developers to hesitate after trying it out. This also reveals the growing pains of the "bundled model" in its early stages. As users flock in, the carrying capacity of the computing platform faces challenges. Finding a sustainable balance between attracting users with low prices and maintaining service quality will be a long-term challenge for Volcano Engine and its followers.

Cloud Vendors Collectively Shift to "Model Supermarkets": Initial Signs of Stratification and Solidification

This "integrative" update by Volcano Engine's Coding Plan is not an isolated incident.

Since early 2026, mainstream cloud vendors like Alibaba Cloud, Baidu Intelligent Cloud, and Tencent Cloud have all been advancing multi-model integration layouts. For example, Alibaba Cloud, as an industry pioneer, earlier launched the multi-model subscription package "Bailian Coding Plan," currently supporting the Qwen series, kimi-k2.5, glm-5, MiniMax-M2.5, and other models. Currently, the Pro price is 200 yuan per month, and the Lite package stopped new purchases from March 20th and stopped renewals and upgrades from April 13th.

Tencent Cloud's large model Coding Plan subscription service was fully updated in March 2026, supporting multiple latest models including Tencent HY 2.0 Instruct, GLM-5, Kimi-K2.5, and MiniMax-M2.5. Baidu Qianfan officially launched its AI coding subscription service, Coding Plan, in February 2026, also one of the较早 (relatively early) domestic cloud vendors to offer such services.

The "model supermarket" model is not a choice of just one company but is becoming a track where cloud vendors are racing to layout. However, tearing open the aggregation strategy of cloud vendors, whoever can provide more stable services, more transparent quota rules, more flexible disaster recovery mechanisms, and whoever can extend beyond programming to more enterprise-level service capabilities, and whether the renewal rate can keep up, all become new core competitive factors.

Internationally, Amazon Bedrock and Microsoft Azure's model aggregation service platforms, though different in scenarios from the domestic Coding subscription model, belong to the same integration trend.

Overall, industry competition is shifting from "single model capability competition" to "platform integration capability + ecosystem service capability" competition, and industry concentration will rapidly increase.

Wang Kai, Chief Asset Allocation Analyst at Guosen Securities, told reporters that although industry differentiation is accelerating, judging the integration period might be slightly premature. "More accurately, this is the refinement and iteration of industry chain分工 (division of labor). Model vendors focus on algorithms, cloud vendors focus on engineering delivery, each leveraging their main business advantages." He believes that regardless of whether other cloud vendors follow suit, the competitive landscape will evolve from individual efforts to ecological niche differentiation.

Increased Pressure for Large Model Companies to Become "Pipelined"?

So-called "pipelining" does not mean model companies disappear, but rather that they lose product premium, user connection rights, and discourse power, with profits shifting towards the computing platform side, becoming a "dominated" role.

Under the aggregation wave of cloud vendors, "pipelining" is also becoming a Sword of Damocles hanging over the heads of independent large model companies. In this silent game, leading players like Zhipu AI, Moonlight Shadow (Kimi), and MiniMax have not chosen passive compromise but have grown from their genes, offering different breakout paths.

Zhipu AI CEO Zhang Peng, in a public dialogue on April 8th, clearly stated that Zhipu's ultimate goal is never to become a "replaceable calling tool" but to build a fully autonomous agent. This positioning attempts to upgrade Zhipu from a "model supplier" to a "task executor," thereby bypassing the low-price trap of pure API pipelines.

Moonlight Shadow (Kimi) adopts a strategy of "decentralized layout + deep cultivation of long text." It synchronously accesses multiple mainstream cloud platforms like Volcano Engine and Alibaba Cloud, achieving multi-source computational power supply, avoiding being bound to a single channel, and ensuring service stability and cost control. Kimi K2.6, launched in April 2026, adopts a Mixture of Experts (MoE) architecture with a standard context window of 256K tokens.

MiniMax focuses its core investments on vertical fields such as content creation, intelligent customer service, education, enterprise services, and entertainment socializing, with key layouts in scenarios like game AI, digital humans, and multimodal interaction, creating "customized capabilities difficult for cloud platforms to replace."

Will platform integration by major vendors accelerate the "pipelining" of model companies? Wang Kai, Chief Asset Allocation Analyst at Guosen Securities, believes it is necessary to distinguish between short-term and long-term perspectives.

"In the short term, distribution channels being controlled by the platform, partial ceding of pricing power, and profits of model vendors shifting to the entry point side are business norms. But in the long run, general models are prone to homogenization; deep learning models in vertical scenarios like finance, healthcare, and law have professional barriers that cannot be erased simply by centralized aggregation." he said.

In terms of responding to the risk of being platformized, strategies from OpenAI and Anthropic can be referenced. On one hand, strengthen channels that directly face end-users, such as the independent operation of ChatGPT and Claude, which essentially establishes user connections bypassing platforms. On the other hand, the speed of technological iteration and user brand recognition are two effective moats, so model companies need to balance R&D investment with productization layout.

The final outcome of this game of "pipelining vs. platformization" might not be about who eats whom, but a further clarification of division of labor. Cloud vendors act as pipes, model companies focus on technology, and both sides gradually find their respective survival boundaries in the game.

As for who eats whom, at this stage, it is far from the end of the story.

This article is from the WeChat public account "Sci-Tech Innovation Board Daily," author: Wang Nai

你可能也喜欢

一个芯片公司发布AIDC储能认证标准，英伟达凭什么？算力重构供电逻辑，谁在抢跑谁被挡在门外？

英伟达近期发布了《储能系统自我认证指南》，为AI数据中心（AIDC）的储能系统设立了一套严格的认证标准。该标准重点关注储能变流器（PCS），提出了10项硬性技术指标和12项实测仿真要求，包括动态响应、遥测频率和控制透明度等，并将测量精度卡在电压±0.2%、电流±0.2%。这意味着储能行业以往拼产能和成本的竞争模式在此标准下失效，转而强调系统的快速控制与软硬件协同能力。随着AI算力激增，数据中心功耗呈现阶跃式暴涨，传统供电方案已无法满足需求，储能成为AIDC顶层设计的关键。英伟达此举实质是重新定义数据中心供电逻辑。标准中要求厂商提交过往PCS交付记录及24个月内实现10倍扩产的可执行计划，这直接将许多中小厂商挡在门外。AIDC储能认证涉及多维度测试，周期长、成本高，门槛显著提升。目前，西门子等巨头已联合Fluence等合作伙伴抢跑市场布局。尽管未来AIDC将催生数百GWh的储能需求，但新标准画定了新的起跑线。能否跨过这道门槛，取决于厂商的控制算法、交付能力和扩产实力，而非简单的电池容量堆砌。游戏规则已变，行业格局面临重塑。

marsbit38分钟前

一个芯片公司发布AIDC储能认证标准，英伟达凭什么？算力重构供电逻辑，谁在抢跑谁被挡在门外？

marsbit38分钟前

想买的人已经买完了：SpaceX散户狂欢退潮，真正的抛压还在8月

SpaceX上市后经历剧烈波动，散户狂热迅速退潮，真正的抛售压力可能在八月后集中释放。 SpaceX于6月12日上市，开盘价150美元，随即引发散户疯狂买入，首周散户净买入达4.05亿美元，甚至超过其他大型科技股的总和。激进交易者试图通过期权交易推高股价，导致其市值一度短暂超越微软，冲至225美元高点。然而，动量很快消散。随着散户日度资金流断崖式下滑，股价连续三日下跌。周一更因公司计划发行巨额债券的消息，单日暴跌16.4%，市值蒸发6000亿美元，股价跌回150美元开盘价附近。盘后交易一度触及该价位，意味着二级市场买入者可能全部被套。关键风险在于股票锁定期即将结束。目前市场上仅有5%的流通盘，而根据解禁时间表，到9月初，内部人最多可能抛售高达44%的股份，流通盘将扩大近九倍。分析指出，“想买的人已经买完了”，在缺乏新买盘而潜在卖压巨大的情况下，未来推高股价将愈发困难。此次SpaceX的剧烈震荡被视作市场情绪的风向标，其由狂热到冷却的过程，可能预示着由散户和动量资金推动的整体科技股行情面临考验。

marsbit43分钟前

marsbit43分钟前

20 岁创始人，招 18 岁员工，被 19 岁的人投资

这篇文章探讨了当前人工智能（AI）领域出现的“年轻化”现象。随着大模型技术的快速发展，行业对AI人才的争夺异常激烈，并出现了明显的年龄偏好。许多顶尖高校的应届生或在校实习生能够获得高达数百万人民币的年薪或数千元的日薪，其薪酬水平远超传统互联网行业的资深从业者。文章指出，这种现象背后是“AI Native”概念的兴起。企业认为，更年轻、从小接触AI的从业者（多为00后）在思维模式、学习速度和创新能力上更具优势，能更快适应技术的快速迭代。相比之下，拥有多年工作经验但转型较慢的从业者可能面临被淘汰的风险。因此，大厂和初创公司都竞相提前锁定优秀学生，通过高薪、重要项目主导权、灵活扁平的管理模式以及密集的社交活动来吸引他们。这种趋势形成了一个围绕“年轻”的利益循环：年轻研究员凭借成果获得高薪或创业机会；年轻的创业者更容易获得年轻投资人的青睐和融资；而成功的年轻团队又倾向于雇佣和信任更年轻的成员。这在一定程度上冲击了传统的、基于资历和经验的行业规则，但也加剧了人才竞争和行业内部分化。文章最后也反思了这种狂热背后的隐忧，即技术革命在厚赏“非凡”年轻人的同时，也可能对无法快速适应的“平凡”从业者施以严厉的惩罚。

marsbit47分钟前

marsbit47分钟前

美光Q2财报前瞻，花旗调高目标价

美光将于6月24日公布2026财年第三财季业绩。财报公布前，花旗将其目标价从840美元上调至1200美元，维持买入评级，主要依据是2026年内存价格强于预期及毛利率处于高位。当前股价已接近目标价，市场焦点转向其盈利假设能否兑现。花旗将美光2027财年每股收益（EPS）预期上调至114.73美元，其核心假设是内存价格持续上涨。报告预计，2026年DRAM平均售价将上涨200%，NAND上涨186%，其中服务器端涨幅更为显著。价格上涨源于供给偏紧，特别是HBM（高带宽内存）产能挤占了传统DRAM资源，以及需求端受AI服务器、数据中心等驱动。长期采购协议（LTA）被视为平滑盈利周期波动的潜在因素，但其具体条款和对估值的实际支撑作用仍有待观察。报告同时提示风险：若行业扩产加速、AI资本开支放缓或HBM产能爬坡不及预期，可能削弱当前的价格和毛利率假设。花旗给出了牛市目标价1400美元和熊市目标价400美元，显示出市场对存储周期持续性的分歧。本次财报的关键在于管理层对未来供需、HBM定价及盈利指引的展望。

marsbit52分钟前

marsbit52分钟前

错过20倍后，我找到了AI投资的笨办法

错过20倍涨幅后，作者反思AI投资不应盲目追逐热点，而应致力于构建系统的产业认知，即建立“知识仓”。他认为AI是长赛道，当前可能处于早期阶段，未来最大的机会或许在泡沫破裂后或未被关注的角落。盲目追涨或等待崩盘都非良策，关键在于提前深入研究整个产业链。文章将AI产业链分为四层：算力基础设施（如芯片、制造、光互连、散热）、模型与工具层、中间件平台以及垂直应用层，并指出能源是终极约束。作者强调，越底层的环节（如基础设施）通常玩家越少、定价权越强，而当前市场关注点多在第一层（如英伟达），第二、三圈层（如光互连、散热等）可能仍有未被充分定价的机会。未来研究将聚焦几个关键问题：算力需求从训练转向推理的影响、巨额资本投入的回报周期、产业链中后段的机会，以及地缘政治的影响。作者计划逐层剖析，弄清每个环节的商业模式、竞争格局和估值，为未来可能出现的市场机会做好准备。最终指出，真正的投资直觉源于深厚的研究积累，“错过”往往是因为“研究太少”，系统性的知识储备是在AI长赛道中把握机会的基础。

marsbit58分钟前

marsbit58分钟前

交易

现货

合约

More and More 'Model Supermarkets' Are Opening: ByteDance, Alibaba, and Tencent Compete to Integrate

文章摘要

Journalist's On-the-Ground Visit to Developer Community: Majority of Users Report "Not Durable"

Cloud Vendors Collectively Shift to "Model Supermarkets": Initial Signs of Stratification and Solidification

Increased Pressure for Large Model Companies to Become "Pipelined"?

热门币种推荐

相关问答

你可能也喜欢

一个芯片公司发布AIDC储能认证标准，英伟达凭什么？算力重构供电逻辑，谁在抢跑谁被挡在门外？

想买的人已经买完了：SpaceX散户狂欢退潮，真正的抛压还在8月

20 岁创始人，招 18 岁员工，被 19 岁的人投资

美光Q2财报前瞻，花旗调高目标价

错过20倍后，我找到了AI投资的笨办法

交易

热门文章

如何购买S

Sonic：Andre Cronje主导升级，逆势上涨的Layer1新星

成长学院：学习“ Sonic“ ，瓜分价值 1000 USDT

相关讨论

热门问答

热门分类

热门标签