6 Questions to Understand the Business Trends of AI

marsbit | Published on 2026-05-31 | Last updated on 2026-05-31

Abstract

The AI industry has entered its "summer" phase, according to a six-dimensional scoring framework assessing its development cycle. Each dimension—narrative vs. delivery, system connectivity, delivery capability, ROI rationalization, common industry trends, and capital environment—scores 1 point, totaling 6 points. This places the industry firmly in summer (5-7 points), characterized by a coexistence of grand promises and tangible deliverables, with increasing pressure to demonstrate value and profitability. Key signals mark this shift. ByteDance's Doubao launched paid subscriptions, while OpenAI introduced an advertising platform. These moves are driven by dual forces: immense cost pressures from scaling user bases and massive compute requirements, and the maturation of commercial opportunities. Major players like Anthropic report explosive growth, highlighting AI's transition into core productivity infrastructure. For businesses, the path forward involves three strategic steps. First, identify a small, high-impact use case to quickly demonstrate a closed-loop value proposition, such as automating customer service or content generation. Second, systematically replicate successful pilots across the organization by standardizing processes, building shared AI capabilities, and aligning talent, incentives, and leadership. Finally, move beyond simply adding AI to existing workflows and undertake systemic reconstruction—redesigning processes for parallel AI-human collaboration, im...

Note-taker's Notes:

The AI scene has been very lively lately.

Anthropic has become the fastest-growing company in human history, with annualized revenue soaring from $1 billion at the end of 2024 to $47 billion in May 2026. Yesterday, it completed its Series H financing round of $65 billion at a post-money valuation of $965 billion, surpassing OpenAI to become the world's highest-valued AI company, and it is expected to launch an IPO this fall.

DeepSeek's valuation has surged to $45 billion, with the National Integrated Circuit Industry Investment Fund leading the first round of financing of 70 billion yuan (about $10 billion), and the list has basically been finalized.

Kimi completed a $2 billion financing round, with a post-investment valuation exceeding $20 billion. It has raised over $3.9 billion cumulatively in half a year, becoming the financing champion among domestic large-scale model startups.

StepFun (Jieyue Xingchen) completed nearly $2.5 billion in financing, dismantling its red-chip structure to sprint for a Hong Kong IPO.

ByteDance raised its 2026 AI infrastructure investment from 160 billion yuan to over 200 billion yuan. Bloomberg further reported that its total capital expenditure ceiling might reach $70 billion (about 500 billion yuan).

Doubao launched three tiers of paid subscriptions on May 4th, firing the first shot to end the era of free domestic AI.

If we follow the four seasons, what season is today's AI economy in? Is it spring, summer, or the transition between spring and summer? Or is it about to enter autumn, like the bubble period rumored outside?

The answer actually lies within cycles.

Today, we will use a six-dimensional scoring framework for cycle judgment proposed by Professor Su Dechao from the School of Philosophy at Wuhan University, founding advisor of Notesman PPE Academy, and instructor of Western Philosophy courses, to clarify this matter thoroughly.

I. Quantitative Scoring of Six Dimensions for Cycle Judgment

Many people like to use the elimination method to judge industry cycles: it's not winter, spring has long passed, autumn doesn't seem to have arrived yet, and finally they arrive at a correct but useless statement: summer. The elimination method only gives you a seemingly correct answer but doesn't tell you why it's correct.

Truly useful judgment must be quantified from specific dimensions.

The "Six-Dimensional Scoring Framework for Cycle Judgment" scores an industry on six dimensions: "Narrative vs. Delivery, System Connectivity, Delivery Capability, ROI (Return on Investment) Rationalization, Industry Common Phenomena, and Capital Environment." Each dimension scores from 0 to 2 points, and the higher the total score, the later the industry is in its cycle.

Let's score them one by one.

1. Narrative vs. Delivery: From Storytelling to Looking at the Ledger

This is the first dimension, and the one where change is easiest to perceive.

When ChatGPT first emerged in 2022, everyone said, "AI will change everything." But no one asked you what you could specifically do or how much cost you could save.

Storytelling was enough; that was spring.

The situation today is different. Doubao introduced paid subscriptions, with three tiers of prices clearly written on the page—68 yuan, 200 yuan, 500 yuan—focusing on paid features like PPT generation, data analysis, and video production. It's not storytelling; it's delivering specific capabilities and then charging for them.

OpenAI's advertising platform is even more direct: advertisers buy ad space within ChatGPT and pay per click. On May 5th, the self-service ad management tool launched testing, removing the $50,000 minimum spending threshold, allowing even SMEs to advertise directly.

"AI will change the advertising industry in the future" is narrative; "giving you an advertising channel now" is delivery; they are two different things.

The narrative is still there; people are still saying things like "AI changes the world," but delivery already occupies a considerable proportion.

For this item, score 1 point.

2. System Connectivity: From Islands to Protocols

In spring, every AI product was an island. If you wanted to integrate ChatGPT into your company's system, you had to write a set of adaptation code yourself; changing to another model required rewriting it again.

In April 2026, Google released the Gemini Enterprise Agent Platform, integrating agent management into enterprises' existing workflows.

Microsoft Copilot is embedded in the Office suite, Amazon's AI shopping assistant opened sponsored answers to brands, and their respective boundaries are loosening.

Connectivity has to reach a certain level before this kind of simultaneous opening-up becomes possible.

Partial protocolization has been achieved now, but standardized protocols have not yet become mainstream.

For this item, score 1 point.

3. Delivery Capability: From Occasional Help to Stable Work

In spring, AI was like an intern: occasionally helpful, but more often you spent a long time fixing its output.

Doubao has over 300 million users. As of March 2026, its daily Token call volume exceeded 120 trillion, representing over a 1000-fold growth compared to May 2024, and doubled again in the past three months.

OpenAI has 900 million weekly active users, 50 million individual subscription users, and over 9 million enterprise users.

What are these enterprises using AI for? Writing code, reviewing contracts, automatically generating marketing copy, handling customer service tickets: all scenarios that free people from repetitive labor. This is no longer trial-scale use; it is using AI to do real work.

Delivery has been realized. But has delivery capability become a core competitive advantage? Hard to say. The story is still being told, but enterprises that can stand firm by stable delivery are not yet mainstream.

For this item, score 1 point.

4. ROI Rationalization: From Unclear Calculation to Starting to Calculate

In spring, no one calculated ROI. How much computing power was invested, how much output was obtained? Unclear.

Now people are starting to calculate: Tencent Hunyuan API prices increased significantly (according to industry news, some models increased by over 400%), driven by the hard pressure of computing power costs.

A single inference request for a GPT-level model costs about 0.01-0.03 yuan in computing power. When call volumes reach hundreds of millions or billions, costs balloon into astronomical figures. Doubao's high-frequency calls from 345 million monthly active users force ByteDance to face this issue.
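To see how quickly that adds up, here is a rough back-of-the-envelope sketch in Python. The per-request cost range is the 0.01-0.03 yuan figure quoted above; the daily request volumes are illustrative assumptions, not reported numbers.

```python
# Per-request compute cost range quoted in the article (yuan).
COST_PER_REQUEST_YUAN = (0.01, 0.03)


def daily_cost_range(requests_per_day):
    """Return the (low, high) daily compute cost in yuan for a given request volume."""
    low, high = COST_PER_REQUEST_YUAN
    return requests_per_day * low, requests_per_day * high


# Hypothetical daily request volumes, chosen only to illustrate the scaling.
for requests in (100_000_000, 1_000_000_000, 10_000_000_000):
    low, high = daily_cost_range(requests)
    print(f"{requests:>14,} requests/day: "
          f"{low / 1e6:,.0f} to {high / 1e6:,.0f} million yuan/day")
```

Under these assumptions, one billion requests a day would already cost roughly 10 to 30 million yuan daily in compute alone.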

Zhipu AI raised prices three times within the year; Alibaba Cloud canceled the basic plan for the Bailian platform. Behind these decisions lies the same logic: Products that cannot clearly calculate ROI are about to collapse.

But is ROI clear and calculable? A few closed loops have begun to show ROI, but the metrics are vague. Some calculate Token costs, some calculate efficiency improvements, some calculate customer acquisition conversion; standards are not unified.

For this item, score 1 point.

5. Industry Common Phenomena: From No One Talking About Profits to Some Starting to Doubt

In spring, everyone was expanding; no one talked about profitability. Burning money for growth was the default model.

Now, Doubao launched a charging model; large model vendors like Zhipu AI and Moonshot AI raised prices, which in itself is an admission that costs are unsustainable and must be charged for.

But has the model of burning money for growth been widely rejected? Not yet. Capital is still investing and leading vendors are still expanding, just at a slower pace; everyone is starting to do the math.

For this item, score 1 point.

6. Capital Environment: From Random Valuations to Starting to Feel Pressure

In spring, financing was extremely easy, valuations were thrown around casually, and a slide deck alone could raise tens of millions of dollars.

OpenAI launched an advertising platform, with an ad revenue target of $2.5 billion this year. The company's total revenue in 2025 was $13 billion, but it incurred a loss of $8 billion.

Although OpenAI's valuation is $852 billion, Anthropic, which came from behind, had an annualized revenue exceeding $30 billion as of April, surpassing OpenAI's $25 billion for the first time. Therefore, in the private secondary market, its valuation was chased to nearly $1 trillion, also exceeding OpenAI's.

OpenAI faces huge cost pressures; the story of growing big and strong before raising more money can't be told anymore; it must make money.

Has the financing winter arrived? Not yet. Leading manufacturers can still raise money, but the valuation logic has changed: from looking at imagination space to looking at revenue capability.

For this item, score 1 point.

Six dimensions, each scoring 1 point, total 6 points. According to the framework, 0-4 points is spring, 5-7 points is summer, 8-10 points is autumn, 11-12 points is winter.
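The tally above can be written out as a small Python sketch. The season bands and the six 1-point scores come from the text; the function name and dictionary keys are only illustrative.

```python
# Season bands as stated in the article: 0-4 spring, 5-7 summer,
# 8-10 autumn, 11-12 winter. Each entry is (upper bound, season).
SEASON_BANDS = [(4, "spring"), (7, "summer"), (10, "autumn"), (12, "winter")]


def judge_season(scores):
    """Sum six dimension scores (0 to 2 each) and map the total to a season."""
    if len(scores) != 6:
        raise ValueError("expected exactly six dimensions")
    for name, value in scores.items():
        if not 0 <= value <= 2:
            raise ValueError(f"{name}: score must be between 0 and 2")
    total = sum(scores.values())
    for upper_bound, season in SEASON_BANDS:
        if total <= upper_bound:
            return total, season


# The scores assigned in this article: 1 point per dimension.
article_scores = {
    "narrative_vs_delivery": 1,
    "system_connectivity": 1,
    "delivery_capability": 1,
    "roi_rationalization": 1,
    "industry_common_phenomena": 1,
    "capital_environment": 1,
}

print(judge_season(article_scores))  # (6, 'summer')
```

Raising any two dimensions from 1 to 2 moves the total to 8, which falls in the autumn band.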

What is the state of summer?

Narrative and delivery coexist. The imagination space is still there, but the ledger is already on the table. Capital can still invest, but starts asking about returns. Users are still growing, but start to stratify; some are willing to pay, some only use free services.

There are local signs of entering autumn. If signals like Doubao charging and OpenAI advertising continue to amplify, gaining another 2 points—for example, ROI becomes clear, and capital universally demands delivery—then autumn has truly arrived.

Summer is the stage where narrative and delivery coexist, but delivery is becoming increasingly important. Everyone knows the story still needs to be told, but the acceptance criteria after the story are quietly changing to: what exactly have you delivered?

II. Why Now: 3 Signals, 2 Drivers

The Six-Dimensional Cycle Judgment scoring is a static analysis. What happened recently from a dynamic perspective?

On May 6th local time, Anthropic founder Dario Amodei said: "Our growth rate exceeds exponential. Revenue and usage in the first quarter of this year achieved 80x annual growth, welcoming explosive growth. We are providing more computing power as fast as possible at an unprecedented speed."

This is the company with the fastest revenue growth in human history. The day when AI truly becomes productivity infrastructure may have already arrived.

In the same week, two signals appeared simultaneously: Doubao charging and OpenAI selling ads.

On the surface, it's a coincidence, but underneath run two threads: cost pressure and commercialization opportunities.

Signal One: Doubao Charging

Why is Doubao charging? High-frequency calls from over 300 million users have turned computing power costs into a problem that must be faced squarely.

As of March 2026, daily Token call volume exceeded 120 trillion, representing over a thousand-fold growth compared to May 2024, having doubled again in the past three months.

According to public calculations by Zheshang Securities cited by multiple media outlets, ByteDance's capital expenditure in 2025 was about 160 billion yuan, of which about 90 billion yuan was used for AI computing power procurement, with the rest for infrastructure and network equipment.

The free model really can't hold on much longer. A cost estimation circulated in the tech community shows that hardware depreciation accounts for about 58% of a single inference cost, and electricity costs about 29%. The larger the user base, the higher the cost.

Converting 120 trillion Tokens based on discounted public API prices is equivalent to daily revenue that should be 300 to 500 million yuan.

But now? C-end revenue is zero.

A thousand-fold growth resulting in zero—you can't find a second case like this among Chinese internet companies in the past 15 years.
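As a sanity check on the 300 to 500 million yuan estimate, the sketch below backs out the per-million-token price it implies. Only the 120 trillion daily tokens and the revenue range come from the text; the function name is illustrative, and the resulting price is a derived figure rather than a quoted one.

```python
# Figures taken from the article.
DAILY_TOKENS = 120_000_000_000_000       # 120 trillion tokens per day
DAILY_REVENUE_RANGE_YUAN = (300_000_000, 500_000_000)


def implied_price_per_million(daily_revenue_yuan, daily_tokens=DAILY_TOKENS):
    """Price per million tokens (yuan) that would produce the given daily revenue."""
    million_token_units = daily_tokens / 1_000_000
    return daily_revenue_yuan / million_token_units


low, high = (implied_price_per_million(r) for r in DAILY_REVENUE_RANGE_YUAN)
print(f"Implied price: {low:.2f} to {high:.2f} yuan per million tokens")
```

The estimate therefore corresponds to a blended price of about 2.5 to 4.2 yuan per million tokens.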

At the same time, domestic token prices entered a sustained upward trend. Zhipu AI raised API prices: the GLM5.1 API went up by 10%, and the overseas version more than doubled; Alibaba Cloud canceled the Bailian platform's basic plan; GLM5.0, MiniMAX 2.5, and Kimi 2.5 ended free public testing.

But there is also a price-cutting side: DeepSeek V4-Pro slashed its price to 25% of the original (a 75% cut), or 0.25 yuan per million tokens; the price of Alibaba Cloud's Tongyi Qianwen visual understanding model dropped by over 80%; the Doubao 2.0 Lite input price is only 0.6 yuan per million tokens.

Large model vendors are stratifying: one end raising prices, the other lowering prices.

Pressure and opportunity, two drivers.

Signal Two: OpenAI Selling Ads

Why is OpenAI selling ads? Half pressure, half opportunity.

Pressure side: The company's revenue in 2025 was $13 billion, with a cash loss of $8 billion. 50 million individual subscription users and 9 million enterprise users correspond to annual revenue of tens of billions of dollars, but computing power costs, R&D costs, and operating costs combined exceed this figure. Subscription revenue is insufficient to cover costs; the numbers don't add up.

Opportunity side: According to industry observation estimates, the ad pilot generated an ARR (Annual Recurring Revenue) of $100 million in less than two months after launch. The ad monetization potential of 900 million weekly active users is huge.

According to authoritative institution forecasts, Meta's full-year ad revenue will exceed $243.46 billion, higher than Google's $239.54 billion, indicating that monetization through advertising still has a huge market.

The forecast OpenAI shows investors is: $2.5 billion in ad revenue in 2026, reaching $100 billion by 2030.

This is a forced choice driven by costs and an active choice driven by opportunity. OpenAI is targeting not just cost coverage, but this large market.

AI is not free; GPUs, electricity, engineers, bandwidth—all cost money. The larger the user base, the higher the cost. In spring, you could burn investors' money to sustain; in summer, you must find users to pay.

But summer also means commercialization paths have opened: advertising, subscriptions, tiering—monetization methods are much richer than in spring.

The underlying logic of two signals appearing at the same point in time is: user scale has reached a critical point, cost pressure forces charging decisions, and commercialization opportunities mature enough to monetize.

Summer is this stage: the ledger is laid out, but the window hasn't closed yet.

III. Entering in Summer, How Can You Run Faster?

Clearly, AI has reached "summer"; it has transitioned from "usable" to "truly capable of helping you save money and make money."

How is this determined? Let's give a few examples:

Semir's designers used to take at least three days to produce one render. Now, using AI tools, they produce a render in 30 seconds, viewing the render directly without sampling.

Designer Lin Jianxia's exact words were: "Unsatisfactory schemes are directly eliminated, without wasting sampling costs."

AI improved Semir's overall design and R&D efficiency by 35%, and pattern design speed by over 200%. In 2025, AI brought Semir direct value: 200 million yuan in new revenue and 20 million yuan in cost reduction.

Anta's "Linglong" design large model, trained on tens of millions of apparel and footwear data accumulated over thirty years, can generate 56 inspiration schemes in minutes. The designer team can complete line drawing and generate high-definition renderings within one day.

With AI collaborating with the team, the tennis shoe project, from initiation to final model confirmation, took no more than 40 days, significantly faster than the traditional 3-month design cycle.

Peacebird (Taipingniao) achieved intelligentization across the entire marketing chain. Starting from understanding the business goal of "increasing GMV for new autumn children's wear products," AI can autonomously identify high-potential users, generate personalized product recommendations and marketing content, and push them to sales consultants' enterprise WeChat with one click.

Ultimately, the new series' click-through rate increased by 90%, payment conversion rate increased by 20%, and new product GMV surged by 31%.

Midea Group has formed an AI R&D team of over 400 people, with more than 13,000 intelligent agents running daily across multiple scenarios such as residential, office, manufacturing, healthcare, warehousing, and logistics. In 2025 alone, AI saved Midea 700 million yuan in costs and more than 15 million hours of work.

What do these cases illustrate? AI is transitioning from "embellishment" to "main force."

Having seen the examples, what should be done next?

In a nutshell: start by applying AI to a series of small problems, then gradually build up to large-scale AI systems.

Specifically, take three actions.

The First Action: Find a Minimum Entry Point, Achieve a Value Closed Loop

Don't start by thinking "company-wide AI transformation"; that's the biggest pitfall. 80% of companies' AI implementation failures stem from being too ambitious, seeking completeness, and pursuing AI for AI's sake.

How to do it? Remember three words—Small, Accurate, Fast:

Small: Select 1-2 scenarios with "clear pain points, standardized processes, and sufficient data" to start with. For example, AI intelligent customer service, finance/administration automation, marketing material generation, contract compliance review—these belong to "high value, low threshold, quick results."

Accurate: Before launching each scenario, first establish the business baseline for the next 3-6 months, clarify how benefits are calculated, and how success is defined. The core evaluation metrics must be quantifiable financial results, not technical self-congratulatory metrics like "model accuracy rate, response speed."

Fast: If no clear results are seen within 3 months, iterate quickly or shut down; never force it. Set a stop-loss line in advance for each AI project. If preset business goals are not met for two consecutive cycles, shut down directly.
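The stop-loss rule in "Fast" is simple enough to write down as code. The following is a minimal Python sketch, assuming each review cycle is recorded as met or not met; the function name and the input format are hypothetical, while the two-consecutive-misses threshold comes from the text.

```python
def should_shut_down(cycle_results, max_consecutive_misses=2):
    """Apply the stop-loss line to a project's review history.

    cycle_results: list of booleans, oldest first; True means the preset
    business goal was met in that cycle. Returns True once the goal has
    been missed for `max_consecutive_misses` cycles in a row.
    """
    misses = 0
    for goal_met in cycle_results:
        misses = 0 if goal_met else misses + 1
        if misses >= max_consecutive_misses:
            return True
    return False


print(should_shut_down([True, False, False]))  # True: two misses in a row
print(should_shut_down([False, True, False]))  # False: misses not consecutive
```

Fixing the threshold before the project starts is the point: the decision to stop is made by the rule, not renegotiated after the fact.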

The key to this step is to achieve a closed loop, giving the team confidence and the boss determination, so the subsequent path can be walked.

The Second Action: From Pilot to Replication, Accumulating Organizational Capabilities

A single successful scenario doesn't qualify as transformation; it's just a pilot at best. What truly creates a gap is whether you can turn the success of one point into something the entire company knows how to do.

Don't rush to expand a successful scenario; first solidify the approach: how prompts are written, which tasks are assigned to AI, which personnel must oversee, what pitfalls are common, how success is calculated—write it into a set of standard procedures, then promote it to similar business lines.

Build "two infrastructures": First, a shared AI capability platform, so that departments do not each explore from scratch; sales can directly use the AI data capabilities proven in finance, and R&D can reuse marketing's user insight models. Second, a prompt knowledge base, categorized and shared by scenario, with rewards for those whose prompts are used most.

Then, how can the organization keep up? Semir internally repeatedly emphasizes one sentence: AI implementation is 70% a people problem, only 30% a technical problem. While replicating scenarios, three things must be synchronized; missing any one won't work:

Talent Pipeline: It's not just hiring a few algorithm engineers. Three layers are needed. The top layer involves the boss or core executives personally leading. The middle layer consists of "translators" who understand both AI boundaries and business pain points. The grassroots level involves frontline employees being able to use AI tools to solve their immediate problems.

Incentive Mechanism: All incentives must be tied to quantifiable AI implementation results, with direct profit-sharing from the increased revenue and cost savings brought by AI; the payout cycle should be short, monthly or quarterly, allowing people to quickly feel "using AI = more money"; most crucially, incentives must also reach frontline executors—they are the ultimate users of AI tools; without their participation, AI will never be implemented.

Organizational Structure: Top leadership must personally lead, bringing in the heads of business, technology, finance, and HR to work together rather than letting the IT department fight alone; also, incorporate cooperation on AI implementation into each department head's performance assessment, and let those assessments do the talking for anyone who stays on the sidelines.

Simply put, this step turns "one person's success" into "an organization's muscle."

The Third Action: Systematic Reconstruction, from Adding AI to Redoing with AI

Multiple scenarios are successful, and the organization has kept up. Next, it's not about "pasting AI onto old processes" anymore, but letting AI redo the processes—this is the large system.

Process Reconstruction: Serial to Parallel. The old way was "one person finishes, passes to the next," proceeding serially. In the AI era, this is completely obsolete. It must be changed to multiple people and multiple AIs working simultaneously:

Before meetings, let AI simulate all parties' stances first, exposing logical flaws and resource conflicts in advance; formal meetings only resolve real disagreements, cutting meeting time by 70% directly.

Real-time Alignment Dashboard: Stop writing weekly reports. Everyone, including AI, updates status on the same dashboard, with AI responsible for monitoring inconsistencies. If you say "high cost-performance" but set a high-end price, the dashboard marks it red immediately, discoverable the same day, without waiting for the review meeting two weeks later.

Upon receiving a requirement, don't rush to do it; first use AI to paraphrase your understanding and have the other party confirm. If even AI misunderstands, it means the requirement itself is unclear; nip rework at the source.

Automated Trigger Chains are also crucial: User complains -> AI drafts a reassuring reply -> sent to customer service for review -> synced with the brand work group;

ROI drops -> AI finds the cause -> suggestions pushed to the person in charge;

Inventory almost out -> AI calculates the replenishment quantity -> pushed to supply chain. All of this can operate even if no one is monitoring.
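One way to picture these trigger chains is as a lookup from an event to an ordered list of steps. The Python sketch below is only an illustration: the event keys, the data layout, and the `run_chain` function are hypothetical, and a real system would call models and messaging tools instead of returning strings.

```python
# Each chain is an ordered list of (actor, action) steps, mirroring the
# three examples in the text. Event names are hypothetical identifiers.
TRIGGER_CHAINS = {
    "user_complaint": [
        ("AI", "draft a reassuring reply"),
        ("customer service", "review the draft"),
        ("brand work group", "receive the synced case"),
    ],
    "roi_drop": [
        ("AI", "identify the likely cause"),
        ("person in charge", "receive suggestions"),
    ],
    "low_inventory": [
        ("AI", "calculate the replenishment quantity"),
        ("supply chain", "receive the replenishment request"),
    ],
}


def run_chain(event):
    """Return the numbered steps that fire for an event, in order."""
    if event not in TRIGGER_CHAINS:
        raise KeyError(f"no trigger chain defined for event: {event}")
    return [
        f"{number}. {actor}: {action}"
        for number, (actor, action) in enumerate(TRIGGER_CHAINS[event], start=1)
    ]


for line in run_chain("user_complaint"):
    print(line)
```

In every chain the AI acts first and a person appears only at a review or decision step, which is the division of labor the text describes.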

In summary: Processes should change from being pushed by people to "when something happens, AI automatically runs, people only make decisions."

Finally, summarize these three actions: First find a pain point to achieve a closed loop, proving AI can make or save money; then solidify the successful approach and expand it, matching people and incentives; finally, let AI redo the processes, turning small problems into a large system. First run through a point, then spread into an area, finally let AI redo the whole game.

This article is from WeChat public account "Notesman" (ID: Notesman), author: Lao Jia

Related Questions

Q: According to the six-dimension scoring framework, what season is the AI economy currently in, and what are the key characteristics of this season?

A: The AI economy is currently in 'summer' with a total score of 6. Key characteristics include: narrative and delivery coexisting, but delivery becoming increasingly important; capital still investing but starting to ask for returns; user growth continuing but with stratification (some willing to pay, some using free services); and the window for commercialization being open but not yet fully realized.

Q: What are the two key signals mentioned in the article that indicate the shift in the AI industry's phase, and what underlying drivers do they represent?

A: The two key signals are Doubao launching paid subscriptions and OpenAI selling advertising. They represent two underlying drivers: cost pressure (forcing companies to monetize due to high computing and operational costs from massive user scale) and commercialization opportunity (the market potential for monetization through subscriptions, advertising, and tiered services has matured).

Q: What are the three main actions suggested for companies to effectively implement AI during the current 'summer' phase?

A: 1. Find a minimal starting point and run a value closed-loop: Focus on a small, clear pain point with standardized processes and sufficient data, aiming for quick, quantifiable financial results within 3 months. 2. Scale from pilot to replication and build organizational capabilities: Standardize successful practices, build shared AI infrastructure and prompt libraries, and align talent development, incentives, and leadership structure to support AI adoption. 3. Systematically restructure workflows: Move from simply adding AI to existing processes to using AI to redesign workflows for parallel execution, real-time alignment, and automated trigger chains, transforming operations fundamentally.

Q: How does the article quantify the 'cost pressure' driving companies like Doubao to introduce paid services?

A: The article quantifies cost pressure through Doubao's user metrics and associated cost estimates. With over 300 million monthly active users and a daily Token call volume exceeding 120 trillion (a >1000x growth from May 2024), the computing costs became unsustainable. Estimates suggest a single inference request costs about 0.01-0.03 RMB in computing power. Scaling this to billions of daily calls creates astronomical costs. Converting the 120 trillion daily Tokens to revenue at discounted API rates suggests potential daily income of 300 to 500 million RMB, contrasting sharply with the zero revenue from the free C-end model, highlighting the financial imperative to monetize.

Q: What specific business benefits are highlighted in the case studies of Semir, Anta, Peacebird, and Midea regarding their AI implementation?

A: Semir: AI improved overall design R&D efficiency by 35% and pattern design speed by over 200%, contributing to 200 million RMB in new revenue and 20 million RMB in cost savings for 2025. Anta: Its 'Linglong' design model can generate 56 inspiration schemes in minutes, reducing the design cycle for tennis shoes from 3 months to under 40 days. Peacebird: AI-driven marketing for a new children's wear line increased click-through rate by 90%, payment conversion rate by 20%, and GMV for new products by 31%. Midea: With over 400 AI R&D staff and 13,000 daily active agents, AI saved 700 million RMB in costs and improved efficiency by over 15 million hours in 2025.

Related Reads

AI Bubble Warning: AI Investments Are Negative Returns for Most Tech Giants

The article issues a stark warning about a potential AI investment bubble. It notes that while the AI boom shares similarities with the TMT bubble of the late 1990s, its scale is vastly larger, currently driving 93% of U.S. GDP growth. Major hyperscale cloud providers like Microsoft, Alphabet, Amazon, Meta, and Oracle are planning to invest trillions in AI data centers over the coming years. However, calculations based on analyst projections for 2025-2030 reveal a concerning math problem: expected capital expenditure growth far outpaces projected revenue growth. Even under an extremely optimistic scenario of zero costs, the implied return on investment for most of these tech giants (except Amazon) is deeply negative. This suggests that the current trajectory could lead to one of history's largest shareholder value destruction events. The piece outlines two potential escapes: AI generating vastly more revenue than currently anticipated—a near-impossible task—or a significant cutback in the planned investment splurge. The latter scenario could trigger a domino effect, severely impacting the entire tech supply chain (from Nvidia to TSMC), potentially pushing the U.S. economy into recession, and causing a major stock market downturn. The author suggests upcoming high-profile IPOs by companies like OpenAI and Anthropic might represent a transfer of risk from early investors to public market participants. While the peak of the hype cycle might sustain investment through 2026, the fundamental financial dilemma remains unresolved, setting the stage for a potential market correction in 2027 or 2028, similar to the years following Alan Greenspan's "irrational exuberance" warning.

marsbit41m ago

AI Bubble Warning: AI Investments Are Negative Returns for Most Tech Giants

marsbit41m ago

From Tokens to Machine Labor: AI is Shifting from Tool to "Worker"

The article "From Token to Machine Labor: AI is Evolving from Tool to 'Worker'" argues that the business model for AI is shifting beyond simply selling computational resources (tokens, GPU hours) or model access. Instead, a new "machine labor market" is emerging, where the core economic transaction is the purchase of economically useful work directly performed by software. The central thesis is that AI pricing will evolve through four stages: 1) raw tokens, 2) standardized LLM capabilities (e.g., text generation), 3) industry-specific labor markets (e.g., legal review, radiology), and finally 4) a programmable results market where tasks like resolving a support ticket are bid on and priced based on outcome. In this future, buyers will care less about *which* model or GPU completes a task and more about whether the work meets specified standards for accuracy, latency, and cost. This transition reframes the impact of AI on human labor. Rather than simple replacement, it suggests a re-coordination where machines handle standardized, verifiable work, freeing humans for roles involving oversight, context management, responsibility, and final judgment. In some cases, this "last 1%" of human input becomes more valuable as it enables the other 99% to be automated. Furthermore, as AI reduces the cost of work, demand may expand, creating larger markets (e.g., 24/7 customer service) rather than just cheaper versions of existing ones. The article concludes that while infrastructure (GPUs, models, tokens) remains crucial upstream, the market is converging on a simpler, tradeable unit: machine labor that can be defined, measured, priced, and procured based on contractible specifications.

marsbit51m ago

From Tokens to Machine Labor: AI is Shifting from Tool to "Worker"

marsbit51m ago

Xiaomi MiMo's 99% Price Cut is Not Marketing! Luo Fuli Posts on X to Refute Critics

The price of Xiaomi's MiMo-V2.5 series API has been permanently reduced by up to 99%, specifically for the "Input (Cache Hit)" cost, which covers users re-reading historical context in long conversations. MiMo's head, Luo Fuli, published a detailed technical blog to clarify that this drastic price cut stems from genuine engineering breakthroughs, not a marketing stunt or a simple price war. The core of the achievement lies in six key engineering optimizations. First, the model architecture adopts a Hybrid Sliding Window Attention (SWA), reducing the memory footprint (KVCache) to 1/7th of a traditional model. Second, a dual-pool memory management system actually utilizes these savings, allowing a single GPU to handle over 5 times more concurrent users. Third, an upgraded prefix caching mechanism achieves a cache hit rate of 93-95% for repeated reads, meaning most such requests bypass GPU computation entirely. Fourth, a self-developed distributed cache (GCache) utilizes idle SSD space on existing GPU servers, eliminating additional storage costs. Fifth, an intelligent scheduling system (LLM-Router) efficiently routes requests to maximize cache reuse and performance. Sixth, Multi-Token Prediction (MTP) accelerates the model's text generation ("output") side. Together, these systemic optimizations dramatically lower the real computational cost per request, enabling the 99% price reduction for cached inputs while reportedly maintaining positive gross margins. Luo Fuli's disclosure aims to shift the narrative from "price war" to a demonstration of substantive AI engineering progress.

marsbit2h ago

$26 Billion: An 'All-Chinese Team' Backs the World's Highest-Valued AI Programming Company

Cognition AI, the company behind the AI programmer "Devin," has raised over $1 billion in new funding at a valuation of $26 billion, just eight months after reaching a $10.2 billion valuation. The round was led by Lux Capital, General Catalyst, and 8VC. Founded by three young Chinese entrepreneurs with strong competitive programming backgrounds, Cognition initially gained fame with Devin, marketed as the world's first AI software engineer capable of handling tasks from start to finish. While its early demos were impressive, real-world usage revealed reliability and cost-effectiveness issues, leading to a significant price cut for Devin in 2025. A pivotal moment came when Cognition acquired the assets of AI IDE company Windsurf after a failed acquisition by OpenAI. This move gave Cognition a crucial developer-facing tool, allowing it to pursue a two-pronged strategy: Devin for autonomous task execution and Windsurf for integrated, collaborative coding within an IDE. This shift helped the company move away from the controversial "AI replacement" narrative towards a model of augmenting human engineers, particularly for repetitive or maintenance tasks. This strategic pivot is backed by strong commercial metrics. The company reports a 10x increase in enterprise usage this year, with an annual revenue run-rate of $492 million and a 50% month-over-month growth in enterprise Devin usage over the past six months. Its client list now includes major corporations like Goldman Sachs and Mercedes-Benz, as well as government agencies like NASA and the U.S. Army. Investors are betting on Cognition becoming a foundational piece of next-generation software engineering infrastructure, positioning it at the center of a hybrid future where AI agents and human developers work in tandem.
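As a rough sanity check on the figures quoted above, the short sketch below compounds the reported 50% month-over-month growth over six months and computes the step-up between the two reported valuations. The helper name `compound_multiple` is hypothetical, and the calculation assumes the monthly rate held constant, which the summary does not state.

```python
def compound_multiple(monthly_rate: float, months: int) -> float:
    """Total growth multiple after compounding a constant monthly rate."""
    return (1.0 + monthly_rate) ** months


# Figures taken from the summary above.
usage_multiple = compound_multiple(0.50, 6)   # 50% per month for six months
valuation_step_up = 26.0 / 10.2               # $26B versus $10.2B

print(f"Six-month usage multiple: {usage_multiple:.2f}x")
print(f"Valuation step-up: {valuation_step_up:.2f}x")
```

Under that assumption, six months of 50% monthly growth comes to roughly an 11x multiple, which is in the same range as the reported 10x increase in enterprise usage, although the two figures may measure different things.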

marsbit3h ago

The Hottest 00s Generation on Wall Street

At just 25, Leopold Aschenbrenner, who was fired by OpenAI in 2024, now runs a hedge fund worth $13.7 billion. His strategy is to bet against the consensus. While others chased AI chips, he invested early in the physical infrastructure powering the AI boom: electricity, data centers, and energy. After his dismissal from OpenAI's safety team, Aschenbrenner argued that AI progress would be limited not by algorithms but by power, chip capacity, and space. Acting on this view, he founded Situational Awareness LP to go long on these "old economy" assets. The bets have paid off spectacularly: the fund's assets grew from $255 million in late 2024 to $13.7 billion by Q1 2026. The portfolio directly reflects his thesis, with major long positions in fuel cell company Bloom Energy and in data center and bitcoin mining firms such as CleanSpark and Riot Platforms, which control critical land and power resources. Conversely, he holds massive put options against semiconductor giants such as NVIDIA and AMD, which he considers overheated. A notable exception was his bullish bet on storage company SanDisk, which surged about 160% in Q2. Aschenbrenner's vision is materializing. Tech giants such as Amazon, Alphabet, and Meta are ramping up capital expenditure on data centers. Global data center power consumption is projected to rise sharply, with AI accounting for over half by 2030, and demand for enabling technologies such as optical fiber and optical modules is also surging. His story underscores a basic fact of the AI era: the intelligence of algorithms rests on a physical, heavy, and power-hungry foundation. The future is being built not just in code, but in concrete, copper, and kilowatts.

marsbit5h ago
