Tremble, Humans: AI Continues Its Accelerated Sprint

marsbit | Published on 2026-06-13 | Last updated on 2026-06-13

Abstract

Yes, AI is still accelerating rapidly. Deep learning seemed to stall soon after its early boom, but large models, after years of development, show no sign of hitting their ceiling. At the 2026 BAAI (Zhiyuan) Conference, the focus was on moving AI from the digital world into the physical world. Scaling Law remains effective and continues to drive advances in both large language models and multimodal models. The industry is now entering a phase of pursuing World Models, though unresolved technical routes and data issues mean this exploration may take 3-5 more years. Meanwhile, breakthroughs in Agents are accelerating AI's real-world application in fields like healthcare and meetings. Making Agents truly useful requires hardware-software co-design, as the strong presence of chip vendors at the conference showed. We stand at a new historical threshold where AI is becoming a foundational force reshaping the world. The first day of the conference highlighted AI's evolution from "knowing how to chat" to "knowing how to work." Scaling Law persists, World Models are the next key battleground, and Agents are moving from usable to useful. Scaling Law is not ending but diversifying. New models like Anthropic's Fable 5 demonstrate scaling through parameter size, synthetic data, and reinforcement learning. Advancements in AI Coding and Agent deployment are enabling a trend of AI self-evolution, poten...

That's right, AI is still in an accelerated sprint.

In 2016, deep learning had only been exploding for a year before it almost stagnated. In 2026, after four years of explosive growth, large models still haven't hit their ceiling.

At the 2026 BAAI Conference, Guangzhui Intelligent observed that from models to software/hardware to products, everything is striving for AI to 'run' from the digital world into the physical world.

On one hand, Scaling Law continues to function steadily, propelling the ongoing development of large language models and multimodal models. The AI industry has entered a phase of pursuing World Models. However, issues like current technical routes and data remain unresolved, likely requiring at least 3-5 more years of exploration.

On the other hand, breakthroughs in Agents are accelerating the deployment of AI in real-world scenarios. As Agents have reached a usable stage, the industry is advancing their application in areas like healthcare and meetings. To transition Agents from usable to useful, software-hardware co-design has become key. At the exhibition booths of the BAAI Conference, chip manufacturers occupied 'half the room,' with nearly all leading domestic AI chip companies present.

"We are standing at a new historical inflection point. Artificial intelligence is no longer just a tool transforming a specific industry but is becoming the underlying force reconstructing the world. AI Coding, autonomous agents, and model self-evolution are opening up possibilities for creating AI. World Models, embodied intelligence, and robotics are extending intelligence from the digital world to the physical world," said Wang Zhongyuan, President of the Beijing Academy of Artificial Intelligence (BAAI).

What exactly is happening within this wave of reconstruction?

On the first day of the BAAI Conference, the guests offered this answer: AI is moving from 'being able to chat' to 'being able to work.' Scaling Law persists; World Models, whose technical directions have not yet converged, are the focus of the next phase; and Agents have started the transition from usable to useful, with many optimization challenges remaining.

AI Has Not Hit Its Technical Ceiling, And Has Learned Self-Evolution

Over the past year, as high-quality internet text data was being exhausted, a pessimistic sentiment spread throughout the industry that 'Scaling Law is about to peak.'

In multiple forums at the BAAI Conference, the question 'Has the Scaling Law dividend diminished?' was frequently raised. Several guests denied this notion.

"I still firmly believe scaling is far from over," said Wang He, Founder and CTO of Galaxy Universal. "Looking back today, Scaling Law hasn't failed; it has just become more diversified."

Scaling continues to show its effect on a series of newly released large language models. Analyzing Anthropic's recently released Fable 5, Luo Fuli from Xiaomi suggested this model itself is a product of scientifically advancing scaling. It is the result of extending large models by combining three dimensions: parameter scale, synthetic data, and reinforcement learning.

"We speculate that Fable 5's parameter scale itself is likely several times that of the current largest open-source models. Additionally, it involved significant computational investment in Test-Time Scaling or reinforcement learning. Furthermore, synthetic data generated by humans and agents brought the data scale to a new order of magnitude," said Luo Fuli.

In the multimodal field, performance improvements brought by scaling are equally significant. Zhu Jun, Founder and Chief Scientist of Shengsheng Technology, stated that data quality, model size, and large-scale training all enhance model performance. With improved foundational model capabilities, models also learn physical laws and understand 3D scenes more efficiently.

While scaling remains effective, AI Coding is maturing and Agent deployment is accelerating. Together these are making a trend of AI self-evolution evident: AI is moving from writing code to autonomously carrying out product iterations and updates.

"The foundation of the vast human digital world is largely constructed through code. With AI Coding making substantial progress and becoming mainstream, it means AI could gradually take over everything in the digital world," said Wang Zhongyuan.

Globally, using AI for product updates has become the norm.

"If the model determines an agent's capabilities, then the Harness determines the upper limit of those capabilities," said Li Jingqiu. "Its difficulty lies in further improving problem clarification, verification, and feedback on top of the model."


World Models: The Next Key Battleground for Large Models

Pushing outward along the boundaries of the digital world, World Models have become the next key battleground for large models.

"Currently, no single world model truly feels particularly impressive or solves all kinds of problems in the real physical world," said Wang Zhongyuan.

World Models are at an early stage of development, and the industry has not reached full consensus on the technologies involved. With technical routes not yet converged, a series of problems remain unresolved. Taking data as an example, Wang Zhongyuan noted that it is still unclear whether video data, simulation data, or real-world physical data is needed; no clear methodological path has been found.

Taking Galaxy Universal as an example, Wang He introduced their application of synthetic data at the event.

"Before the WAM (World Action Model) paradigm emerged, we conducted extensive experiments within the VLA paradigm using synthetic data, specifically for grasping tasks," said Wang He. "We used 1 billion frames of simulation data to prove: as long as you scale the data to this extent, you can achieve complete zero-shot learning. Give me any object in the real world, and it can handle the grasp."

As for how quickly World Models will mature, BAAI predicts that 'at least several more years' are needed. The next three to five years will likely be a phase of continuous evolution and iteration for World Models.

Over the past few years, various world models with different technical routes have emerged in the industry, each progressing distinctively.

Taking multimodal world models as an example, Zhu Jun stated that video models and world models are closely related because world models need three capabilities: understanding and interpreting states, prediction, and action. Among currently accessible training data, video data is most relevant to world models.
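The three capabilities Zhu Jun lists (understanding states, prediction, action) can be read as an interface. The following is only a minimal sketch under stated assumptions: the names `State`, `WorldModel`, and `PointMassModel` are hypothetical and do not come from any system mentioned at the conference, and a hand-written point-mass simulation stands in for what a real world model would learn from data.

```python
from dataclasses import dataclass
from typing import Protocol


@dataclass(frozen=True)
class State:
    """Toy world state: a point mass on a line (hypothetical, for illustration)."""
    position: float
    velocity: float


class WorldModel(Protocol):
    """The three capabilities named above, written as a hypothetical interface."""

    def encode(self, observation: tuple[float, float]) -> State: ...
    def predict(self, state: State, action: float) -> State: ...
    def act(self, state: State, goal: float) -> float: ...


class PointMassModel:
    """Hand-written stand-in; a real world model would learn these mappings."""

    def __init__(self, dt: float = 0.1) -> None:
        self.dt = dt

    def encode(self, observation: tuple[float, float]) -> State:
        # Understanding: interpret a raw observation as a state.
        position, velocity = observation
        return State(position, velocity)

    def predict(self, state: State, action: float) -> State:
        # Prediction: next state when `action` is applied as an acceleration.
        velocity = state.velocity + action * self.dt
        position = state.position + velocity * self.dt
        return State(position, velocity)

    def act(self, state: State, goal: float) -> float:
        # Action: choose the candidate whose predicted position is nearest the goal.
        candidates = (-1.0, 0.0, 1.0)
        return min(
            candidates,
            key=lambda a: abs(self.predict(state, a).position - goal),
        )
```

In this toy setting, `PointMassModel().act(State(0.0, 0.0), goal=1.0)` picks the acceleration that moves the mass toward the goal. The point is only that action selection is built on top of prediction, which in turn needs a state representation.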

With various technical routes diverging and industry consensus yet to form, BAAI classifies world models into four categories:

First, language-centric world models, mapping other modalities and abilities into language space, including LLMs, VLMs, VLAs, etc.

Second, pixel-centric world models. Video generation essentially predicts the next frame; video generation models are related to world models but not equivalent to them. The World Action Model (WAM), which may become very popular this year, is evolving from this pixel-centric perspective.

Third, 3D structure-centric world models, including 3D reconstruction which focuses purely on the three-dimensional world.

Fourth, visual representation-centric world models.

Currently, BAAI is exploring a 'fifth' path – the fusion of language-centric and visual representation-centric approaches, namely latent space representation. This involves compressing information like text and images into a vector space to represent various states of the real physical world.

"Future unified latent space modeling will not be limited to visual space but encompass full-modal latent space. This is highly likely to be the true next possible path for world models," said Wang Zhongyuan.

At the conference, BAAI introduced the world model it is developing – WuJie · Physis-v0.1. Centered on physical space modeling to predict the next physical state, it is positioned as the world's first general-purpose world foundation model, emphasizing four key capabilities: 'physically correct, causally traceable actions, long-term temporal consistency, and general-purpose generalization.'

Currently, this model is still in the training phase. BAAI will continue to share progress in the second half of the year and will open-source the model upon training completion.

From 'Usable' to 'Useful': Agents Face More Challenges

On the model side, progress in World Models drives the realization of physical AI; on the product side, Agents (Intelligent Agents) become the key products for AI to enter public life.

Since 2025, dubbed the 'Year of the Agent,' some impressive Agent products have emerged, showing signs of taking off. However, the unexpected surge in popularity of 'Lobsters' this year still came as a surprise.

Last year's agents mostly just executed instructions. This year's agents are clearly more proactive and capable, and can take the initiative in carrying out more complex tasks for users.

At this year's BAAI Conference, BAAI also released four vertical-focused agents: BAAI Cardiac Agent, the world's first auxiliary diagnosis agent for cardiac magnetic resonance, aiding doctor decision-making by integrating multimodal capabilities and medical expertise; the autonomous research agent AREX for the scientific research field; SoulAgent, an agent helping users listen to meetings in real-time and capture key points; and a risk discovery agent targeting hazardous protein acquisition.

For example, regarding the meeting-listening agent, Guangzhui Intelligent tested its ability to summarize different meeting contents. SoulAgent did provide simple summaries of meeting content. While not as complete as minutes, the core viewpoints were accurate. This is particularly suitable for situations where parallel forum sessions overlap.

However, current agents still face numerous technical issues requiring further optimization. Yang An, President's Chair Professor at Nanyang Technological University, mentioned that to maintain and enhance agent capabilities, the most crucial aspects currently are related to context engineering, such as Memory, orchestration, etc.

At the agent sub-forum, 'Harness' became a high-frequency keyword. The term literally means a horse's harness and refers to the entire engineering framework or environment built around an agent. It received little attention last year but gained significant popularity this year.

For example, if relying solely on the model to understand a problem, limitations are inevitable. The Harness needs to elaborate and enrich the user's simple one-sentence instruction so the model can better comprehend the requirement. This requires the Harness to leverage intent understanding. After receiving the task, it must design the subsequent workflow and then orchestrate the model to execute it. This process may require human intervention and correction, followed by checks before task completion.

In short, like a real human assistant, every detailed step requires product refinement for the Harness to further improve the Agent's execution effectiveness.
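The Harness steps described above (enrich the instruction, design a workflow, orchestrate the model, allow human correction, check before completion) can be sketched roughly as follows. This is an illustrative sketch, not any vendor's actual Harness: every name here (`clarify`, `plan`, `run_harness`, `TaskResult`) is hypothetical, the 'model' is any function from a prompt to text, and a fixed three-step plan stands in for real workflow design.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional

# Assumption: a "model" is simply a function from a prompt to a text reply.
Model = Callable[[str], str]


@dataclass
class TaskResult:
    steps: list[str]
    outputs: list[str] = field(default_factory=list)
    passed: bool = False


def clarify(instruction: str, context: dict[str, str]) -> str:
    """Enrich a one-sentence instruction with known context (intent understanding)."""
    details = "; ".join(f"{k}={v}" for k, v in sorted(context.items()))
    return f"{instruction} [{details}]" if details else instruction


def plan(task: str) -> list[str]:
    """Design the workflow. A fixed plan stands in for real planning logic."""
    return [f"draft: {task}", f"revise: {task}", f"finalize: {task}"]


def run_harness(
    instruction: str,
    context: dict[str, str],
    model: Model,
    review: Optional[Callable[[str, str], str]] = None,
    check: Optional[Callable[[list[str]], bool]] = None,
) -> TaskResult:
    task = clarify(instruction, context)
    result = TaskResult(steps=plan(task))
    for step in result.steps:
        output = model(step)  # orchestrate the model on each workflow step
        if review is not None:
            output = review(step, output)  # optional human intervention and correction
        result.outputs.append(output)
    # Final check before the task is reported as complete.
    if check is not None:
        result.passed = check(result.outputs)
    else:
        result.passed = bool(result.outputs)
    return result
```

Keeping `review` and `check` as optional callbacks is one possible design choice: it lets the same loop run fully automatically or with a person in the loop, which matches the article's point that the process 'may' require human intervention.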

Currently, Agents are still in the early stages of development. It is foreseeable that this industry has immense room for growth. Both improvements in model capabilities and solidification of engineering details will continue to enhance Agents' task-handling abilities.

This article is from WeChat Official Account: Guangzhui Intelligent, Author: Focus on Frontier Technology
