5.4 Billion Burned, Sora Dies: Anonymous Chinese Model Kicks Open the Next Door in 38 Seconds

marsbitОпубликовано 2026-04-10Обновлено 2026-04-10

Введение

In March-April 2026, two major events reshaped the AI video generation landscape. OpenAI shut down its flagship model Sora, citing unsustainable daily costs of $15 million and low user retention, effectively exiting the consumer video market. Shortly after, an anonymous Chinese model dubbed "HappyHorse-1.0" topped the blind-test leaderboard on Artificial Analysis with a score of 1357 in text-to-video (without audio), outperforming rivals like ByteDance’s Seedance 2.0. HappyHorse-1.4 seconds to generate 1080p video with audio on a single H100 GPU. Its unified Transformer architecture and distilled diffusion techniques significantly improved efficiency compared to Sora’s costly diffusion-based approach. The model is speculated to be developed by Alibaba or based on Sand.ai’s technology, though its anonymous release suggests strategic data collection and legal risk avoidance regarding copyright and deepfake regulations. Meanwhile, commercial leaders like ByteDance impose high barriers—including million-dollar API contracts and strict compliance checks—to mitigate legal risks, focusing on B2B applications rather than consumer use. Key emerging opportunities include automated e-commerce promo videos, AI-assisted short drama production, and localized ad creation for global markets, all driven by plunging generation costs and faster turnaround times. The competition has shifted from pure model performance to cost efficiency, workflow integration, and regulatory compliance.

From late March to early April 2026, two landmark events occurred in the AI video sector within two weeks.

The first: Sora, once revered as the industry's "white moonlight," was announced to be completely shut down by OpenAI on March 24th—its standalone app, API interface, and video features embedded in ChatGPT were all taken offline. OpenAI has completely exited the consumer-grade video generation market.

The second: Less than two weeks later, on April 7th, an anonymous model codenamed "HappyHorse-1.0" unexpectedly topped the most authoritative AI video blind test leaderboard, Artificial Analysis, with an overwhelming score.

One is a Silicon Valley giant admitting defeat in a daily $15 million money-burning game; the other is an unknown technical dark horse kicking through the long-held top spot on the blind test leaderboard by Chinese teams. These two events, happening in the same time window, seem unrelated but actually point to the same conclusion: The rules of competition in AI video are undergoing a qualitative change—shifting from "whose model is smarter" to "whose computing power is cheaper and whose compliance wall is thicker."

The Truth Behind Topping the Charts: Overwhelming Pure Visuals and a "Specialized" Dark Horse

To judge the quality of a dark horse, first look at the referee.

Artificial Analysis Video Arena is not a self-congratulatory PR leaderboard from manufacturers but an Elo rating voted on by thousands of real users in completely blind tests of generated videos.

HappyHorse-1.0's report card is overwhelming.

In the "Text-to-Video (No Audio)" track, it scored 1357 points (as of April 9th), a full 84 points ahead of the second-place Seedance 2.0 (1273 points). This means users were significantly more likely to choose it over any other model in blind tests. Stepping over it are not only ByteDance but also star products like Kling 3.0 and SkyReels V4.

However, its "specialization" is also a fact. Once audio is included, it scored 1217 points in the "Text-to-Video (With Audio)" track, losing to Seedance 2.0 (1220 points) by just 3 points. In other words, HappyHorse-1.0 kicked through ByteDance's pure visual technical reputation defense line, but Seedance still held its ground in the audio-visual comprehensive experience.

The significance of this chart-topping performance lies more in breaking the market expectation that "domestic video models have solidified"—a new challenger can use a small 15B parameter model to crush all major players in the pure visual dimension.

How is it so fast?

On a single top-tier H100 GPU, it takes only 38.4 seconds to generate a 1080p HD video (with synchronized audio). The confidence in speed comes from the underlying 15 billion parameter (15B) unified Transformer architecture, combined with DMD-2 distillation technology, compressing the inference steps to just 8 steps.

Simply put, traditional large video models are like an "outsourcing team"—a large text model first understands your需求, then hands it over to a diffusion model to "draw," with significant communication loss in between. In contrast, the unified Transformer architecture used by HappyHorse-1.0 is an "all-rounder," processing both text and visual pixels within the same neural network, eliminating cross-modal intermediate loss.

Interestingly, HappyHorse-1.0 was initially questioned as "marketing futures" when it first topped the charts (April 7-8)—its website claimed to be open source, but the GitHub repository and model download links were all 404 or "coming soon." However, on April 9th, multiple media outlets reported that it had officially announced open source, and users could experience it online via text generation and image generation on its official website. It took less than 48 hours to go from "Schrödinger's open source" to actually releasing the weights.

Anonymous Conspiracy: Why Are Big Tech Companies Entering Incognito?

There are currently two mainstream speculations in the industry.

First, it comes from Alibaba's Taotian Group's newly established "Future Life Laboratory," led by former Kuaishou Technology Vice President and Kling AI head Zhang Di.

Second, it deeply draws on the underlying technology of daVinci-MagiHuman from domestic startup Sand.ai—Zhihu user Vigo Zhao compared HappyHorse-1.0's public benchmark data with known models line by line and found them highly consistent. Jiemian News also reported that the "most accepted conclusion in technical circles" is that HappyHorse is an optimized iteration of daVinci-MagiHuman.

The above speculations have not been officially confirmed. However, exclusive news this morning claimed that HappyHorse-1.0 was indeed developed by Alibaba, led by former Kuaishou VP and Kling technical head Zhang Di, who returned to Alibaba in November 2025. Additionally, Alibaba Cloud will soon launch the model on its Bailian platform, and Alibaba's recent organizational adjustments are related to this.

As of publication, Alibaba has not responded officially.

The question is: Since they hold a dragon-slaying sword, why don't big tech companies hold a press conference? Why anonymously混迹 on third-party blind test platforms?

Although there is currently no official explanation,推测 from industry惯例 and business logic suggests at least two layers of planning behind it.

The first layer is free "data harvesting."

The biggest bottleneck in current AI video is the lack of real human preference data. Anonymously空降 a blind test platform is equivalent to having global netizens perform free A/B testing for it. Without spending a penny, it can precisely identify the model's defects in the real world.

The second layer is avoiding the fatal "compliance landmine."

AI video is currently on the火山口 of copyright lawsuits. Releasing实名 before the large model has built digital watermarking and portrait blocking mechanisms can easily attract sky-high claims from Hollywood. Anonymous testing shows off muscles while providing legal physical isolation.

However, from another perspective, HappyHorse-1.0's狂欢 highlights Sora's落寞. Both are in video, so why the polarized fate? Upon closer thought, Sora's exit actually tears open the industry's most bloody scar: a严重倒挂 ROI (Return on Investment).

According to SemiAnalysis estimates, Sora's daily operating cost was as high as $15 million, burning about $5.4 billion a year. Its diffusion model architecture requires rendering about 30 images to generate 1 second of video, but common issues like object distortion and incoherent motion lead to a large number of videos being discarded. The final usability rate is estimated by analysts to be only 5% to 10%.

Producing 1 usable video wasted more than ten times the computing power. When a tool cannot be embedded into users' daily workflows and merely becomes a "novelty toy," no one is willing to pay持续. According to data disclosed by an a16z partner, Sora's 1-day retention rate was only 10%, 7-day was 2%, 30-day was只剩 1%, and 60-day was接近 0%.

Sora, with its $5.4 billion annual cost and cliff-like drop in retention, proved that the route of暴力堆算力 with pure diffusion models is unworkable. HappyHorse-1.0 offers another answer—15B parameters, unified Transformer architecture, 8-step inference, 38.4 seconds on a single card. The gap between the two is not parameter scale but architectural efficiency. The outcome of this architecture war may be more significant for the industry than any single chart-topping performance.

Looking at the Chinese AI giants remaining on the field, they are fighting another battle of computing power economics.

First, look at API call costs:

ByteDance's Seedance 2.0, its API pricing for 1080p pure video generation is 46 RMB / 1 million Tokens. Based on actual tests, generating a 15-second video consumes about 308,880 Tokens. Converted, the cost to generate one second of commercial-grade video is about 1 RMB (approx. $0.14).

This is the commercial reality. For the vast majority of enterprises, directly calling a closed-source API costing about fourteen cents per second is far more attractive than spending millions of RMB on H100 servers to折腾所谓的 "open-source models."

Million-Dollar Frameworks: The Ultimate Barrier for Giant Rivalry

If you think cheap computing power is the only barrier, you are too naive.

To access Seedance 2.0 and use真人 reference images to generate videos, enterprises need to sign an annual prepaid framework contract worth tens of millions.同时, new framework signings must also pay a deposit of 50% of the prepaid amount or 1 million RMB (whichever is higher) as a保证金, which is only gradually released after one year.

This million-dollar threshold is essentially a保证金 that makes enterprises bear the main responsibility—transferring the legal risks of generating deepfake videos (Deepfake) to头部 B-end enterprises with risk resistance capabilities through commercial contracts.

In mid-February this year, a video generated by an Irish director using Seedance 2.0, featuring a realistic fight between Tom Cruise and Brad Pitt on a rooftop, went viral. On February 13th, a cease-and-desist letter drafted by Disney lawyer David Singer was delivered to ByteDance. The Motion Picture Association (MPA) subsequently严厉指控 Seedance 2.0 for "large-scale unauthorized use of copyrighted content," and the actors' union SAG-AFTRA also issued尖锐 criticism for the unauthorized use of members' likenesses.

To protect themselves, giants have set extremely high capital thresholds and enterprise qualification reviews (KYC).

They simply don't care what funny videos ordinary C-end users can make; they want to become the "water, electricity, and coal" of B-end industrialized content production. By monopolizing computing power infrastructure and establishing a strict authorization system, they completely block mid-to-long-tail competitors.

What does the great reshuffle of the post-Sora era leave for the industry?

The underlying infrastructure game of AI video is already a专属牌桌 for重资本,重算力 giants. But what is being博弈 on the table is infrastructure, while golden opportunities are growing in the缝隙 underneath the table.

The core logic is simple: computing power costs are visibly plummeting—from several dollars per second in the Sora era to about 1 RMB per second for today's Seedance 2.0, to theoretically zero marginal cost locally after HappyHorse-1.0's open source. Every time the cost drops by an order of magnitude, it催生 a batch of new commercial scenarios.

Overall, the three most noteworthy directions in the AI video generation field currently might be:

E-commerce带货 video automation. Product promotion videos on domestic short video platforms are still mainly shot manually, costing 500-2000 RMB per piece with a production cycle of 2-5 days. If API computing power compresses this cost to 10-50 RMB and shortens the production cycle to minutes, the entire投放 logic will be rewritten—the volume of test materials can surge from 10 per day to 1000, and the efficiency and accuracy of A/B testing will improve qualitatively.

Short drama industrialized production. Vertical screen short dramas are exploding in the global market, with single-episode budgets typically ranging from 50,000 to 150,000 RMB, but shooting cycles and actor costs are rigid bottlenecks. Although AI video cannot yet replace real acting, it can already replace 30%-40% of shooting work on "non-emotional" shots like scene establishing shots, transition shots, and special effects shots, directly压缩 total production costs.

Overseas advertising localization. The same product投放 in Southeast Asia, the Middle East, and Latin America requires advertising materials in different languages, with different ethnicities and cultural symbols. The traditional method requires teams in multiple countries to shoot separately. AI video can compress this process to be completed by one person on one computer within a day, and the cost hardly increases linearly with the number of markets.

These three directions share a common characteristic: they do not require the model to be number one in跑分, nor do they require movie-grade画质, but they require cost to be low enough, speed fast enough, and stability good enough—and this is precisely the scenario where API calls are more suitable than local deployment.

HappyHorse-1.0 kicked the door open. But standing behind the door is the commercial infrastructure that ByteDance and Kuaishou have been building for two years—computing power supply chains, compliance review systems, and B-end customer networks.

A technical dark horse can win a weekend of applause, but winning the war requires积累 in another dimension. From today, the competition rules for AI video have shifted from "whose model is stronger" to "whose workflow is thicker."(This article was first published on Titanium Media App, author| AGI-Signal, editor| Lin Shen)

Связанные с этим вопросы

QWhat were the two major events that occurred in the AI video generation field between late March and early April 2026?

AFirst, OpenAI announced the complete shutdown of Sora on March 24, removing its standalone app, API, and ChatGPT integration, exiting the consumer video generation market. Second, an anonymous model named 'HappyHorse-1.0' topped the Artificial Analysis blind test leaderboard on April 7 with a dominant score.

QWhat architectural innovation does HappyHorse-1.0 use to achieve fast video generation, and how does it differ from traditional methods?

AHappyHorse-1.0 uses a unified Transformer architecture with 15 billion parameters combined with DMD-2 distillation, reducing inference steps to just 8. Unlike diffusion models that process text and visuals separately (like an 'outsourced team'), it handles both in a single neural network, eliminating cross-modal inefficiencies.

QWhy did OpenAI shut down Sora, and what were its key operational challenges?

AOpenAI shut down Sora due to unsustainable costs and low user retention. It had a daily operating cost of $15 million (about $5.4 billion annually), a video usability rate of only 5-10%, and poor retention rates: 10% after 1 day, 2% after 7 days, and nearly 0% after 60 days, making it economically unviable.

QWhat are the speculated reasons behind the anonymous release of HappyHorse-1.0 on a blind test platform?

AThe anonymous release is likely for two reasons: to harvest free real-world user preference data through blind A/B testing without cost, and to avoid legal risks like copyright lawsuits by maintaining anonymity until robust digital watermarking and portrait blocking mechanisms are in place.

QWhat are the three most promising commercial applications for AI video generation highlighted in the article, and why are they viable?

AThe three key applications are: 1) E-commerce带货 video automation, reducing cost per video to 10-50 RMB and enabling mass A/B testing; 2) Short drama industrial production, replacing 30-40% of non-emotional shots like scenes and transitions to cut costs; 3) Localized overseas advertising, quickly generating culturally tailored ads for global markets at minimal marginal cost. All prioritize low cost, speed, and stability over top-tier quality.

Похожее

How Risky is the "Death Spiral" of MSTR and STRC?

Summary: This article explores the perceived "death spiral" risk between MicroStrategy (MSTR), its Bitcoin holdings, and its perpetual preferred stock (STRC), drawing comparisons to the LUNA-UST collapse. While both systems feature price anchors, high yields for holders, and potential feedback loops, their core mechanisms differ fundamentally. The MSTR-STRC structure relies on continuous financing to sustain its high dividend payouts, primarily through stock ATM offerings. A negative feedback cycle could occur: falling MSTR stock price makes raising equity capital harder, increasing pressure to sell Bitcoin, which undermines STRC confidence and further depresses MSTR. However, unlike LUNA-UST's automated, direct linkage, the MSTR-STRC loop is weaker and has brakes: STRC dividends can be deferred or rates lowered, and STRC holders have a $100/share liquidation preference in bankruptcy, providing a price floor. The company's sustainability hinges on its ability to continue financing. Its current ~$900 million USD reserves cover only about 6.3 months of its ~$1.71 billion annual interest/dividend burden. The next six months are critical, aligning with both the potential bottom in Bitcoin's four-year cycle and the depletion timeline of its reserves. While a LUNA-style catastrophic collapse is deemed highly unlikely due to structural differences, the key question is whether MicroStrategy can navigate this period through healthy deleveraging to restart its capital engine.

Foresight News18 мин. назад

How Risky is the "Death Spiral" of MSTR and STRC?

Foresight News18 мин. назад

How Much Debt Does Strategy Really Have? Is There a Risk of Implosion?

MicroStrategy's Debt Risk: A Turning Point in the "Never Sell" Strategy As of June 3, 2026, MicroStrategy holds 843,706 bitcoins (valued at ~$53.1B) but faces significant financial obligations. Its capital structure includes $6.75B in convertible notes and $15.48B in perpetual preferred stock (led by the $8.5B STRC series), creating an annual payout burden of ~$1.71B. With software revenue at only ~$500M, interest and dividend obligations far exceed operating income. A critical shift occurred in late May 2026 when the company sold 32 bitcoins for ~$2.5M to cover dividends, breaking CEO Michael Saylor's long-standing "never sell" pledge. This symbolic move triggered a sharp decline in both Bitcoin's price and MSTR stock, reflecting market fears about cash flow sustainability. The core of the strain is the STRC perpetual preferred stock, designed as a "permanent loan" with no maturity date but requiring high monthly dividends (currently 11.5%). Its business model relies on a three-part cycle: issuing new STRC shares, using proceeds to buy more Bitcoin and fund a USD reserve, and using that reserve to pay dividends. This cycle depends on continuous investor demand for STRC and Bitcoin's price appreciation. Analysis shows Bitcoin needs to appreciate at least 2.3% annually to cover the $1.71B in yearly obligations at current holdings. With Bitcoin price down ~22% from March 2026 highs, this pressure has intensified. The company's $900M USD reserve can only cover about 7 months of payments if STRC issuance stalls. Key risks are not immediate bankruptcy or forced Bitcoin liquidation (as BTC is not collateral), but rather: 1) The erosion of MSTR's premium to its Bitcoin holdings (mNAV), which would cripple its ability to raise cheap capital; 2) A vicious cycle where stagnant Bitcoin prices reduce STRC demand, draining the USD reserve and forcing BTC sales, further depressing prices. The period from February 2027 to September 2028 is a crucial test, with over $5.9B in convertible notes facing put options or maturity. In essence, MicroStrategy has evolved from a simple Bitcoin holder into a complex financial entity acting like a "private Bitcoin bank," leveraging its BTC holdings to create layered financial products. Its survival depends on maintaining Bitcoin's price trend, its stock premium, and market appetite for its preferred shares. The recent token sale marks not a betrayal of its Bitcoin thesis, but an admission that the leveraged strategy must eventually be paid for.

marsbit28 мин. назад

How Much Debt Does Strategy Really Have? Is There a Risk of Implosion?

marsbit28 мин. назад

Bitcoin Falls Sharply Behind Micron Technology As Investors Favor Semiconductor Exposure

Bitcoin is struggling to break the $63,000 level, having dropped sharply from its all-time high. It is now significantly underperforming compared to semiconductor stocks, particularly Micron Technology. Market analysis by Joao Wedson reveals Bitcoin has fallen over 95% against Micron, signaling a major investor rotation towards AI and semiconductor infrastructure assets. While this divergence could profoundly impact crypto markets, Wedson suggests cryptocurrency may eventually become a contrarian play. Meanwhile, Bitcoin's price decline is linked to large stakeholders ("whales") selling over 24,600 BTC last week, indicating market caution. Conversely, small-scale traders increased their holdings, potentially signaling a future buy opportunity.

bitcoinist32 мин. назад

Bitcoin Falls Sharply Behind Micron Technology As Investors Favor Semiconductor Exposure

bitcoinist32 мин. назад

Anthropic Cries Wolf: Is the AGI Threat Real, or Just an IPO Story?

Anthropic has published an article titled "When AI builds itself," discussing the emerging concept of "recursive self-improvement," where AI begins to actively participate in designing, training, testing, and optimizing its own subsequent versions. The company presents internal data showing that by May 2026, over 80% of code merged into its codebase was written by Claude, its AI model. Claude's capabilities have expanded to handling complex, open-ended engineering tasks, achieving a 76% success rate in such areas, and even contributing to research processes, such as optimizing code performance and conducting AI safety experiments. Anthropic outlines an evolution from human-driven development to AI-assisted workflows, culminating in the current stage where AI agents can autonomously write, run, and delegate code. The company cautions that the path toward a "closed loop," where AI continuously improves itself, is becoming visible. It calls for coordinated global mechanisms to potentially slow or pause frontier AI development to allow safety research and societal structures to catch up. However, the timing of this warning coincides with Anthropic's preparations for an IPO, framing the narrative not just as a safety concern but also as a demonstration of Claude's advanced capabilities and its integral role in accelerating Anthropic's own R&D—creating a potential "flywheel" effect for competitive advantage. This contrasts with OpenAI's recent, more policy-oriented discussion of the same risks, highlighting the competitive dynamics in the AI industry as companies position themselves in both the technological and regulatory landscape.

marsbit1 ч. назад

Anthropic Cries Wolf: Is the AGI Threat Real, or Just an IPO Story?

marsbit1 ч. назад

Coinbase Targets Crypto Crime, Freezing $3M Linked To Scam Operations

Coinbase froze over $3 million in cryptocurrency linked to Southeast Asian scam networks as part of a multi-agency "Disruption Week" campaign. The operation, coordinated by the DOJ's Scam Center Strike Force, involved companies like Meta, Microsoft, and Starlink, and disrupted over 1.4 million scam accounts. The DOJ noted that investment fraud, including "pig butchering" scams, is among the fastest-growing threats, with crypto-related scams causing billions in losses. This action follows other recent crackdowns targeting scam infrastructure globally. Coinbase highlighted that blockchain provides a permanent transaction record, aiding investigations and countering the narrative that crypto is solely a tool for crime.

bitcoinist1 ч. назад

Coinbase Targets Crypto Crime, Freezing $3M Linked To Scam Operations

bitcoinist1 ч. назад

Торговля

Спот

Фьючерсы

Обсуждения

Добро пожаловать в Сообщество HTX. Здесь вы сможете быть в курсе последних новостей о развитии платформы и получить доступ к профессиональной аналитической информации о рынке. Мнения пользователей о цене на AI (AI) представлены ниже.

5.4 Billion Burned, Sora Dies: Anonymous Chinese Model Kicks Open the Next Door in 38 Seconds

Введение

The Truth Behind Topping the Charts: Overwhelming Pure Visuals and a "Specialized" Dark Horse

Anonymous Conspiracy: Why Are Big Tech Companies Entering Incognito?

Million-Dollar Frameworks: The Ultimate Barrier for Giant Rivalry

Связанные с этим вопросы

Похожее

How Risky is the "Death Spiral" of MSTR and STRC?

How Much Debt Does Strategy Really Have? Is There a Risk of Implosion?

Bitcoin Falls Sharply Behind Micron Technology As Investors Favor Semiconductor Exposure

Anthropic Cries Wolf: Is the AGI Threat Real, or Just an IPO Story?

Coinbase Targets Crypto Crime, Freezing $3M Linked To Scam Operations

Торговля

Популярные статьи

AI Companions: Новое определение взаимодействия человека с ИИ

HTX Learn: пройдите обучение по "AI Companions" и разделите 10 000 USDT!

Неделя обучения по популярным токенам (2): 2026 может стать годом приложений реального времени, сектор AI продолжает оставаться в тренде

Обсуждения

Топ вопросы

Популярные категории

Популярные теги