Mysterious Model HappyHorse Tops the Chart Overnight: Is the Video Generation Arena Welcoming a "Game Changer"?

marsbitXuất bản vào 2026-04-08Cập nhật gần nhất vào 2026-04-08

Tóm tắt

A mysterious AI video generation model named "HappyHorse-1.0" has quietly topped the AI Video Arena leaderboard on Artificial Analysis, surpassing established models like Seedance 2.0 and others in Elo score—a user-blind-test-based ranking reflecting real perceived quality. The model’s origin was initially unknown, but technical analysis later linked it to the open-source model "daVinci-MagiHuman," jointly developed by Shanghai SII GAIR Lab and Beijing-based Sand.ai. HappyHorse-1.0, likely an optimized iteration by Sand.ai, uses a 15-billion-parameter transformer architecture for joint audio-video-text modeling. Its strong performance in human-centric scenes (e.g., portraits, narrations) helped it excel in blind tests, though it still lags in multi-character or complex motion scenarios. The achievement signals a potential shift: an open-source model rivaling closed-source alternatives in perceived quality, which could lower costs and increase flexibility for developers in vertical applications like virtual avatars. However, limitations remain, including high computational requirements (H100 GPU needed) and shorter generation lengths. While not yet threatening market leaders, HappyHorse represents progress toward open models reaching "production-ready" quality, potentially accelerating community-driven improvements in the video AI space.

No launch event, no technical blog, no corporate backing—a text-to-video model named HappyHorse-1.0 quietly topped the AI Video Arena rankings on the authoritative AI evaluation platform Artificial Analysis, surpassing Seedance 2.0 with a higher Elo score and leaving mainstream players like Keling and Tiangang far behind, sparking a "decryption race" in the tech community.

Artificial Analysis' ranking is not based on technical parameter evaluations but on aggregated blind test results from real users, reflected through Elo scores. This makes the ranking harder to question than typical benchmark scores and turns "Who made this?" into an unavoidable question.

"Happy Horse" Quietly Tops the Chart, Sparking a Guessing Game in Tech Circles

Speculations on X emerged quickly. The first clue noticed was the language order on the official website: Mandarin and Cantonese were listed before English. For a product targeting global users, this order is unusual—if the team were U.S.-based, English would almost certainly be first. This strongly suggests the team behind it is from China.

The name itself is also a clue. 2026 is the Year of the Horse in the lunar calendar, and the name "HappyHorse" subtly references this, similar to the earlier "Pony Alpha." Suspects quickly piled up: Tencent and Alibaba's founders both have the surname Ma" (horse), putting them naturally on the list; some bet on Xiaomi, noting Lei Jun's low-key style and penchant for surprise reveals; others felt it aligned more with DeepSeek, which had quietly released a visual model before taking it down. Speculations ran wild, but no one had solid evidence.

The real breakthrough came from technical comparisons. X user Vigo Zhao cross-referenced HappyHorse-1.0's public benchmark data with known models and found a highly matching candidate: daVinci-MagiHuman, an open-source model called "DaVinci Magic Human" launched on GitHub in March.

Visual quality 4.80, text alignment 4.18, physical consistency 4.52, word error rate in speech 14.60%—each metric matched. The official website structure was nearly identical too: architecture descriptions, performance tables, and demo video styles all seemed to follow the same template. Both use a single-stream Transformer architecture, both support joint audio-video generation, and both support the same list of languages. This level of overlap is hard to dismiss as coincidence.

The most widely accepted conclusion in tech circles is that HappyHorse is an optimized iteration of the open-source model daVinci-MagiHuman, developed by Sand.ai, one of the joint developers. The core goal is to validate the model's performance上限 under real user preferences, paving the way for future commercialization.

daVinci-MagiHuman was officially open-sourced on March 23, 2026, a collaboration between two young teams. One is from the Generative Artificial Intelligence Research Laboratory (GAIR) at Shanghai Institute of Intelligence (SII), led by scholar Liu Pengfei; the other is Beijing-based Sand.ai (San Dai Tech), founded by Cao Yue, who also has an academic background, with a focus on autoregressive world models.

The model uses a 15-billion-parameter pure self-attention single-stream Transformer, packing text, video, and audio tokens into the same sequence for joint modeling—no one in the open-source community had previously attempted true joint pre-training of audio and video from scratch, as most efforts involved stitching together single-modal bases.

How Did an Open-Source Video Model Achieve a Two-Week Comeback?

Once the identity was clarified, another question became even harder to answer: daVinci-MagiHuman was only open-sourced in late March, so how did HappyHorse-1.0 manage to secure a higher Elo score than Seedance 2.0 in just two weeks?

Based on information disclosed on the official website, it's reasonable to speculate that HappyHorse made targeted adjustments to the default generation strategy for the evaluation scenario.

The Elo system essentially accumulates user preferences. Slight improvements in perceptually sensitive areas—like stable facial expressions, audio-visual alignment, and visual appeal—can make a big difference in blind tests. The model's capability上限 remains unchanged, but its "evaluation performance" can be polished.

In fact, over 60% of the blind test samples on Artificial Analysis involve portrait generation and voice-over content. daVinci-MagiHuman was trained with a focus on portrait performance, giving it a natural advantage in such scenarios, which is the main reason for its领先 blind test win rate. If blind test samples are dominated by portrait close-ups, models skilled in portraits will systematically benefit, unrelated to their actual performance in multi-character, complex camera work, or long-term narrative scenarios.

The result is a noticeable gap between the ranking numbers and actual test experiences, splitting X discussants into two camps. Skeptics, after testing, believe that HappyHorse-1.0 still lags behind Seedance 2.0 in character details and motion coherence, questioning the representativeness of the Elo score itself.

Supporters, however, hold high hopes for HappyHorse's potential, hoping it can address the industry pain point of "visual consistency across multi-shot sequences," something current mainstream video models haven't solved well. If daVinci-MagiHuman truly makes a breakthrough here, it could be far more significant than a ranking.

The model's limitations shouldn't be overshadowed by the numbers. Xiaohongshu blogger @JACK's AI World was among the first to deploy and test daVinci-MagiHuman. He found that it requires an H100 to run, making it nearly impossible for consumer-grade GPUs. Although the community is researching quantization solutions, local deployment for individual users remains challenging in the short term.

In terms of scenarios, it currently excels mainly with single characters; once multiple people appear or the scene becomes high, the quality drops—this isn't something tuning parameters can fix, as it's directly related to its design focus on portraits. Generation length is typically around 10 seconds; going longer risks instability, and high-definition output requires super-resolution plugins.

@JACK's AI World concluded: daVinci-MagiHuman's overall usability is not as good as LTX 2.3; it will only be suitable for daily use after the community successfully implements quantization.

Has the Video Generation Arena Finally Welcomed a True "Game Changer"?

Of course, leading the rankings once doesn't say much. Next, HappyHorse will need to undergo more thorough testing in areas like stability, high-concurrency access speed, cross-scene consistency, character control precision, and generalization beyond the test set. These are the core metrics that determine whether a model can truly enter creators' workflows.

But if we zoom out to the broader industry landscape, the signal this event sends is already clear enough.

Open-source video models themselves aren't new. But a visible gap in effectiveness has long existed between open-source and closed-source models—in scenarios requiring delivery to clients, the generation quality of open-source models has consistently failed to cross the threshold from "usable" to "deliverable." The pricing power of closed-source products like Keling and Seedance is, to a considerable extent, built upon this gap.

The significance this time lies in the fact that a product based on an open-source model has, for the first time, matched mainstream closed-source competitors in a blind test ranking based on real user perception. Regardless of how much tuning was done for the evaluation scenario, for closed-source vendors relying on this gap to maintain pricing power, this is at least a signal worth taking seriously.

For developers, the implications of this turning point are more concrete. In vertical scenarios like portraits, digital humans, and virtual anchors, once the generation quality of an open-source base reaches the "deliverable" threshold, the cost structure of self-deployment will undergo substantial changes—not just compressing API call costs, but more importantly, bringing data, models, and the entire inference pipeline under one's own control, offering customization depth and privacy compliance flexibility that closed-source solutions can hardly match.

HappyHorse-1.0 won't shake the market positions of Seedance 2.0 or Keling in the short term. But once the perception that open-source models can rival closed-source ones is established, subsequent quantization optimizations, vertical fine-tuning, and inference acceleration will be pushed forward by the community at a pace far exceeding that of closed-source products.

In this Year of the Horse, what's truly worth watching might not be which horse runs the fastest, but the fact that the track itself is widening.

This article is from the WeChat public account "AI Value Official," author: Xingye, editor: Meiqi

Câu hỏi Liên quan

QWhat is the name of the text-to-video model that recently topped the AI Video Arena leaderboard on Artificial Analysis?

AHappyHorse-1.0

QWhich open-source model is HappyHorse-1.0 highly suspected to be based on, according to technical comparisons?

AdaVinci-MagiHuman

QWhat is the core architectural approach used by the daVinci-MagiHuman model for joint audio-video modeling?

AA single-stream Transformer architecture that models text, video, and audio tokens in a unified sequence.

QWhat is the primary reason HappyHorse-1.0 performed so well in the user-blind-test-based Elo ranking system?

AIt was likely optimized for the evaluation scenarios, particularly excelling in human portrait generation and narration content, which made up over 60% of the test samples.

QWhat broader industry signal does HappyHorse-1.0's performance send, according to the article?

AIt signals that open-source models can achieve user-perceived quality comparable to closed-source commercial products, potentially changing cost structures and offering greater flexibility for developers in vertical scenarios.

Nội dung Liên quan

Công ty của Donald Trump bán tiếp một lô Bitcoin lớn!

Công ty của Donald Trump vừa chuyển một lượng lớn Bitcoin lên sàn giao dịch CryptoCom, theo thông tin từ dữ liệu chuỗi khối. Khoảng 2.628 BTC, tương đương 165 triệu USD, đã được chuyển từ các địa chỉ được cho là liên kết với Trump Media & Technology Group. Trước đó, công ty được cho là đã mua tổng cộng 11.542 BTC với giá trung bình 118.500 USD mỗi đồng. Tính đến nay, khoảng 7.281 BTC đã được rút ra từ các địa chỉ này, trong khi 4.261 BTC được cho là vẫn còn giữ lại. Tổng lỗ (đã thực hiện và chưa thực hiện) từ các khoản đầu tư Bitcoin của Trump Media ước tính vào khoảng 555 triệu USD. Cần lưu ý rằng việc chuyển Bitcoin lên sàn giao dịch không nhất thiết có nghĩa là đã bán số tài sản này. Giao dịch có thể nhằm mục đích lưu ký, quản lý thanh khoản hoặc cho một hoạt động tài chính khác. Tuy nhiên, các giao dịch từ ví lạnh lên sàn tập trung thường được xem là động thái chuẩn bị bán.

cryptonews.ru1 giờ trước

Công ty của Donald Trump bán tiếp một lô Bitcoin lớn!

cryptonews.ru1 giờ trước

Tại sao Bitcoin duy trì mức 64.000 USD sau lần tạm dừng cứng rắn của Fed

Bitcoin duy trì quanh mức 64.000 USD sau cuộc họp của Fed. Dù Fed giữ lãi suất trong khoảng 3,50–3,75%, ba thành viên ủy ban đã bỏ phiếu ủng hộ tăng lãi suất, gửi đi tín hiệu chính sách thắt chặt hơn dự kiến. Bitcoin phản ứng biến động nhưng cuối cùng ổn định quanh 64.000 USD, với vùng hỗ trợ 63.000–63.500 USD và kháng cự ở 66.000 USD. ETF Bitcoin ghi nhận dòng vốn ròng chảy vào 32,1 triệu USD, chấm dứt chuỗi rút tiền, trong khi ETF Ethereum tiếp tục thất thoát 18,65 triệu USD. Điều này cho thấy sự luân chuyển vốn sang Bitcoin giữa bối cảnh bất ổn vĩ mô. Ethereum giao dịch quanh 1.900 USD, áp lực bán tăng nhưng mạng lưới vẫn mạnh với lượng ETH chờ staking cao. Các altcoin khác di chuyển không đồng nhất. Về pháp lý, dự luật CLARITY Act bị trì hoãn đến sau kỳ nghỉ tháng 8 của Thượng viện Mỹ, khiến thị trường thận trọng hơn về khả năng thông qua vào năm 2026. Tóm lại, thị trường tiền mã hóa đang trong trạng thái chờ đợi, với Bitcoin thể hiện sự kiên cường nhờ dòng vốn ETF. Các nhà đầu tư trung hạn cần theo dõi khả năng giữ trên 63.000 USD của Bitcoin, mức 1.860 USD của Ethereum và dòng vốn thể chế để tìm tín hiệu cho sự phục hồi trong nửa cuối năm 2026.

cryptonews.ru1 giờ trước

Tại sao Bitcoin duy trì mức 64.000 USD sau lần tạm dừng cứng rắn của Fed

cryptonews.ru1 giờ trước

Parker Lewis Lý Giải Tại Sao Bitcoin Vẫn Là Đồng Tiền Tốt Nhất

Nhà phân tích Bitcoin uy tín Parker Lewis chỉ trích mạnh mẽ các chiến lược tiếp thị của các công ty tự xưng là kho bạc tiền mã hóa. Ông cho rằng việc các công ty này huy động vốn thông qua việc bán "tín dụng số" dưới dạng cổ phiếu ưu đãi vĩnh viễn đã làm sai lệch bản chất của tiền mã hóa đầu tiên. Lewis nhấn mạnh Bitcoin không có lợi suất định sẵn, và việc hứa hẹn cổ tức thường xuyên là một trò chơi rủi ro cao, dựa chủ yếu vào việc thu hút nhà đầu tư mới trên thị trường tăng trưởng. Ông dẫn chứng sự chênh lệch lớn giữa thị trường tín dụng toàn cầu (300 nghìn tỷ USD) và thị trường cổ phiếu ưu đãi vĩnh viễn (1 nghìn tỷ USD) để chứng minh rủi ro của các công cụ phái sinh này, thường được chuyển cho các nhà đầu tư nhỏ lẻ. Lewis bác bỏ quan điểm cho rằng Bitcoin quá biến động, lập luận rằng biến động là hệ quả tự nhiên của việc chấp nhận một loại tài sản mới. Với nguồn cung cứng và không co giãn, mỗi làn sóng người dùng mới sẽ dẫn đến biến động giá mạnh. Thay vì mua cổ phiếu phái sinh của các công ty như MicroStrategy, ông khuyên nên mua Bitcoin trực tiếp, vì điều này an toàn hơn về mặt toán học so với việc giao tiền cho các nhà quản lý tập đoàn. Việc chuyển hướng sang các công cụ phái sinh làm mất tập trung vào mối đe dọa thực sự: sự mất giá nhanh chóng của tiền pháp định. Lewis đưa ra "Chỉ số thịt bò" của riêng mình, cho thấy mức lạm phát tiêu dùng thực tế khoảng 12-13% mỗi năm, cao hơn nhiều so với số liệu chính thức. Ông kết luận rằng chiến lược tài chính an toàn và hiệu quả nhất trong bối cảnh lạm phát toàn cầu là sở hữu trực tiếp Bitcoin và tự kiểm soát khóa riêng tư, thay vì theo đuổi lợi nhuận đầy rủi ro từ cổ phiếu kho bạc tiền mã hóa.

cryptonews.ru1 giờ trước

Parker Lewis Lý Giải Tại Sao Bitcoin Vẫn Là Đồng Tiền Tốt Nhất

cryptonews.ru1 giờ trước

Công ty ARK Invest của Katie Wood mua 109,129 cổ phiếu Circle với giá 6,83 triệu USD

Công ty ARK Invest do bà Cathie Wood (Kэти Вуд) lãnh đạo đã mua 109.129 cổ phiếu Circle với giá trị gần 6,83 triệu USD thông qua ba quỹ ETF của mình. Giao dịch này diễn ra không lâu sau khi Circle nhận được giấy phép quản lý tín thác từ Sở Dịch vụ Tài chính New York cho một thực thể mới có tên Circle New York Trust, điều mà CEO Jeremy Allaire gọi là "mục tiêu dài hạn". Tuy nhiên, cổ phiếu CRCL của Circle vẫn giảm 2,54% vào ngày 31/7, cho thấy các nhà đầu tư có thể chưa coi giấy phép là yếu tố thúc đẩy tăng trưởng. Cùng lúc, ARK Invest cũng mua vào cổ phiếu của Tesla, SpaceX và Nvidia với tổng trị giá khoảng 40,2 triệu USD trong đợt bán tháo các công ty công nghệ, đồng thời giảm tỷ trọng nắm giữ ở một số công ty khác như Shopify và Cloudflare.

cryptonews.ru1 giờ trước

Công ty ARK Invest của Katie Wood mua 109,129 cổ phiếu Circle với giá 6,83 triệu USD

cryptonews.ru1 giờ trước

Bắt giữ các thành viên trong âm mưu lừa đảo với XRP, chiếm đoạt 9 triệu đô la từ 71 nhà đầu tư

Vào ngày 30/7, Cảnh sát Thủ đô Seoul thông báo đã bắt giữ ba nghi phạm trong một vụ lừa đảo đầu tư liên quan đến tiền điện tử XRP. Nhóm này bị cáo buộc vận hành nền tảng đầu tư gian lận Fxrpntwork.com, thu hút 71 nhà đầu tư chuyển khoảng 3,4 triệu đô la XRP trong một tuần (từ 16-23/10) trước khi đóng cửa trang web và biến mất. Các nghi phạm quảng bá trang web thông qua blog, bài báo trực tuyến và video YouTube, hứa hẹn bảo toàn vốn gốc và lợi nhuận hàng tháng từ 1,5% đến 1,8%. Họ hướng dẫn nạn nhân chuyển XRP từ các sàn giao dịch Hàn Quốc qua nền tảng nước ngoài rồi vào ví do nhóm kiểm soát. Cảnh sát Seoul cảnh báo các nhà đầu tư cần kiểm tra kỹ nguồn thông tin chính thức trước khi chuyển tiền. Các đối tượng đã sao chép thương hiệu của Flare Network và FXRP để tạo vẻ ngoài hợp pháp. Ripple và Interpol cảnh báo đây là thủ đoạn phổ biến, với các mạng lưới tội phạm xuyên quốc gia ngày càng có tổ chức. Việc hứa hẹn lợi nhuận đảm bảo vẫn là dấu hiệu phổ biến của lừa đảo tiền điện tử. Hai nghi phạm 29 tuổi đã bị bắt, một người 34 tuổi khác bị đề nghị truy tố, và một nghi phạm 29 tuổi khác đang bị truy nã ở nước ngoài với thông báo đỏ của Interpol. Cảnh sát cũng đã phong tỏa tài sản ảo trị giá 17,3 tỷ won và đang điều tra các chuyển khoản liên quan trị giá 27,3 tỷ won để xác định thêm nạn nhân và đồng phạm.

cryptonews.ru1 giờ trước

Bắt giữ các thành viên trong âm mưu lừa đảo với XRP, chiếm đoạt 9 triệu đô la từ 71 nhà đầu tư

cryptonews.ru1 giờ trước

Giao dịch

Giao ngay

Mysterious Model HappyHorse Tops the Chart Overnight: Is the Video Generation Arena Welcoming a "Game Changer"?

Tóm tắt

"Happy Horse" Quietly Tops the Chart, Sparking a Guessing Game in Tech Circles

How Did an Open-Source Video Model Achieve a Two-Week Comeback?

Has the Video Generation Arena Finally Welcomed a True "Game Changer"?

Câu hỏi Liên quan

Nội dung Liên quan

Công ty của Donald Trump bán tiếp một lô Bitcoin lớn!

Tại sao Bitcoin duy trì mức 64.000 USD sau lần tạm dừng cứng rắn của Fed

Parker Lewis Lý Giải Tại Sao Bitcoin Vẫn Là Đồng Tiền Tốt Nhất

Công ty ARK Invest của Katie Wood mua 109,129 cổ phiếu Circle với giá 6,83 triệu USD

Bắt giữ các thành viên trong âm mưu lừa đảo với XRP, chiếm đoạt 9 triệu đô la từ 71 nhà đầu tư

Giao dịch

Danh mục Phổ biến

Thẻ Nổi bật