Kalshi's First Research Report Released: How Collective Intelligence Outperforms Wall Street Think Tanks in Predicting CPI

Odaily星球日报发布于2025-12-24更新于2025-12-24

文章摘要

Kalshi Research's inaugural report demonstrates that prediction markets consistently outperform Wall Street consensus forecasts in predicting the U.S. year-over-year CPI inflation rate. The study, covering over 25 monthly CPI releases from February 2023 to mid-2025, shows Kalshi’s market-implied forecasts had a 40.1% lower mean absolute error (MAE) than consensus predictions across all environments. The advantage was most pronounced during economic "shocks." For large surprises (over 0.2 percentage points), Kalshi's forecasts were 50% more accurate a week before the data release, improving to 60% more accurate the day before. For medium surprises (0.1-0.2 percentage points), the advantage was similarly 50%, rising to 56.2% closer to the release. Crucially, a divergence of over 0.1 percentage points between the market forecast and consensus served as a strong signal, with an 81.2% probability that a shock would occur. When the two forecasts disagreed, the market prediction was more accurate 75% of the time. The report attributes this "Shock Alpha" to three factors: the "wisdom of crowds" aggregating diverse information, superior incentive structures that reward accuracy over conformity, and more efficient information synthesis, even with the same public data. This suggests prediction markets provide a valuable, differentiated signal for investors and policymakers, especially during periods of high uncertainty.

This article is from:Kalshi Research

Compiled by | Odaily Planet Daily (@OdailyChina); Translator | Azuma (@azuma_eth)

Editor's Note: Leading prediction market platform Kalshi yesterday announced the launch of a new research report series, Kalshi Research, aimed at providing Kalshi's internal data to scholars and researchers interested in topics related to prediction markets. The inaugural research report for this series has been released. The original title is "Kalshi Outperforms Wall Street in Predicting Inflation" (Beyond Consensus: Prediction Markets and the Forecasting of Inflation Shocks).

Below is the content of the original report, compiled by Odaily Planet Daily.

Overview

Typically, in the week leading up to the release of important economic statistics, analysts and senior economists from large financial institutions provide their estimates of the expected figures. These forecasts, when aggregated, are referred to as the "consensus expectation" and are widely regarded as a crucial reference for gaining insights into market changes and adjusting position layouts.

In this research report, we compare the performance of the consensus expectation versus the implied pricing from Kalshi's prediction markets (sometimes referred to herein as "market prediction") in forecasting the actual value of a key macroeconomic signal—the year-over-year headline inflation rate (YOY CPI).

Key Highlights

  • Overall Superior Accuracy: Across all market environments (including normal and shock periods), Kalshi's predictions had a Mean Absolute Error (MAE) that was 40.1% lower than the consensus expectation.
  • "Shock Alpha": During periods of major shocks (greater than 0.2 percentage points), Kalshi's predictions one week ahead had an MAE 50% lower than the consensus expectation; this advantage expanded to 60% on the day before the data release. During periods of moderate shocks (between 0.1 - 0.2 percentage points), Kalshi's predictions one week ahead also had an MAE 50% lower than the consensus expectation, expanding to 56.2% on the day before the data release.
  • Predictive Signal: When the deviation between the market prediction and the consensus expectation exceeded 0.1 percentage points, the probability of a shock occurring was approximately 81.2%, rising to about 82.4% on the day before the data release. In cases where the market prediction differed from the consensus expectation, the market prediction was more accurate in 75% of instances.

Background

Macroeconomic forecasters face an inherent challenge: the times when forecasting is most critical—namely, during market dislocations, policy shifts, and structural breaks—are precisely the periods when historical models are most likely to fail. Financial market participants typically release consensus forecasts days before key economic data announcements, aggregating expert opinions into market expectations. However, these consensus views, while valuable, often share similar methodological approaches and information sources.

For institutional investors, risk managers, and policymakers, the stakes of forecasting accuracy are asymmetric. During uncontroversial periods, slightly better predictions offer limited value; but during periods of market turmoil—when volatility spikes, correlations break down, or historical relationships fail—superior accuracy can yield significant Alpha returns and limit drawdowns.

Therefore, understanding how parameters behave during market volatility is crucial. We focus on a key macroeconomic indicator—the year-over-year headline inflation rate (YOY CPI)—a core reference for future interest rate decisions and an important signal of economic health.

We compared and evaluated forecasting accuracy across multiple time windows before the official data release. Our core finding is that so-called "Shock Alpha" indeed exists—during tail events, market-based predictions can achieve additional predictive precision compared to the consensus benchmark. This outperformance is not merely of academic interest; it significantly enhances signal quality at critical moments when forecasting errors carry the highest economic cost. In this context, the truly important question is not whether prediction markets are "always correct," but whether they provide a differentiated signal worthy of inclusion in traditional decision-making frameworks.

Methodology

Data

We analyzed the daily implied predictions of traders on the Kalshi platform at three time points: one week before the data release (matching the consensus release timing), one day before release, and the morning of the release. Each market used was (or had been) a real, tradable, active market, reflecting real-money positions at varying liquidity levels. For the consensus expectation, we collected institution-level YoY CPI consensus forecasts, typically published about a week before the U.S. Bureau of Labor Statistics official data release.

The sample period spans from February 2023 to mid-2025, covering over 25 monthly CPI release cycles across various macroeconomic environments.

Shock Classification

We categorized events into three types based on the "magnitude of surprise" relative to historical levels. A "shock" was defined as the absolute difference between the consensus expectation and the actual published data:

  • Normal Events: YoY CPI forecast error below 0.1 percentage points;
  • Moderate Shocks: YoY CPI forecast error between 0.1 and 0.2 percentage points;
  • Major Shocks: YoY CPI forecast error exceeding 0.2 percentage points.

This classification allows us to examine whether predictive advantages vary systematically with the difficulty of the forecast.

Performance Metrics

To evaluate forecasting performance, we employed the following metrics:

  • Mean Absolute Error (MAE): The primary accuracy metric, calculated as the average of the absolute differences between predicted and actual values.
  • Win Rate: When the difference between the consensus expectation and the market prediction reached or exceeded 0.1 percentage points (rounded to one decimal place), we recorded which forecast was closer to the final actual result.
  • Forecast Timeframe Analysis: We tracked how the accuracy of market valuations evolved from one week before release to the release day, revealing the value of continuously incorporating information.

Results: CPI Forecasting Performance

Overall Superior Accuracy

Across all market environments, the market-based CPI predictions had a Mean Absolute Error (MAE) that was 40.1% lower than the consensus forecasts. Across all timeframes, the MAE for market-based CPI predictions was lower than the consensus expectation by 40.1% (one week ahead) to 42.3% (one day ahead).

Furthermore, in cases where the consensus expectation and the market-implied value diverged, Kalshi's market-based predictions demonstrated a statistically significant win rate, ranging from 75.0% one week ahead to 81.2% on release day. If ties with the consensus expectation (accurate to one decimal place) are included, the market-based prediction was tied or better than consensus in approximately 85% of cases one week ahead.

Such a high directional accuracy rate indicates: when market predictions diverge from the consensus expectation, this divergence itself carries significant informational value regarding the likelihood of a shock event occurring.

"Shock Alpha" Exists

The difference in forecasting accuracy was particularly pronounced during shock events. During moderate shock events, the MAE of market predictions was 50% lower than the consensus expectation at the release time, and this advantage expanded to 56.2% or more on the day before the data release; during major shock events, the MAE of market predictions was also 50% lower than the consensus expectation at the release time, and could reach 60% or more on the day before the data release; whereas in normal environments without shocks, market predictions and consensus expectations performed roughly equally.

Although the sample size for shock events is small (reasonable in a world where shocks are inherently highly unpredictable), the overall pattern is clear: when the forecasting environment is most challenging, the information aggregation advantages of markets are most valuable.

However, more importantly, it's not just that Kalshi's predictions perform better during shock periods, but also that the divergence between market predictions and the consensus expectation itself may be a signal of an impending shock. In cases of divergence, the win rate of market predictions relative to the consensus expectation reached 75% (within comparable time windows). Furthermore, threshold analysis indicates: when the deviation between the market and consensus exceeds 0.1 percentage points, the probability of predicting a shock is approximately 81.2%, and on the day before the data release, this probability further increases to about 84.2%.

This practically significant difference suggests that prediction markets can serve not only as a competitive forecasting tool alongside consensus expectations but also as a "meta-signal" regarding forecasting uncertainty, transforming market-consensus divergence into a quantifiable early warning indicator for potential unexpected outcomes.

Further Discussion

An obvious question follows: Why do market predictions outperform consensus forecasts during shocks? We propose three complementary mechanisms to explain this phenomenon.

Market Participant Heterogeneity and "Wisdom of the Crowd"

Traditional consensus expectations, while integrating views from multiple institutions, often share similar methodological assumptions and information sources. Econometric models, Wall Street research reports, and government data releases form a highly overlapping common knowledge base.

In contrast, prediction markets aggregate positions held by participants with diverse information bases: including proprietary models, industry-level insights, alternative data sources, and experience-based intuition. This participant diversity has a solid theoretical foundation in the "wisdom of crowds" theory. This theory suggests that when participants possess relevant information and their prediction errors are not perfectly correlated, aggregating independent predictions from diverse sources often yields superior estimates.

The value of this informational diversity is particularly pronounced during "state shifts" in the macro environment—individuals with scattered, local information interact in the market, and their informational fragments combine to form a collective signal.

Differences in Participant Incentive Structures

Institution-level consensus forecasters often operate within complex organizational and reputational systems that systematically deviate from the goal of "purely pursuing predictive accuracy." The career risks faced by professional forecasters create an asymmetric payoff structure—significant forecasting errors incur substantial reputational costs, while even extreme accuracy, especially achieved by deviating substantially from peer consensus, may not yield proportional career rewards.

This asymmetry induces "herding behavior," where forecasters tend to cluster their predictions near the consensus value, even if their private information or model outputs suggest different results. The reason is that within the career system, the cost of "being wrong alone" is often higher than the reward for "being right alone."

In stark contrast, the incentive mechanism faced by prediction market participants directly aligns forecasting accuracy with economic outcomes—accurate predictions mean profits, incorrect predictions mean losses. In this system, reputational factors are almost non-existent; the only cost of deviating from market consensus is economic loss, entirely dependent on the prediction's correctness. This structure imposes stronger selection pressure for predictive accuracy—participants who can systematically identify consensus forecast errors continuously accumulate capital and amplify their influence in the market through larger position sizes;而那些 mechanically follow consensus suffer continuous losses when consensus proves wrong.

During periods of significantly heightened uncertainty, when the career cost for institutional forecasters to deviate from expert consensus is at its peak, this divergence in incentive structures is often most pronounced and economically most significant.

Information Aggregation Efficiency

A noteworthy empirical fact is: even one week before the data release—a timeframe matching the typical window for consensus expectation releases—market predictions still exhibit significant accuracy advantages. This suggests that the market advantage does not stem solely from the often-cited "information speed advantage" of prediction market participants.

Instead, market predictions may more efficiently aggregate information fragments that are too dispersed, too industry-specific, or too ambiguous to be formally incorporated into traditional econometric forecasting frameworks. The relative advantage of prediction markets may lie not in earlier access to public information, but in their ability to synthesize heterogeneous information more effectively within the same timeframe—information that survey-based consensus mechanisms, even with the same time window, often struggle to process efficiently.

Limitations and Caveats

Our findings require an important qualification. Since the overall sample covers only about 30 months, and major shock events are by definition rare, this means statistical power remains limited for larger tail events. A longer time series will enhance future inferential ability, although the current results strongly suggest the superiority and differentiated signal of market predictions.

Conclusion

We document systematic and economically significant outperformance of prediction markets relative to expert consensus expectations, particularly during shock periods when forecasting accuracy is most critical. Market-based CPI predictions exhibited approximately 40% lower error overall, and this error reduction reached about 60% during periods of major structural change.

Based on these findings, several future research directions become particularly important: First, investigating whether "Shock Alpha" events themselves can be predicted using volatility and forecast divergence indicators, across a larger sample size and multiple macroeconomic indicators; Second, determining the liquidity threshold above which prediction markets can stably outperform traditional forecasting methods; Third, exploring the relationship between prediction market forecasts and those implied by high-frequency trading financial instruments.

In an environment where consensus forecasts heavily rely on correlated model assumptions and shared information sets, prediction markets offer an alternative information aggregation mechanism capable of capturing state switches earlier and processing heterogeneous information more efficiently. For entities needing to make decisions in an economic environment characterized by rising structural uncertainty and tail event frequency, "Shock Alpha" may represent not just an incremental improvement in predictive capability, but a fundamental component of a robust risk management infrastructure.

相关问答

QWhat is the main finding of Kalshi Research's first report regarding CPI prediction accuracy?

AKalshi's prediction market had a 40.1% lower Mean Absolute Error (MAE) than Wall Street consensus forecasts across all market conditions.

QWhat is 'Shock Alpha' as defined in the Kalshi report?

A'Shock Alpha' refers to the significant additional predictive accuracy of Kalshi's market-based forecasts over consensus during shock events, with MAE reductions of 50% to 60%.

QWhat probability does a divergence of over 0.1 percentage points between market and consensus forecasts signal a potential shock event?

AA divergence of over 0.1 percentage points signals an approximately 81.2% probability of a shock event occurring, rising to about 84.2% the day before the data release.

QWhat are the three mechanisms proposed in the report to explain why market predictions outperform consensus during shocks?

AThe three mechanisms are: 1. Participant heterogeneity and the 'wisdom of crowds'. 2. Differences in incentive structures (direct financial alignment in markets vs. career risks in institutions). 3. Superior information aggregation efficiency in markets.

QWhat key macroeconomic indicator was the focus of the performance comparison in this study?

AThe study focused on comparing the prediction performance for the year-over-year headline inflation rate (YoY CPI).

你可能也喜欢

被 PayPal 扫地出门,马斯克要用加密市场卷土重来

马斯克旗下社交平台X于2026年4月推出Smart Cashtags功能,上线数天即推动约10亿美元交易量。该功能允许用户直接点击帖文中的股票或加密货币代码查看实时价格与讨论,并支持通过智能合约地址搜索资产,目前在美国和加拿大iOS用户中上线,未来将扩展至Android和网页端。X与加拿大券商Wealthsimple合作,用户可一键跳转交易,实现从信息到交易的闭环体验。 此举被视为社交交易(Social Trading)崛起的关键一步,Galaxy Research报告指出加密行业已形成三大“注意力金融化”工具:链上代币(文化传播金融化)、永续合约(杠杆观点金融化)和预测市场(信息优势金融化)。X凭借5-6亿月活用户和加密社群核心地位,有望大幅降低交易门槛,吸引新用户。 更重要的布局是即将全面推出的X Money,这是马斯克重返支付战场的关键工具,对标微信支付和支付宝的“通讯+商务”模式。其测试版已提供6%年化收益、3%消费返现、P2P转账等功能,未来可能整合加密货币交易与稳定币结算,构建“超级金融社交App”。 然而,X面临监管挑战。美国参议员Warren质疑6%高收益的可持续性及合作银行Cross River Bank的合规记录,同时稳定币的保险漏洞和社交交易可能加剧羊群效应与金融风险娱乐化问题。若X能平衡创新与监管,或将为加密市场注入新资金,成为产业新增长点。

marsbit2小时前

被 PayPal 扫地出门,马斯克要用加密市场卷土重来

marsbit2小时前

交易

现货
合约

热门文章

如何购买S

欢迎来到HTX.com!我们已经让购买Sonic(S)变得简单而便捷。跟随我们的逐步指南,放心开始您的加密货币之旅。第一步:创建您的HTX账户使用您的电子邮件、手机号码注册一个免费账户在HTX上。体验无忧的注册过程并解锁所有平台功能。立即注册第二步:前往买币页面,选择您的支付方式信用卡/借记卡购买:使用您的Visa或Mastercard即时购买Sonic(S)。余额购买:使用您HTX账户余额中的资金进行无缝交易。第三方购买:探索诸如Google Pay或Apple Pay等流行支付方法以增加便利性。C2C购买:在HTX平台上直接与其他用户交易。HTX场外交易台(OTC)购买:为大量交易者提供个性化服务和竞争性汇率。第三步:存储您的Sonic(S)购买完您的Sonic(S)后,将其存储在您的HTX账户钱包中。您也可以通过区块链转账将其发送到其他地方或者用于交易其他加密货币。第四步:交易Sonic(S)在HTX的现货市场轻松交易Sonic(S)。访问您的账户,选择您的交易对,执行您的交易,并实时监控。HTX为初学者和经验丰富的交易者提供了友好的用户体验。

2.1k人学过发布于 2025.01.15更新于 2025.03.21

如何购买S

相关讨论

欢迎来到HTX社区。在这里,您可以了解最新的平台发展动态并获得专业的市场意见。以下是用户对S(S)币价的意见。

活动图片