Kalshi's First Research Report Released: How Collective Intelligence Outperforms Wall Street Think Tanks in Predicting CPI

Odaily星球日报发布于2025-12-24更新于2025-12-24

文章摘要

Kalshi Research's inaugural report demonstrates that prediction markets consistently outperform Wall Street consensus forecasts in predicting the U.S. year-over-year CPI inflation rate. The study, covering over 25 monthly CPI releases from February 2023 to mid-2025, shows Kalshi’s market-implied forecasts had a 40.1% lower mean absolute error (MAE) than consensus predictions across all environments. The advantage was most pronounced during economic "shocks." For large surprises (over 0.2 percentage points), Kalshi's forecasts were 50% more accurate a week before the data release, improving to 60% more accurate the day before. For medium surprises (0.1-0.2 percentage points), the advantage was similarly 50%, rising to 56.2% closer to the release. Crucially, a divergence of over 0.1 percentage points between the market forecast and consensus served as a strong signal, with an 81.2% probability that a shock would occur. When the two forecasts disagreed, the market prediction was more accurate 75% of the time. The report attributes this "Shock Alpha" to three factors: the "wisdom of crowds" aggregating diverse information, superior incentive structures that reward accuracy over conformity, and more efficient information synthesis, even with the same public data. This suggests prediction markets provide a valuable, differentiated signal for investors and policymakers, especially during periods of high uncertainty.

This article is from:Kalshi Research

Compiled by | Odaily Planet Daily (@OdailyChina); Translator | Azuma (@azuma_eth)

Editor's Note: Leading prediction market platform Kalshi yesterday announced the launch of a new research report series, Kalshi Research, aimed at providing Kalshi's internal data to scholars and researchers interested in topics related to prediction markets. The inaugural research report for this series has been released. The original title is "Kalshi Outperforms Wall Street in Predicting Inflation" (Beyond Consensus: Prediction Markets and the Forecasting of Inflation Shocks).

Below is the content of the original report, compiled by Odaily Planet Daily.

Overview

Typically, in the week leading up to the release of important economic statistics, analysts and senior economists from large financial institutions provide their estimates of the expected figures. These forecasts, when aggregated, are referred to as the "consensus expectation" and are widely regarded as a crucial reference for gaining insights into market changes and adjusting position layouts.

In this research report, we compare the performance of the consensus expectation versus the implied pricing from Kalshi's prediction markets (sometimes referred to herein as "market prediction") in forecasting the actual value of a key macroeconomic signal—the year-over-year headline inflation rate (YOY CPI).

Key Highlights

Overall Superior Accuracy: Across all market environments (including normal and shock periods), Kalshi's predictions had a Mean Absolute Error (MAE) that was 40.1% lower than the consensus expectation.
"Shock Alpha": During periods of major shocks (greater than 0.2 percentage points), Kalshi's predictions one week ahead had an MAE 50% lower than the consensus expectation; this advantage expanded to 60% on the day before the data release. During periods of moderate shocks (between 0.1 - 0.2 percentage points), Kalshi's predictions one week ahead also had an MAE 50% lower than the consensus expectation, expanding to 56.2% on the day before the data release.
Predictive Signal: When the deviation between the market prediction and the consensus expectation exceeded 0.1 percentage points, the probability of a shock occurring was approximately 81.2%, rising to about 82.4% on the day before the data release. In cases where the market prediction differed from the consensus expectation, the market prediction was more accurate in 75% of instances.

Background

Macroeconomic forecasters face an inherent challenge: the times when forecasting is most critical—namely, during market dislocations, policy shifts, and structural breaks—are precisely the periods when historical models are most likely to fail. Financial market participants typically release consensus forecasts days before key economic data announcements, aggregating expert opinions into market expectations. However, these consensus views, while valuable, often share similar methodological approaches and information sources.

For institutional investors, risk managers, and policymakers, the stakes of forecasting accuracy are asymmetric. During uncontroversial periods, slightly better predictions offer limited value; but during periods of market turmoil—when volatility spikes, correlations break down, or historical relationships fail—superior accuracy can yield significant Alpha returns and limit drawdowns.

Therefore, understanding how parameters behave during market volatility is crucial. We focus on a key macroeconomic indicator—the year-over-year headline inflation rate (YOY CPI)—a core reference for future interest rate decisions and an important signal of economic health.

We compared and evaluated forecasting accuracy across multiple time windows before the official data release. Our core finding is that so-called "Shock Alpha" indeed exists—during tail events, market-based predictions can achieve additional predictive precision compared to the consensus benchmark. This outperformance is not merely of academic interest; it significantly enhances signal quality at critical moments when forecasting errors carry the highest economic cost. In this context, the truly important question is not whether prediction markets are "always correct," but whether they provide a differentiated signal worthy of inclusion in traditional decision-making frameworks.

Methodology

Data

We analyzed the daily implied predictions of traders on the Kalshi platform at three time points: one week before the data release (matching the consensus release timing), one day before release, and the morning of the release. Each market used was (or had been) a real, tradable, active market, reflecting real-money positions at varying liquidity levels. For the consensus expectation, we collected institution-level YoY CPI consensus forecasts, typically published about a week before the U.S. Bureau of Labor Statistics official data release.

The sample period spans from February 2023 to mid-2025, covering over 25 monthly CPI release cycles across various macroeconomic environments.

Shock Classification

We categorized events into three types based on the "magnitude of surprise" relative to historical levels. A "shock" was defined as the absolute difference between the consensus expectation and the actual published data:

Normal Events: YoY CPI forecast error below 0.1 percentage points;
Moderate Shocks: YoY CPI forecast error between 0.1 and 0.2 percentage points;
Major Shocks: YoY CPI forecast error exceeding 0.2 percentage points.

This classification allows us to examine whether predictive advantages vary systematically with the difficulty of the forecast.

Performance Metrics

To evaluate forecasting performance, we employed the following metrics:

Mean Absolute Error (MAE): The primary accuracy metric, calculated as the average of the absolute differences between predicted and actual values.
Win Rate: When the difference between the consensus expectation and the market prediction reached or exceeded 0.1 percentage points (rounded to one decimal place), we recorded which forecast was closer to the final actual result.
Forecast Timeframe Analysis: We tracked how the accuracy of market valuations evolved from one week before release to the release day, revealing the value of continuously incorporating information.

Results: CPI Forecasting Performance

Overall Superior Accuracy

Across all market environments, the market-based CPI predictions had a Mean Absolute Error (MAE) that was 40.1% lower than the consensus forecasts. Across all timeframes, the MAE for market-based CPI predictions was lower than the consensus expectation by 40.1% (one week ahead) to 42.3% (one day ahead).

Furthermore, in cases where the consensus expectation and the market-implied value diverged, Kalshi's market-based predictions demonstrated a statistically significant win rate, ranging from 75.0% one week ahead to 81.2% on release day. If ties with the consensus expectation (accurate to one decimal place) are included, the market-based prediction was tied or better than consensus in approximately 85% of cases one week ahead.

Such a high directional accuracy rate indicates: when market predictions diverge from the consensus expectation, this divergence itself carries significant informational value regarding the likelihood of a shock event occurring.

"Shock Alpha" Exists

The difference in forecasting accuracy was particularly pronounced during shock events. During moderate shock events, the MAE of market predictions was 50% lower than the consensus expectation at the release time, and this advantage expanded to 56.2% or more on the day before the data release; during major shock events, the MAE of market predictions was also 50% lower than the consensus expectation at the release time, and could reach 60% or more on the day before the data release; whereas in normal environments without shocks, market predictions and consensus expectations performed roughly equally.

Although the sample size for shock events is small (reasonable in a world where shocks are inherently highly unpredictable), the overall pattern is clear: when the forecasting environment is most challenging, the information aggregation advantages of markets are most valuable.

However, more importantly, it's not just that Kalshi's predictions perform better during shock periods, but also that the divergence between market predictions and the consensus expectation itself may be a signal of an impending shock. In cases of divergence, the win rate of market predictions relative to the consensus expectation reached 75% (within comparable time windows). Furthermore, threshold analysis indicates: when the deviation between the market and consensus exceeds 0.1 percentage points, the probability of predicting a shock is approximately 81.2%, and on the day before the data release, this probability further increases to about 84.2%.

This practically significant difference suggests that prediction markets can serve not only as a competitive forecasting tool alongside consensus expectations but also as a "meta-signal" regarding forecasting uncertainty, transforming market-consensus divergence into a quantifiable early warning indicator for potential unexpected outcomes.

Further Discussion

An obvious question follows: Why do market predictions outperform consensus forecasts during shocks? We propose three complementary mechanisms to explain this phenomenon.

Market Participant Heterogeneity and "Wisdom of the Crowd"

Traditional consensus expectations, while integrating views from multiple institutions, often share similar methodological assumptions and information sources. Econometric models, Wall Street research reports, and government data releases form a highly overlapping common knowledge base.

In contrast, prediction markets aggregate positions held by participants with diverse information bases: including proprietary models, industry-level insights, alternative data sources, and experience-based intuition. This participant diversity has a solid theoretical foundation in the "wisdom of crowds" theory. This theory suggests that when participants possess relevant information and their prediction errors are not perfectly correlated, aggregating independent predictions from diverse sources often yields superior estimates.

The value of this informational diversity is particularly pronounced during "state shifts" in the macro environment—individuals with scattered, local information interact in the market, and their informational fragments combine to form a collective signal.

Differences in Participant Incentive Structures

Institution-level consensus forecasters often operate within complex organizational and reputational systems that systematically deviate from the goal of "purely pursuing predictive accuracy." The career risks faced by professional forecasters create an asymmetric payoff structure—significant forecasting errors incur substantial reputational costs, while even extreme accuracy, especially achieved by deviating substantially from peer consensus, may not yield proportional career rewards.

This asymmetry induces "herding behavior," where forecasters tend to cluster their predictions near the consensus value, even if their private information or model outputs suggest different results. The reason is that within the career system, the cost of "being wrong alone" is often higher than the reward for "being right alone."

In stark contrast, the incentive mechanism faced by prediction market participants directly aligns forecasting accuracy with economic outcomes—accurate predictions mean profits, incorrect predictions mean losses. In this system, reputational factors are almost non-existent; the only cost of deviating from market consensus is economic loss, entirely dependent on the prediction's correctness. This structure imposes stronger selection pressure for predictive accuracy—participants who can systematically identify consensus forecast errors continuously accumulate capital and amplify their influence in the market through larger position sizes;而那些 mechanically follow consensus suffer continuous losses when consensus proves wrong.

During periods of significantly heightened uncertainty, when the career cost for institutional forecasters to deviate from expert consensus is at its peak, this divergence in incentive structures is often most pronounced and economically most significant.

Information Aggregation Efficiency

A noteworthy empirical fact is: even one week before the data release—a timeframe matching the typical window for consensus expectation releases—market predictions still exhibit significant accuracy advantages. This suggests that the market advantage does not stem solely from the often-cited "information speed advantage" of prediction market participants.

Instead, market predictions may more efficiently aggregate information fragments that are too dispersed, too industry-specific, or too ambiguous to be formally incorporated into traditional econometric forecasting frameworks. The relative advantage of prediction markets may lie not in earlier access to public information, but in their ability to synthesize heterogeneous information more effectively within the same timeframe—information that survey-based consensus mechanisms, even with the same time window, often struggle to process efficiently.

Limitations and Caveats

Our findings require an important qualification. Since the overall sample covers only about 30 months, and major shock events are by definition rare, this means statistical power remains limited for larger tail events. A longer time series will enhance future inferential ability, although the current results strongly suggest the superiority and differentiated signal of market predictions.

Conclusion

We document systematic and economically significant outperformance of prediction markets relative to expert consensus expectations, particularly during shock periods when forecasting accuracy is most critical. Market-based CPI predictions exhibited approximately 40% lower error overall, and this error reduction reached about 60% during periods of major structural change.

Based on these findings, several future research directions become particularly important: First, investigating whether "Shock Alpha" events themselves can be predicted using volatility and forecast divergence indicators, across a larger sample size and multiple macroeconomic indicators; Second, determining the liquidity threshold above which prediction markets can stably outperform traditional forecasting methods; Third, exploring the relationship between prediction market forecasts and those implied by high-frequency trading financial instruments.

In an environment where consensus forecasts heavily rely on correlated model assumptions and shared information sets, prediction markets offer an alternative information aggregation mechanism capable of capturing state switches earlier and processing heterogeneous information more efficiently. For entities needing to make decisions in an economic environment characterized by rising structural uncertainty and tail event frequency, "Shock Alpha" may represent not just an incremental improvement in predictive capability, but a fundamental component of a robust risk management infrastructure.

你可能也喜欢

西班牙打击非法盗版平台，查获价值46.7万美元加密货币

西班牙警方在打击非法漫画平台行动中查获价值46.7万美元的加密货币。行动中，警方在阿尔梅里亚逮捕三名嫌疑人，并在一处墙内温度计中发现两个冷钱包，内含约40万欧元。该平台自2014年起运营，通过广告收入非法获利超400万欧元，为西语世界最大盗版漫画分发网站之一。目前尚不清楚警方能成功解锁钱包，因冷钱包需PIN码或助记词才能访问。案件凸显全球执法部门面临的新挑战——非法活动收益正通过加密货币硬件钱包存储。韩国此前曾出现类似案例，2021年查获的22个比特币（时价150万美元）在警方保管期间离奇失踪。本案于2025年6月接获投诉后启动调查，涉案平台通过免费提供盗版漫画吸引流量牟取广告收益。

bitcoinist1小时前

bitcoinist1小时前

被 PayPal 扫地出门，马斯克要用加密市场卷土重来

马斯克旗下社交平台X于2026年4月推出Smart Cashtags功能，上线数天即推动约10亿美元交易量。该功能允许用户直接点击帖文中的股票或加密货币代码查看实时价格与讨论，并支持通过智能合约地址搜索资产，目前在美国和加拿大iOS用户中上线，未来将扩展至Android和网页端。X与加拿大券商Wealthsimple合作，用户可一键跳转交易，实现从信息到交易的闭环体验。此举被视为社交交易（Social Trading）崛起的关键一步，Galaxy Research报告指出加密行业已形成三大“注意力金融化”工具：链上代币（文化传播金融化）、永续合约（杠杆观点金融化）和预测市场（信息优势金融化）。X凭借5-6亿月活用户和加密社群核心地位，有望大幅降低交易门槛，吸引新用户。更重要的布局是即将全面推出的X Money，这是马斯克重返支付战场的关键工具，对标微信支付和支付宝的“通讯+商务”模式。其测试版已提供6%年化收益、3%消费返现、P2P转账等功能，未来可能整合加密货币交易与稳定币结算，构建“超级金融社交App”。然而，X面临监管挑战。美国参议员Warren质疑6%高收益的可持续性及合作银行Cross River Bank的合规记录，同时稳定币的保险漏洞和社交交易可能加剧羊群效应与金融风险娱乐化问题。若X能平衡创新与监管，或将为加密市场注入新资金，成为产业新增长点。

marsbit2小时前

marsbit2小时前

Tether大规模冻结：与美国当局重大行动中锁定3.44亿美元USDT

加密货币巨头Tether宣布应美国财政部外国资产控制办公室（OFAC）及执法部门要求，已冻结两个Tron钱包中价值3.44亿美元的USDT。这些钱包被指控与逃避制裁、犯罪网络等非法活动有关。Tether强调这是其常规合规操作的一部分，并透露已与全球65个国家的340余家执法机构合作，累计协助冻结超44亿美元资产，其中21亿美元涉及美国案件。与此同时，第二大稳定币发行商Circle因处理黑客事件的方式面临质疑。4月初Drift协议遭2.8亿美元黑客攻击后，Circle被指控未及时冻结赃款，导致攻击者通过其跨链协议转移资金。目前Circle已在马萨诸塞州被提起诉讼。而Tether则宣布与Drift协议合作，提供近1.5亿美元资金支持平台重启与用户补偿。（字数：299）

bitcoinist2小时前

bitcoinist2小时前

深圳的撸毛党，都转战港股打新了

深圳作为充满活力的城市，吸引了众多雄心勃勃的人。四十多年前，这里还只是珠江口东岸的稻田和渔村，但四十年间，它发展成为全球电子元器件集散地华强北、全球油画工厂大芬村，并孕育了华为、腾讯等知名企业。这座城市充满了通过信息、流动性和监管不对称实现价差变现的机会。 Arez就是典型的深圳人，三十出头，与朋友在南山开设了一家撸毛工作室。2023年，他通过Arbitrum、Starknet和LayerZero的空投活动为团队赚取了可观收益。鼎盛时期，他管理着上千个链上地址，涉及脚本运行、IP代理、KYC资源和资金轮转等操作。

marsbit3小时前

marsbit3小时前

孙宇晨起诉 WLFI 的文件中，藏着四个离谱细节

加密货币富豪孙宇晨起诉World Liberty Financial（WLFI）高管欺诈、盗窃等行为，诉状中披露了四个关键细节：一、指控WLFI联合创始人Chase Herro有欺诈前科，包括客户曾建立网站揭露其骗局，并有税务违规记录，还涉嫌与爱泼斯坦岛有关联；二、WLFI被指未经代币持有者投票，擅自进行两次“秘密”技术升级，新增可冻结和扣押用户代币的权限；三、双方矛盾部分源于孙宇晨购买价值1亿美元的TRUMP Meme币，虽获特朗普家族成员批准，但引发WLFI不满；四、孙宇晨指控WLFI未注册牌照却从事货币传输业务，涉嫌违反美国联邦及州法律。 WLFI首席执行官回应称诉讼是孙宇晨为转移注意力之举，并强调冻结行为是为保护用户权益。

marsbit3小时前

marsbit3小时前

交易

现货

合约

Kalshi's First Research Report Released: How Collective Intelligence Outperforms Wall Street Think Tanks in Predicting CPI

文章摘要

Overview

Key Highlights

Background

Methodology

Data

Shock Classification

Performance Metrics

Results: CPI Forecasting Performance

Overall Superior Accuracy

"Shock Alpha" Exists

Further Discussion

Market Participant Heterogeneity and "Wisdom of the Crowd"

Differences in Participant Incentive Structures

Information Aggregation Efficiency

Limitations and Caveats

Conclusion

相关问答

你可能也喜欢

西班牙打击非法盗版平台，查获价值46.7万美元加密货币

被 PayPal 扫地出门，马斯克要用加密市场卷土重来

Tether大规模冻结：与美国当局重大行动中锁定3.44亿美元USDT

深圳的撸毛党，都转战港股打新了

孙宇晨起诉 WLFI 的文件中，藏着四个离谱细节

交易

热门文章

如何购买S

Sonic：Andre Cronje主导升级，逆势上涨的Layer1新星

成长学院：学习“ Sonic“ ，瓜分价值 1000 USDT

相关讨论

热门问答

热门分类

热门标签