More Accurate Than Polls, More Dangerous Than Imagined: Prediction Markets in the Eyes of the Fed

比推Опубліковано о 2026-02-24Востаннє оновлено о 2026-02-24

Анотація

The Federal Reserve is exploring the use of prediction markets, particularly Kalshi, as a real-time tool for policy insights. A Fed-affiliated working paper found that Kalshi’s predictions for core CPI and unemployment are statistically comparable to—and sometimes more accurate than—Bloomberg consensus estimates. Prediction markets aggregate real-money, belief-backed trading, offering frequent updates and capturing nuanced shifts that traditional surveys miss. For instance, Kalshi priced inflation uncertainty in real time during a trade policy scare—a dynamic monthly surveys couldn’t reflect. While these markets provide valuable signals, they also carry risks. Prices reflect both expectations and risk preferences, and heavy reliance on sports betting for liquidity makes macroeconomic markets vulnerable to regulatory changes. If sports betting is restricted, liquidity could dry up, increasing manipulation risks. Moreover, if the Fed openly uses prediction markets, it could create a feedback loop where traders manipulate smaller markets like Kalshi to influence broader policy communication and traditional financial instruments. Despite these concerns, prediction markets offer a uniquely timely and distributed form of expectation aggregation—especially for events like FOMC meetings, where informed participants trade with real stakes. The Fed should require open data transparency to mitigate manipulation and carefully weigh the signal against the noise.

Source: The Token Dispatch

Author: Prathik Desai

Original Title: The Signal and the Noise


“Forecasts usually tell us more of the forecaster than of the future.”

— Warren Buffett

Money filters out empty talk. Supporters believe this is precisely why prediction markets are reliable. We saw people accurately predict the outcome of the 2024 U.S. presidential election on Polymarket and Kalshi. However, prediction markets themselves are not new, and their success in predicting political outcomes is not the first of its kind.

In October 1988, a group of economists at the University of Iowa supported their academic curiosity with a small, real-money prediction market. They launched a presidential election futures market where participants could buy contracts: if George H. W. Bush won, the contract paid $1; if Michael Dukakis won, it paid $0. On the eve of the election, Bush's contracts traded at 53 cents, while traditional polls suggested a close race. Ultimately, Bush won with 53.4% of the vote and a solid 8-point margin.

Since that academic experiment, these real-money futures markets have outperformed traditional polls in every election predicted more than 100 days in advance. In U.S. presidential elections since 1988, prediction markets have been closer to the final result than polls 74% of the time.

This success stems from a mechanism that forces people to express genuine beliefs backed by real money, something surveys can never achieve. Those who truly believed Bush would win bought and held contracts. For random participants, there is no incentive to spend $50 to support a claim they themselves do not believe. When this behavior aggregates thousands of traders, information converges into a price that reflects the true beliefs of a broad group, rather than a small, disproportionate sample.

That small academic experiment in Iowa, run on a shoestring budget, has now evolved into an institutionalized infrastructure.

Last week, a working paper authored by economists affiliated with the Federal Reserve noted that Kalshi, the largest regulated prediction market in the U.S., could serve as a valuable real-time benchmark for policymakers. The same week, New York Stock Exchange (NYSE) President Lynn Martin stated that Polymarket, the world's highest-volume prediction market, moved S&P index futures on election night by pricing Donald Trump's victory earlier than any news organization. Subsequently, Kalshi announced a partnership with a trading platform that handles $2.6 trillion in daily institutional volume.

In today's in-depth analysis, I will explore whether prediction markets can serve as reliable barometers for policymaking and what risks they bring.

Prediction Markets as Policymaking Tools

The paper found that Kalshi's predictions are statistically similar to Bloomberg's consensus expectations, with nearly identical prediction errors for core CPI and unemployment rates. In fact, the paper also found that Kalshi's predictions for core CPI were significantly better than Bloomberg's estimates.

@FederalReserve

Despite similar statistical performance, Kalshi's uniqueness lies in its ability to provide more frequent, real-time probability curve updates for macroeconomic indicators such as GDP growth, core CPI, and unemployment rates. For estimates like inflation, the Bloomberg consensus is only available in the months leading up to the data release. This makes traditional estimates less frequent, with longer gaps that fail to reflect real-time expectation updates.

Kalshi not only provides predictions of outcomes but also real-time uncertainty ranges and tail risks. In early April 2025, uncertainty about trade policy temporarily raised inflation expectations. Although this uncertainty did not materialize, Kalshi priced this dynamic in real time. Monthly Bloomberg estimates could never capture this nuance.

@FederalReserve

Today, when Federal Reserve governors speak at Federal Open Market Committee (FOMC) meetings, Kalshi's market odds move in real time. They price every word from the governors, providing policymakers with a perspective on how traders interpret the expected information.

For example, when Christopher J. Waller made dovish remarks ahead of the July 2025 FOMC meeting, the probability of no rate cut fell to 75%. After the stronger-than-expected June jobs report, this probability quickly rebounded to over 90%. The entire expectation of traders, backed by real money, is presented to policymakers in a way no other tool can currently achieve.

Who Trades on These Markets?

Before deciding how much to trust prediction markets, it is important to examine who is trading and what the volume represents.

Between September 2024 and January 2026, volume on Polymarket for FOMC meetings grew 11-fold, from $59 million to $660 million. In total, Polymarket's FOMC markets processed $2.6 billion, surpassing the combined total of the platform's culture, economy, geopolitics, and science categories.

So, who is trading such large amounts on FOMC meetings? While it is difficult to pinpoint on anonymous prediction platforms like Polymarket, we can speculate: it is hard not to think of macro hedge fund analysts involved in drafting labor statistics reports, or money market fund managers who stand to gain if rates are not cut.

Why them? The Iowa market worked because the number of people who put their money where their mouth is, with reliable information, outweighed those who merely gambling without it. While acknowledging the risk of over-assumption, I believe that when real stakes and funds of this scale are involved, those with reliable information converge on the market, leading to more accurate price discovery.

What to Be Wary Of

All this does not mean prediction markets can be perfect measuring tools for policymakers.

Probabilities in prediction markets also reflect traders' risk preferences. They are not a raw reflection of outcome expectations. For example, when Kalshi prices the probability of unfavorable CPI data at 15% while traditional surveys price it at 10%, this gap can be explained by two factors:

  1. Prediction markets may be pricing real-time information missed by the Bloomberg consensus.

  2. Traders may be paying a premium on prediction markets to hedge against unfavorable outcomes.

Policymakers must understand what this gap reflects before treating this information as a signal for policymaking.

While Kalshi's macroeconomic signals to policymakers seem reliable, over 85% of its total nominal volume comes from the sports category.

@Dune

Currently, at least 20 federal lawsuits are challenging the regulatory arbitrage achieved through nationwide sports betting for prediction markets.

The reliability of Kalshi's FOMC markets is partly due to sports betting providing the platform with foundational liquidity through active traders, narrow bid-ask spreads, and market-making infrastructure, which sustains all Kalshi markets. Although macro markets operate independently, they benefit from this foundation. If sports betting disappears under regulatory pressure, the platform will lose the liquidity engine that keeps spreads tight and prices continuous. Thinner macro markets become easier to move with less capital, making them more susceptible to manipulation.

The Fed's paper recommends using Kalshi as a monitoring tool, not a decision-making input. But making this intention public is itself problematic.

The authors suggest greater use of Kalshi to interpret incoming data and check real-time interpretations of Fed communications. However, because the intention to reference prediction markets is public, it could create a feedback loop.

For example, a policymaker at the Fed might see Kalshi pricing a 15% probability of a rate cut, lower than the expectation they wish to convey through their actions. In response, they might soften their rhetoric in the next speech, which could then cause fluctuations in global traditional interest rate markets. The problem here is that while Kalshi's FOMC market size is $660 million, the federal funds futures market is worth hundreds of billions of dollars. The former requires relatively small positions to change the odds. A well-funded participant, aware that moving Kalshi could influence subsequent Fed remarks (if not decisions), could use relatively small positions to move a much larger market. Policymakers' communications could become targets for manipulation.

This scenario highlights the difference between the 1988 Iowa futures market and the 2026 prediction markets. The Iowa economists back then simply wanted to determine if a market with real stakes could produce better predictions than surveys. At that time, policymaking was not under such close scrutiny to deter manipulators.

Back then, prices reflected true beliefs because those prices did not influence the world. They merely allowed those with insights to monetize them. Once the Fed publicly announces (if it does so) its intention to use prediction markets as a policy input, this attribute disappears. It also introduces a "performative" element to trading.

However, incorporating prediction market odds into the policy toolbox is not a misstep. Financial commitment still filters out empty talk. Informed participants continue to dominate price discovery. The result is a signal unmatched by surveys in terms of speed and distribution richness. For FOMC markets, this is more pronounced than any other application: there are participants on both sides with genuine hedging capabilities, and the market, by pricing real-time events frequently, better reflects real-time expectations.

Policymakers should mandate open-source data transparency as a prerequisite for formal adoption. If the data is not auditable, manipulation may go undetected. They should understand that both signal and noise come from the same place. People with real money and real beliefs can tell you what they think in real time.

For those powerful enough to game the system, this window of opportunity did not exist when the Iowa economists were conducting their academic experiment decades ago. Today, this window is wider than ever.


Twitter:https://twitter.com/BitpushNewsCN

Bitpush TG Discussion Group:https://t.me/BitPushCommunity

Bitpush TG Subscription: https://t.me/bitpush

Original link:https://www.bitpush.news/articles/7614191

Пов'язані питання

QAccording to the Federal Reserve working paper, how does the predictive accuracy of Kalshi compare to Bloomberg's consensus forecasts for core CPI and unemployment?

AThe paper found that Kalshi's predictions were statistically similar to Bloomberg's consensus expectations, with nearly identical prediction errors for core CPI and unemployment. In fact, Kalshi's forecasts for core CPI were significantly better than Bloomberg's estimates.

QWhat key advantage does the article highlight for Kalshi over traditional survey-based forecasts like Bloomberg's?

AKalshi provides more frequent, real-time updates to its probability curves for macroeconomic indicators, offering real-time uncertainty ranges and tail risk assessments that traditional surveys, which are only available months before data releases, cannot capture.

QWhat is the primary risk identified if the Federal Reserve were to publicly state its intention to use prediction markets like Kalshi as a policy input?

AIt could create a feedback loop and make policy communication a target for manipulation. A well-funded participant could use a relatively small position to move the odds on Kalshi, potentially influencing subsequent Fed communication and causing volatility in the much larger traditional interest rate markets.

QWhat does the article identify as the main reason for the historical success of prediction markets over traditional polls in forecasting election outcomes?

AThe mechanism forces people to express true beliefs backed by real money, which a survey questionnaire can never do. This aggregates information from a large crowd into a price that reflects genuine collective belief rather than a small, disproportionate sample.

QWhat major category currently provides the vast majority of the nominal trading volume on the Kalshi platform, and why is this a potential concern?

ASports betting accounts for over 85% of Kalshi's total nominal volume. This is a concern because the liquidity from sports trading supports the platform's infrastructure. If sports betting faces regulatory pressure and declines, the macro markets could lose liquidity, becoming thinner and more susceptible to manipulation.

Пов'язані матеріали

GPT-5.6 Countdown: Abandon the Illusion of a Single API, Computational Iteration Can't Outpace a Single Page of Compliance

In mid-June, three seemingly independent industry events—the compliance-driven throttling of Fable 5, the open-sourcing of GLM-5.2, and the leaked release timeline for GPT-5.6—are pushing the global AI industry toward a watershed moment. These shifts signal a fundamental restructuring of the industry's underlying logic. First, **"usability" has substantially overtaken "advanced capabilities"** as the primary weight, pushing the global large language model (LLM) supply chain into a "dual-track" phase of controlled closed-source and local open-source coexistence. Second, **the competitive moats of closed-source giants are shifting**. Their technical focus is moving from "language intelligence" toward "spatial intelligence (world models)"—a domain heavily reliant on computing power. Third, faced with常态化 transnational compliance risks, **a "model-agnostic" decoupled design has become a survival necessity for application-layer developers to maintain business continuity.** The article details how Anthropic's Fable 5, despite its advanced engineering feats, was restricted for non-U.S. citizens within 72 hours of launch, highlighting how geopolitical compliance can instantly limit even the most advanced models. In response, the open-source camp, exemplified by Zhipu AI's MIT-licensed GLM-5.2, is gaining market share by offering stable performance improvements and significant cost advantages (up to 70% savings for enterprises), while achieving full adaptation with domestic semiconductor platforms. Meanwhile, closed-source leaders like OpenAI are pivoting. The anticipated GPT-5.6 reportedly shifts focus from language to spatial intelligence and world models, aiming to rebuild a generational gap in areas like 3D understanding, simulation, and industrial design that demand immense compute. The core conclusion is that the LLM supply chain's logic has changed. Enterprises must now evaluate infrastructure based on a composite of technical performance and policy compliance. For developers, complete reliance on a single closed-source API poses unacceptable risk. Implementing a truly model-agnostic architecture—enabling swift switches to compliant, locally deployable open-source alternatives—is no longer just good practice but a fundamental baseline for business continuity.

marsbit1 год тому

GPT-5.6 Countdown: Abandon the Illusion of a Single API, Computational Iteration Can't Outpace a Single Page of Compliance

marsbit1 год тому

Is the 'Token Subsidy War' Among AI Giants Almost Over?

The article discusses the ongoing "token subsidy war" among AI giants like OpenAI and Anthropic, questioning whether it's nearing its end. It reveals that current AI subscription prices are heavily subsidized, with some plans offering tokens at up to 70 times the actual cost to attract and retain heavy users, especially developers and enterprises. This strategy mirrors past internet-era subsidy battles, but with a key difference: AI tokens lack "lock-in" effects. Unlike ride-hailing or food delivery apps, users can easily switch between AI providers as APIs become standardized, making it difficult for companies to raise prices post-subsidy. The piece highlights a structural asymmetry in the competition. Giants like Google, with massive advertising revenue, can afford to subsidize tokens indefinitely, akin to using "tokens as a weapon." In contrast, venture-backed companies like OpenAI and Anthropic face pressure to become profitable, especially as they approach IPO. The article cites Google Ventures founder Bill Maris, who suggests Google could slash token prices by 80%, putting immense pressure on competitors. Two potential endgames are presented: the "internet service" model (subsidize, monopolize, then raise prices) and the "utility" model (tokens become a standardized, low-margin commodity like electricity). Given the low switching costs, the latter seems more likely. The competition may not have a single winner but could instead accelerate AI's evolution into a foundational, infrastructure-level technology, akin to a public utility. For now, users continue to benefit from heavily subsidized token costs.

marsbit1 год тому

Is the 'Token Subsidy War' Among AI Giants Almost Over?

marsbit1 год тому

Beyond the Stadium: The Profitable Games Surrounding the World Cup

"Beyond the Pitch: The Profit Game Around the World Cup" The FIFA World Cup transcends being a sporting spectacle, evolving into a massive global arena for speculation and profit-seeking. The 2026 tournament has amplified this dynamic, creating a multi-layered ecosystem of financial opportunism alongside the football. **Prediction markets** have surged into the mainstream. Platforms like Polymarket and Kalshi saw trading volumes for World Cup contracts soar, attracting new users with their financial trading model and high-profile, chain-based wealth stories that overshadow traditional sports betting in terms of growth and narrative. However, **traditional sportsbooks** remain the dominant force, leveraging established user habits, legal markets, and comprehensive product offerings to handle the vast majority of speculative wagers, with projections suggesting record-breaking betting volumes. Capital markets also react. **"Concept stocks"** in countries like South Korea and Japan experience volatile price swings based on team performance and anticipated fan spending on items like chicken, beer, and viewing parties, effectively becoming a stock market reflecting fan sentiment. The **ticket resale market** has become a sophisticated arena for arbitrage. Prices fluctuate wildly based on team draws and star power, with sellers sometimes listing tickets they don't yet own in a practice akin to short-selling, while FIFA's own "Right to Buy" tokens add another layer of speculative trading. **Collectibles and merchandise** offer another avenue. Panini sticker albums, with their inherent scarcity and nostalgic value, can become high-value collectibles. Limited-edition or locally themed jerseys command significant premiums on secondary markets, and even counterfeit vendors profit from fans' desire for affordable match-day identity. The **cryptocurrency** space has seen a frenzy of speculative, unauthorized World Cup-themed meme coins on chains like Solana. These tokens, often exploiting team names and player imagery, experience extreme pump-and-dump cycles, creating stories of massive gains for a few early entrants and steep losses for many others. Finally, an entire industry thrives on **providing information and tools** to other speculators. Developers create platforms like SeatSidekick to track ticket inventory and prices, while paid Telegram groups and subscriptions sell betting tips and predictions, monetizing the widespread desire for an informational edge. In essence, the World Cup has become a compressed, global laboratory for speculation. While the games determine champions on the field, a parallel, complex network of financial transactions—spanning prediction contracts, bets, stocks, tickets, collectibles, crypto, and information services—settles its own scores in the global market.

marsbit2 год тому

Beyond the Stadium: The Profitable Games Surrounding the World Cup

marsbit2 год тому

How Does Codex Use a Computer? Three Entry Points and Permission Boundaries

This article explains the three primary methods for Codex to interact with a computer, each with distinct use cases, permission boundaries, and trust levels. **1. Computer Use:** This offers the broadest access, allowing Codex to visually control and interact with the graphical user interface of authorized macOS/Windows apps, system settings, and even iOS simulators. It's ideal for tasks lacking APIs or structured tools, such as operating legacy software or multi-app workflows. However, it's the slowest method and has the widest permission scope, requiring careful supervision for sensitive actions. **2. Chrome Extension:** This grants Codex access to the user's logged-in Chrome browser state, including cookies, profiles, and open tabs. It's best for tasks requiring user identity across websites like Gmail, LinkedIn, Salesforce, or internal dashboards. Its key advantage is multi-tab control for complex workflows. While more powerful for browser-based tasks than Computer Use, it carries higher sensitivity as actions are performed under the user's identity. **3. In-App Browser:** This is a browser isolated within the Codex thread, separate from the user's personal browsing data. It excels in web development and debugging scenarios—previewing local servers, testing responsive layouts, or annotating designs directly on the page. Its isolation is a strength for development but a limitation for tasks requiring login sessions. The core principle is to choose the narrowest, safest, and most structured interface for the task. Use plugins or MCPs first, resort to visual control (Computer Use) only for GUI-dependent tasks, employ the Chrome extension for identity-reliant browser work, and prefer the In-App Browser for isolated development. **Appshots** are clarified as a fourth, complementary tool for *inputting* context—capturing a screenshot of a window to point Codex to something—rather than a method for Codex to *act*. Together, this layered approach highlights a key to AI agent productization: not granting unlimited permissions, but constraining them within clear boundaries for specific tasks while preserving user oversight.

marsbit3 год тому

How Does Codex Use a Computer? Three Entry Points and Permission Boundaries

marsbit3 год тому

Торгівля

Спот
Ф'ючерси
活动图片