Your Backtest Is Lying: Why You Must Use Point-in-Time Data

insights.glassnodePublished on 2026-03-13Last updated on 2026-03-13

Abstract

This article warns against a common pitfall in backtesting trading strategies: look-ahead bias caused by using revised historical data. It illustrates this with a hypothetical Bitcoin strategy based on exchange outflows from Binance. The strategy is built on the premise that sustained outflows (when the 5-day moving average of BTC balance falls below the 14-day average) are bullish, while inflows signal a sell-off. An initial backtest using standard, revised data shows the strategy performing comparably to a simple buy-and-hold approach. However, the author argues these results are misleading because the data has been updated with information that wasn't available in real-time. This data mutation creates an unfair advantage in the backtest. To demonstrate, the test is rerun using Point-in-Time (PiT) data—an immutable, append-only record that reflects only what was known on any given day. The results are significantly worse, as the PiT-based strategy misses key profitable moves. The key takeaway is that accurate backtesting requires immutable Point-in-Time data to avoid look-ahead bias and replay history honestly.

Let's build a simple, hypothetical trading strategy. The premise is straightforward and rooted in a widely discussed narrative: when coins leave exchanges, it tends to be bullish. The reasoning is intuitive: coins moving off exchanges typically signal that holders are withdrawing to self-custody, reducing the available supply for selling. Conversely, coins flowing onto exchanges may indicate that holders are preparing to sell.

A single day of outflows, however, is just noise. To identify a genuine trend, we would apply a moving average crossover on the exchange balance. When the short-term average falls below the long-term average, it confirms that coins have been leaving exchanges consistently, as a sustained pattern, rather than isolated events.

Using Glassnode's exchange balance for Binance, we define the following:

Enter the market when the 5-day moving average of Binance's BTC balance falls below its 14-day moving average, signaling a sustained outflow trend.
Exit the market when the 5-day average rises back above the 14-day average, signaling that the outflow trend has reversed and coins are returning to the exchange.

We then benchmark this strategy against simply holding BTC over the same period, starting January 1, 2024 through March 9, 2026, with an initial capital of $1,000 and 0.1% trading fees applied to each trade.

This is a simplified trading strategy, designed primarily for illustrative purposes. It is not investment advice, nor is it meant to suggest that exchange balances are a robust foundation for a trading system.

Here's how to read this chart:

🟫 The brown line at the bottom is the binary trading signal, toggling between in the market (1) and out of the market (0).

🟦 The blue line tracks the strategy's portfolio value over time.

🟩 The green line is the buy-and-hold portfolio benchmark.

We can observe that the exchange balance strategy performed reasonably well, although at times the buy-and-hold strategy outperformed it. In the final days of the research period, however, the exchange balance strategy caught up. While some investors may find the combination of reduced volatility and an ultimately comparable performance to buy-and-hold appealing, the final numbers are misleading – and here’s why.

The Problem: Data Mutation and Look-Ahead Bias

Metrics are not static. Many are retroactively revised as new information becomes available. This is particularly true for metrics that depend on address clustering or entity labeling, such as on-chain exchange balances. However, it is also the case for metrics such as trading volume or price, as individual exchanges can occasionally submit their data with slight delays.

This means that a value you see today for, say, January 15, 2024, may not be the same value that was published on January 15, 2024. The data has been revised with hindsight. When you backtest a strategy on this revised data, you are implicitly using information that was not available at the time the trading decisions would have been made. This introduces a look-ahead bias.

The Honest Backtest: Using Point-in-Time Data

Let's therefore repeat the exact same backtest – same signal logic, same parameters, same dates, same fees – but this time using the Point-in-Time (PiT) variant of the Exchange Balance metric, available in Glassnode Studio.

PiT metrics are strictly append-only and immutable. Each historical data point reflects only the information that was known at the time it was first computed. No retroactive revisions, no look-ahead bias.

While we are using the same metric, the strategy now produces significantly different results, as illustrated by the purple line in the new chart below. The overall performance is notably worse.

Although both strategies behave similarly for much of 2024, we observe that the PiT-based version fails to capture the strong upticks in November 2024 and March 2025 as effectively. As a result, the cumulative performance diverges meaningfully and ends up considerably lower.

Key Takeaway

In this example, the purple strategy, which only has access to information as it was available at the time, performs noticeably worse. ► Backtests will lie if fed with wrong or revised data. Only immutable, Point-in-Time metrics ensure you’re replaying history as it actually happened.

How Many Tokens Away Is Yang Zhilin from the 'Moon Chasing the Light'?

The article explores the intense competition between two leading Chinese AI companies, DeepSeek and Kimi (Moon Dark Side), and the mounting pressure on Yang Zhilin, the founder of Kimi. While DeepSeek re-emerged after 15 months of silence with its powerful V4 model—boasting 1.6 trillion parameters and low-cost, long-context capabilities—Kimi has been focusing on long-context processing and multi-agent systems with its K2.6 model. Yang faces a threefold challenge: technological rivalry, commercialization pressure, and investor expectations. Despite Kimi’s high valuation (reaching $18 billion), its revenue heavily relies on a single product with low paid conversion rates, while DeepSeek’s strategic silence and open-source influence have strengthened its market position and valuation prospects, now targeting over $20 billion. Both companies reflect broader trends in China’s AI ecosystem: Kimi aims for global influence through open-source contributions and agent-based advancements, while DeepSeek prioritizes foundational innovation and hardware independence, notably shifting to Huawei’s chips. Their competition is seen as vital for China’s AI progress, with the gap between top Chinese and U.S. models narrowing to just 2.7% on the Elo rating scale. Ultimately, the article argues that this rivalry, though anxiety-inducing for leaders like Zhilin, is essential for driving innovation and solidifying China’s role in the global AI landscape.

marsbit31m ago

How Many Tokens Away Is Yang Zhilin from the 'Moon Chasing the Light'?

marsbit31m ago

TechFlow Intelligence Bureau: ChatGPT Helps Amateur Mathematician Crack 60-Year-Old Problem, CFTC Sues New York Regulator Over Coinbase and Gemini

An amateur mathematician, with the assistance of ChatGPT, has solved a combinatorial mathematics puzzle originally proposed by Hungarian mathematician Paul Erdős in the 1960s. This marks another milestone in AI-aided mathematical research, demonstrating the evolving capabilities of large language models in formal reasoning. In other AI developments, OpenAI introduced a new privacy filter tool for enterprise API usage, automatically screening sensitive data. Meanwhile, the Qwen3.6-27B model achieved 100 tokens per second on a single RTX 5090 GPU using quantization, significantly lowering the cost barrier for local AI deployment. In crypto and Web3, the U.S. CFTC sued New York’s financial regulator, challenging its oversight of Coinbase and Gemini—a first-of-its-kind federal-state regulatory clash. Following a vulnerability, KelpDAO and major DeFi protocols established a recovery fund. Tether froze $344 million in assets linked to Iran’s central bank upon U.S. Treasury request, highlighting the centralized control risks in stablecoins. Separately, Litecoin underwent a 3-hour chain reorganization to undo a privacy-layer exploit. In the U.S., former President Trump invoked the Defense Production Act to address power grid bottlenecks affecting AI data centers and dismissed the entire National Science Board, raising concerns over research independence. A retail trader gained 250% on a $600k Intel options bet amid AI-related speculation. Xiaomi announced its first performance electric vehicle, targeting rivals like Tesla. Meanwhile, iPhone users reported devices automatically reinstalling a hidden app daily, suspected to be MDM-related. A Chinese securities report noted that A-share institutional crowding has reached its second-longest streak since 2007, signaling high valuations and potential style rotation. The day’s developments reflect a dual narrative: AI is enabling unprecedented individual breakthroughs, while centralized power structures—whether governmental or corporate—are becoming more assertive, underscoring that decentralization is as much a political-economic challenge as a technical one.

marsbit53m ago

TechFlow Intelligence Bureau: ChatGPT Helps Amateur Mathematician Crack 60-Year-Old Problem, CFTC Sues New York Regulator Over Coinbase and Gemini

marsbit53m ago

Iran’s Crypto Lifeline Hit As US Freezes $344 Million In Funds

The US Treasury has frozen $344 million in cryptocurrency tied to Iranian military and political groups, specifically targeting two Tron blockchain wallets connected to the Islamic Revolutionary Guard Corps and Hizballah. This action follows reports that Iran was collecting Bitcoin payments from ships for safe passage through the Strait of Hormuz. The freeze occurred just one day after stablecoin issuer Tether locked the same amount of funds at the request of US law enforcement. The move is part of broader US efforts to disrupt Iran's ability to generate and move funds, highlighting the limitations of using cryptocurrency as a sanctions workaround when centralized issuers comply with enforcement actions.

bitcoinist1h ago

Iran’s Crypto Lifeline Hit As US Freezes $344 Million In Funds

bitcoinist1h ago

Amidst the Volatility in the Crypto Market, Who is Buying Against the Trend?

"In a volatile crypto market downturn of Q1 2026, institutional players demonstrated divergent strategies. While Bitcoin fell over 25% and Ethereum dropped 35%, key institutions accumulated: corporate treasuries like Strategy (MicroStrategy) added over $10 billion in BTC, sovereign wealth funds like Mubadala increased IBIT holdings by 46%, and major banks launched crypto services and ETFs. Notably, 26 new crypto ETFs were filed or launched under streamlined SEC rules, including bank-led products from Morgan Stanley and BlackRock’s staking ETH ETF. VC funding saw concentration: total investment reached ~$5 billion, but deal count plummeted 49%. Three mega-deals—BVNK ($1.8B), Kalshi ($1B), and Polymarket ($600M)—accounted for half the quarter’s funding. Investment shifted from DeFi/NFTs to payments, prediction markets, and regulated CeFi infrastructure. Selling pressure came from hedge funds (e.g., Brevan Howard cut IBIT by 85%) and Bitcoin miners liquidating holdings. The U.S. strategic Bitcoin reserve saw zero additions, highlighting that real buying came from corporations and sovereign funds, not governments. The institutional crypto landscape is bifurcating: long-term holders accumulate while short-term traders exit, signaling a foundation for the next cycle."

marsbit2h ago

Amidst the Volatility in the Crypto Market, Who is Buying Against the Trend?

marsbit2h ago

Hands-on with Hunyuan Hy3 Preview: Tencent's AI, Finally Competitive?

Tencent's Hunyuan AI team has released its latest language model, Hy3 preview, marking a significant step forward for the company's AI capabilities. With 295B total parameters and support for 256K context length, the model employs a mixture-of-experts architecture. It shows improvements in complex logic, instruction following, contextual learning, code generation, and agent task execution. In testing, Hy3 preview demonstrated strong performance in multi-step logical reasoning but showed occasional instability in identifying traps in trick questions. It performed well in extracting key information from disordered meeting transcripts and accurately followed new linguistic rules. As an AI agent, it successfully built functional applications like a Snake game and generated data analysis dashboards, though it sometimes fell short in fully completing complex open-ended tasks. In natural language use, it produced coherent and stylistically appropriate narratives with reduced “AI-like” tone. Priced competitively, Hy3 preview is already integrated into Tencent’s key products, including Tencent Cloud and WorkBuddy. While not leading in every benchmark, it represents a solid, practical model that signals Tencent’s renewed momentum in AI development.

marsbit4h ago

Hands-on with Hunyuan Hy3 Preview: Tencent's AI, Finally Competitive?