Your Backtest Is Lying: Why You Must Use Point-in-Time Data

insights.glassnodeОпубликовано 2026-03-13Обновлено 2026-03-13

Введение

This article warns against a common pitfall in backtesting trading strategies: look-ahead bias caused by using revised historical data. It illustrates this with a hypothetical Bitcoin strategy based on exchange outflows from Binance. The strategy is built on the premise that sustained outflows (when the 5-day moving average of BTC balance falls below the 14-day average) are bullish, while inflows signal a sell-off. An initial backtest using standard, revised data shows the strategy performing comparably to a simple buy-and-hold approach. However, the author argues these results are misleading because the data has been updated with information that wasn't available in real-time. This data mutation creates an unfair advantage in the backtest. To demonstrate, the test is rerun using Point-in-Time (PiT) data—an immutable, append-only record that reflects only what was known on any given day. The results are significantly worse, as the PiT-based strategy misses key profitable moves. The key takeaway is that accurate backtesting requires immutable Point-in-Time data to avoid look-ahead bias and replay history honestly.

Let's build a simple, hypothetical trading strategy. The premise is straightforward and rooted in a widely discussed narrative: when coins leave exchanges, it tends to be bullish. The reasoning is intuitive: coins moving off exchanges typically signal that holders are withdrawing to self-custody, reducing the available supply for selling. Conversely, coins flowing onto exchanges may indicate that holders are preparing to sell.

A single day of outflows, however, is just noise. To identify a genuine trend, we would apply a moving average crossover on the exchange balance. When the short-term average falls below the long-term average, it confirms that coins have been leaving exchanges consistently, as a sustained pattern, rather than isolated events.

Using Glassnode's exchange balance for Binance, we define the following:

  • Enter the market when the 5-day moving average of Binance's BTC balance falls below its 14-day moving average, signaling a sustained outflow trend.
  • Exit the market when the 5-day average rises back above the 14-day average, signaling that the outflow trend has reversed and coins are returning to the exchange.

We then benchmark this strategy against simply holding BTC over the same period, starting January 1, 2024 through March 9, 2026, with an initial capital of $1,000 and 0.1% trading fees applied to each trade.

This is a simplified trading strategy, designed primarily for illustrative purposes. It is not investment advice, nor is it meant to suggest that exchange balances are a robust foundation for a trading system.
Access live chart

Here's how to read this chart:

🟫 The brown line at the bottom is the binary trading signal, toggling between in the market (1) and out of the market (0).

🟦 The blue line tracks the strategy's portfolio value over time.

🟩 The green line is the buy-and-hold portfolio benchmark.

We can observe that the exchange balance strategy performed reasonably well, although at times the buy-and-hold strategy outperformed it. In the final days of the research period, however, the exchange balance strategy caught up. While some investors may find the combination of reduced volatility and an ultimately comparable performance to buy-and-hold appealing, the final numbers are misleading – and here’s why.

The Problem: Data Mutation and Look-Ahead Bias

Metrics are not static. Many are retroactively revised as new information becomes available. This is particularly true for metrics that depend on address clustering or entity labeling, such as on-chain exchange balances. However, it is also the case for metrics such as trading volume or price, as individual exchanges can occasionally submit their data with slight delays.

This means that a value you see today for, say, January 15, 2024, may not be the same value that was published on January 15, 2024. The data has been revised with hindsight. When you backtest a strategy on this revised data, you are implicitly using information that was not available at the time the trading decisions would have been made. This introduces a look-ahead bias.

The Honest Backtest: Using Point-in-Time Data

Let's therefore repeat the exact same backtest – same signal logic, same parameters, same dates, same fees – but this time using the Point-in-Time (PiT) variant of the Exchange Balance metric, available in Glassnode Studio.

PiT metrics are strictly append-only and immutable. Each historical data point reflects only the information that was known at the time it was first computed. No retroactive revisions, no look-ahead bias.

While we are using the same metric, the strategy now produces significantly different results, as illustrated by the purple line in the new chart below. The overall performance is notably worse.

Although both strategies behave similarly for much of 2024, we observe that the PiT-based version fails to capture the strong upticks in November 2024 and March 2025 as effectively. As a result, the cumulative performance diverges meaningfully and ends up considerably lower.

Access live chart

Key Takeaway

In this example, the purple strategy, which only has access to information as it was available at the time, performs noticeably worse. ► Backtests will lie if fed with wrong or revised data. Only immutable, Point-in-Time metrics ensure you’re replaying history as it actually happened.

Связанные с этим вопросы

QWhat is the main problem with using revised data for backtesting a trading strategy?

AThe main problem is that it introduces look-ahead bias, as the revised data includes information that was not available at the time the trading decisions would have been made.

QHow does the Point-in-Time (PiT) data differ from the standard exchange balance metric?

APoint-in-Time data is strictly append-only and immutable, meaning each historical data point reflects only the information known at the time it was first computed, with no retroactive revisions.

QWhat was the trading signal used in the hypothetical strategy based on exchange balances?

AThe strategy entered the market when the 5-day moving average of Binance's BTC balance fell below its 14-day moving average, and exited when the 5-day average rose back above the 14-day average.

QWhy did the backtest using Point-in-Time data perform worse than the one using revised data?

AThe PiT-based strategy failed to capture strong market upticks as effectively because it only had access to information available in real-time, without the benefit of hindsight revisions.

QWhat is the key takeaway from the article regarding backtesting and data quality?

ABacktests will produce misleading results if fed with revised data; only immutable, Point-in-Time metrics ensure an accurate replay of history as it actually happened.

Похожее

Blocked Its Own Treasure, WeChat AI Steps Up

Tencent's stock surged over 10% on June 2nd amid reports that WeChat, with 1.43 billion monthly users, is finalizing tests for a native AI Agent. The reported feature, accessible by swiping right from the main interface, allows users to issue commands in natural language. The AI then decomposes tasks and automatically calls upon relevant Mini Programs within WeChat to complete actions like ordering food, booking tickets, or making payments, creating a closed-loop service execution system. This strategic shift follows the internal conflict and subsequent "blocking" of Tencent's standalone AI app, Yuanbao, by WeChat for violating sharing rules during a 2026 Spring Festival promotion. The incident highlighted a lack of internal consensus and exposed the weakness of competing in the standalone AI assistant arena against rivals like ByteDance's Doubao (345M MAU) and Alibaba's Qianwen. The new WeChat AI Agent aims to leverage WeChat's unique assets—its massive user base, standardized Mini Program APIs, WeChat Pay, and identity system—to move from simple content generation to actual task execution. Analysts note this changes the competitive landscape from model benchmarks to which AI can connect to more real-world services. However, success depends on key variables: the capability of Tencent's underlying Hunyuan model, managing massive inference costs, and redesigning incentives for Mini Program developers whose traffic might be bypassed. The move is seen as an attempt to keep user service intent within WeChat's ecosystem as AI begins to redefine how users access services.

marsbit55 мин. назад

Blocked Its Own Treasure, WeChat AI Steps Up

marsbit55 мин. назад

ByteDance Adopts Arm CPUs, Jensen Huang: So Sad I Didn't Buy Arm

**Summary:** At Computex 2026, Arm CEO Rene Haas announced that ByteDance and Oracle have adopted Arm's self-designed Arm AGI data center CPU. The company expects significant revenue growth from this product, projecting $20 billion in demand for the 2027/2028 fiscal years. Haas noted that restricting AI-capable CPUs from the US to China is nearly impossible due to their widespread applications. Arm's stock has surged dramatically this year, notably rising 16% after NVIDIA's Arm-based Vera CPU and RTX Spark announcements. A highlight was the informal, humorous on-stage conversation between Haas and NVIDIA CEO Jensen Huang. Huang joked about NVIDIA's failed attempt to acquire Arm and playfully lamented selling his Arm shares. Both executives showed a clear sense of camaraderie and shared regret over the missed merger. Key technical topics were discussed: 1. **AI PC Design:** Huang explained NVIDIA's RTX Spark superchip (with a 20-core Arm CPU) is designed for future AI agents that will autonomously run and use tools on PCs, blending local and cloud processing. 2. **Agent vs. OS:** Huang emphasized the operating system remains crucial, as AI agents rely on its APIs and tools to function. 3. **Growth Constraints:** He identified the shift to "useful AI" that generates profitable tokens as a primary driver for immense, almost limitless, computational demand. Haas outlined Arm's strategy across PC and data centers. For PCs, Arm collaborates with partners like NVIDIA and MediaTek, offering its compute subsystem (CSS) for custom SoCs. In data centers, its Arm AGI CPU (built on TSMC's 3nm process) has gained major partners including OpenAI, Meta, and now ByteDance and Oracle. Arm presented a multi-year roadmap for its in-house CPU line. The article concludes that while GPUs dominated the AI training race, the explosion of AI agents is shifting significant focus to CPUs for inference, state management, and tool orchestration. The industry is trending towards vertical integration, with companies like cloud providers designing chips and chip/IP firms offering full solutions, all competing to deliver more efficient computing per watt.

marsbit1 ч. назад

ByteDance Adopts Arm CPUs, Jensen Huang: So Sad I Didn't Buy Arm

marsbit1 ч. назад

New Wall Street Play: Yen Shorts Still Adding, But Japan Stocks Don't Rely on Carry Trade Unwinding

On June 3rd, USD/JPY hit 160.44, its highest level since July 2024, while the Nikkei 225 surged past 68,000 points. Contrary to popular narratives of an imminent "carry trade unwind" akin to August 2024, data reveals a more complex picture. Speculative net short positions in yen futures have actually increased, reaching -114,667 contracts by late May, suggesting traders are doubling down rather than retreating. Meanwhile, Japan's Finance Ministry conducted its largest-ever single-round FX intervention (11.73 trillion yen) in April-May but failed to hold the 160 yen line. The Nikkei's rally is not driven by carry trade dynamics. Foreign investors are aggressively buying Japanese stocks, with net purchases in 2026 running nearly 16 times higher than 2025 levels. This inflow is concentrated in AI and semiconductor-related stocks like SoftBank and Socionext, fueled by positive sector outlooks, rather than being a flight from unwinding yen shorts. Furthermore, the Nikkei has continued climbing despite the Bank of Japan's (BOJ) rate hikes to 0.75%. This disconnect exists because the current equity boom is fueled by AI-driven foreign investment, not reliant on cheap yen funding. However, this relationship remains fragile. Should the BOJ hike rates further (e.g., to 1.0%) while dollar weakness increases carry trade costs, the trajectories of the yen and Japanese stocks could reconverge, potentially triggering volatility.

marsbit1 ч. назад

New Wall Street Play: Yen Shorts Still Adding, But Japan Stocks Don't Rely on Carry Trade Unwinding

marsbit1 ч. назад

Broadcom's Q3 Guidance Misses Expectations by $12 Billion, After-Hours Trading Plummets Over 13%, AI Narrative "Cooling"?

On June 3, Broadcom released record Q2 FY26 results with revenue of $22.19B, up 48% YoY, and AI chip sales of $10.8B, up 143%. Adjusted EPS of $2.44 beat estimates. However, its Q3 AI semiconductor revenue guidance of $16B, while up over 200% YoY, fell roughly $1.2B (7%) short of analyst consensus expectations of $17.2B. This miss, coupled with slightly weaker-than-expected software revenue, triggered a severe market reaction. CEO Hock Tan maintained the FY26 AI revenue outlook of over $100B but did not raise it, disappointing investors who had priced in more robust growth. The stock plummeted over 13% in after-hours trading, erasing roughly $270B in market cap. The sell-off extended to peers like Marvell. A key concern for markets, particularly for Chinese optical module suppliers, was Tan's comment that the contribution of AI networking (e.g., Ethernet switches, optical interconnect chips) to AI revenue, currently near 40%, is expected to normalize to around 30% over time, signaling a potential peak in growth for that segment. Despite the guidance shortfall, Tan reiterated that AI demand remains "insatiable" and reaffirmed the long-term target of exceeding $100B in AI revenue by FY27. The reaction highlights the heightened sensitivity and premium valuation placed on AI-exposed stocks, where anything less than stellar guidance can prompt significant profit-taking. The broader question is whether this represents a cooling AI narrative or a correction in overstretched valuations.

marsbit1 ч. назад

Broadcom's Q3 Guidance Misses Expectations by $12 Billion, After-Hours Trading Plummets Over 13%, AI Narrative "Cooling"?

marsbit1 ч. назад

Торговля

Спот
Фьючерсы
活动图片