From 'Address Clustering' to 'Evidence Standards': Why is Chainalysis Redefining Blockchain Tracing?

marsbit · Published 2026-07-01 · Updated 2026-07-01

**Summary:** In June 2026, Chainalysis introduced the **Blockchain Tracing Ontology (BTO)**, a proposed data framework aiming to establish standardized, transparent, and verifiable models for blockchain analysis. This initiative addresses a core industry issue: despite public blockchain data, different firms often produce inconsistent results (e.g., differing entity labels for the same address) due to non-uniform methodologies, particularly in **address clustering**. This lack of standardization poses challenges for judicial investigations, AML, and enforcement. The BTO is not a new clustering algorithm but a **common "language" or conceptual framework**. It moves beyond the simplistic "cluster" model by introducing a hierarchical structure: **Entity → Wallet → Wallet Segment → Address**, which better reflects complex organizational wallet management. A key shift is from presenting mere results to ensuring **process trust and explainability**. The framework emphasizes documenting the **Evidence** and **Confidence** behind each analytical claim—specifying the on-chain/off-chain data, rules applied, and certainty levels—enabling third-party verification. This focus is partly informed by legal precedents like the **Bitcoin Fog** case, where Chainalysis's methods underwent rigorous judicial scrutiny (Daubert hearing), highlighting the need for reproducible, scientifically sound analysis. The proposal clarifies that on-chain analysis identifies address relationships and flow p...

Author: 137Labs

At the end of June 2026, Chainalysis publicly released a data framework called the 'Blockchain Tracing Ontology', which aims to establish a more unified way of describing data in blockchain analysis. Unlike a typical product or feature announcement, the document reads more like an industry standard proposal: it attempts to redefine the fundamental concepts of on-chain data analysis and to establish a data model for blockchain tracing that is interpretable, verifiable, and reproducible.

Following its release, the proposal quickly drew attention in the blockchain analysis and digital asset compliance fields. Although it is still at the stage of public discussion and industry initiative, it has already prompted a question: does on-chain analysis need a more unified and transparent data standard?

A Long-standing Problem: Why Do Different Companies Arrive at Different Analysis Results?

Blockchain data is inherently open and transparent, but there has long been a lack of unified standards for interpreting this data.

Currently, most on-chain analysis platforms employ 'Address Clustering' technology, inferring which addresses might be controlled by the same entity based on transaction behavior. However, the algorithms, rules, and evidence sources adopted by different institutions are inconsistent. Consequently, the same address may be attributed to completely different entities on different platforms.
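To make the idea concrete, here is a minimal Python sketch of one widely described clustering rule, the common-input-ownership heuristic, which assumes that addresses spent together as inputs of one transaction share a controller. The article does not say which rules any particular firm uses, so this is an illustration rather than any vendor's method; the function name, the `max_inputs` parameter, and the sample addresses are all hypothetical.

```python
from collections import defaultdict


def cluster_by_common_inputs(transactions, max_inputs=None):
    """Group addresses that appear together as inputs of the same transaction.

    `transactions` is a list of input-address lists, one list per transaction.
    `max_inputs` is a hypothetical rule: transactions with more inputs than
    this are ignored for linking purposes.
    """
    parent = {}

    def find(addr):
        # Walk up to the representative address, shortening the path as we go.
        parent.setdefault(addr, addr)
        while parent[addr] != addr:
            parent[addr] = parent[parent[addr]]
            addr = parent[addr]
        return addr

    def union(a, b):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    for inputs in transactions:
        for addr in inputs:
            find(addr)  # register every address, even if it ends up alone
        if max_inputs is not None and len(inputs) > max_inputs:
            continue  # rule choice: do not link addresses via this transaction
        for other in inputs[1:]:
            union(inputs[0], other)

    clusters = defaultdict(set)
    for addr in list(parent):
        clusters[find(addr)].add(addr)
    return sorted(sorted(members) for members in clusters.values())


# Placeholder addresses, not real ones.
txs = [["A", "B"], ["B", "C"], ["C", "X", "Y"]]
print(cluster_by_common_inputs(txs))                # one cluster of five
print(cluster_by_common_inputs(txs, max_inputs=2))  # X and Y stay separate
```

The two calls run on identical input data yet return different clusters, simply because one rule differs. This is a small-scale version of the cross-platform inconsistency the article describes.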

For example, one analysis firm might identify an address as belonging to a major exchange, while another marks it as an unknown wallet; the same batch of addresses might also be assigned to different clusters across platforms. Such discrepancies may have limited impact on market analysis, but when it comes to judicial investigations, asset freezes, anti-money laundering, or law enforcement forensics, they can lead to significant controversy.

For courts, simply concluding that 'this is an exchange's wallet' is far from sufficient; a more critical question to answer is: Why can this judgment be made?

What Chainalysis Proposes Is Not a New Algorithm, But a 'Language'

Seeing the word 'Ontology', many might mistakenly think Chainalysis has introduced another new clustering algorithm. That is not the case.

An ontology is a concept from knowledge engineering: a unified system of concepts and relationships that standardizes how different objects are defined and how they relate to one another. Internet search, medical knowledge bases, and AI knowledge graphs all rely heavily on ontologies to ensure that data is understood uniformly.

What Chainalysis hopes to achieve is establishing a similar 'common language' for blockchain analysis.

In other words, it does not require all companies to adopt the same clustering algorithm; rather, it encourages them to express analysis results in a unified data structure. This would make the analytical process more transparent and make it easier for third parties to understand, verify, and reproduce the results.

'Cluster' Is No Longer Sufficient

In the past, the industry widely used 'Clusters' as the basic unit of analysis, assuming multiple addresses collectively belonged to a single wallet or entity.

While simple and intuitive, this method's limitations have become increasingly apparent with the evolution of blockchain infrastructure.

Today, the wallet system of a large exchange might contain millions of addresses serving quite different functions, such as deposits, withdrawals, hot and cold wallet management, consolidation, and change. If they are all simply grouped into a single Cluster, it becomes difficult to describe this complex wallet structure accurately.

Therefore, in its proposal, Chainalysis introduces the new concept of 'Wallet Segment'.

In the new model, an Entity can have multiple Wallets, each Wallet can be divided into multiple Wallet Segments, and each Segment contains specific Addresses. This hierarchy reflects the wallet management model of large institutions more realistically than the traditional Cluster approach does, and it allows a more granular description of the control relationships between different addresses.
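The Entity → Wallet → Wallet Segment → Address hierarchy can be sketched as nested Python dataclasses. This is only an illustration under assumptions: the article does not publish the Ontology's actual schema, so the class fields, method names, and example values below are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class WalletSegment:
    purpose: str  # e.g. "deposit", "withdrawal", "cold storage"
    addresses: list[str] = field(default_factory=list)


@dataclass
class Wallet:
    name: str
    segments: list[WalletSegment] = field(default_factory=list)


@dataclass
class Entity:
    name: str
    wallets: list[Wallet] = field(default_factory=list)

    def locate(self, address):
        """Return (wallet name, segment purpose) for an address, or None."""
        for wallet in self.wallets:
            for segment in wallet.segments:
                if address in segment.addresses:
                    return wallet.name, segment.purpose
        return None

    def all_addresses(self):
        """Flatten the hierarchy into the single set a plain cluster would hold."""
        return {a for w in self.wallets for s in w.segments for a in s.addresses}


# Placeholder names and addresses, for illustration only.
exchange = Entity("ExampleExchange", [
    Wallet("hot", [
        WalletSegment("deposit", ["addr1", "addr2"]),
        WalletSegment("withdrawal", ["addr3"]),
    ]),
    Wallet("cold", [WalletSegment("cold storage", ["addr4"])]),
])
print(exchange.locate("addr3"))
print(sorted(exchange.all_addresses()))
```

A flat cluster corresponds to `all_addresses()` alone: it says the four addresses belong together but loses which wallet each sits in and what role it plays, which is the information `locate()` preserves.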

From 'Trust in Results' to 'Trust in the Process'

Beyond the model itself, a more significant change lies in the second layer of the design.

Traditional on-chain analysis focused more on the final result: who an address belongs to, where funds flow, and whether illicit activity is involved.

The new Ontology emphasizes the inference process itself.

For every analytical conclusion, several questions should be explicitly answered:

  • What on-chain evidence supports this conclusion?
  • Which analytical rules were applied?
  • Was off-chain information referenced?
  • What is the confidence level of this inference?
  • Can a third party re-validate this process?

In other words, it's not just about telling someone 'what', but also explaining 'why'.

Chainalysis refers to this part as the Evidence and Confidence layer.

In the future, marking an address as an exchange wallet would no longer be just a simple label; it would be accompanied by the full basis for the inference, including transaction patterns, address relationships, public information, and investigation records, along with a corresponding confidence level. This design aligns better with judicial requirements for explainable evidence and facilitates cross-validation between different institutions.
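One way to picture a label that carries its own evidence and confidence is the Python sketch below. It is a hypothetical data shape, not the Ontology's published format: the field names, the 0.0 to 1.0 confidence scale, the rule names, and the sample records are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Evidence:
    source: str  # "on-chain" or "off-chain"
    rule: str    # name of the analytical rule or record type applied
    detail: str  # what was observed


@dataclass
class Claim:
    address: str
    label: str
    confidence: float  # assumed scale: 0.0 (no support) to 1.0 (certain)
    evidence: list[Evidence] = field(default_factory=list)

    def is_reviewable(self):
        """True only if the claim cites evidence and states a valid confidence."""
        return bool(self.evidence) and 0.0 <= self.confidence <= 1.0

    def uses_off_chain(self):
        """True if any supporting item comes from outside the blockchain."""
        return any(e.source == "off-chain" for e in self.evidence)


# A bare label, as in the older result-only style.
bare = Claim("addr1", "exchange wallet", confidence=0.9)

# The same label with its inference basis attached (placeholder records).
documented = Claim("addr1", "exchange wallet", confidence=0.9, evidence=[
    Evidence("on-chain", "common-input", "co-spent with known deposit addresses"),
    Evidence("off-chain", "public-information", "address listed on the exchange's site"),
])

print(bare.is_reviewable(), documented.is_reviewable())
print(documented.uses_off_chain())
```

Both claims assert the same thing with the same confidence, but only the second gives a third party something to re-check, which is the shift from trusting results to trusting the process.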

The Insight from the Bitcoin Fog Case

In fact, this proposal did not emerge in a vacuum; it is closely related to the prominent U.S. Bitcoin Fog money laundering case.

Bitcoin Fog was once one of the longest-running Bitcoin mixing services. During its investigation, the U.S. Department of Justice relied heavily on analysis from Chainalysis Reactor as key evidence.

During the trial, the court held a notable Daubert hearing, subjecting Chainalysis's analysis methods to rigorous scrutiny, including:

  • Whether address clustering has a scientific basis;
  • Whether the analysis method can be repeatedly verified;
  • Whether it constitutes an unexplainable 'black-box algorithm';
  • Whether other experts can independently reproduce the analysis process.

Ultimately, the court recognized that Chainalysis's analytical methods possessed sufficient scientific reliability to be admissible as judicial evidence.

However, this case also exposed issues within the industry: if different analysis firms employ different standards, future similar cases could face more challenges. Therefore, establishing a unified framework for data expression and evidence became a key driver for Chainalysis to push forward the Ontology.

Blockchain Analysis Cannot Directly Identify Real-World Identities

It is worth noting that Chainalysis specifically emphasizes a key point in this proposal: on-chain analysis itself cannot directly identify real-world individual identities.

On-chain data can only reveal relationships between addresses and fund flow paths. Determining the real-world controller behind an address typically still relies on off-chain evidence, such as exchange KYC information, data subpoenaed by courts, server logs obtained by law enforcement, etc.

This means blockchain analysis provides high-quality inference from data, not definitive evidence that directly proves identity. A truly complete chain of judicial evidence requires combining on-chain data with off-chain investigation.

From Data Quality to Industry Standard

Beyond the Ontology itself, the overall framework presented systematically addresses data quality, analytical transparency, and judicial admissibility. It is evident that Chainalysis aims to encourage the industry to focus not just on analysis results, but on whether the analytical process can be explained, verified, and reproduced.

This also indicates that the future competitive focus of the industry may shift from 'who covers more addresses' or 'who identifies more labels' to 'whose data quality is higher', 'whose analysis is more transparent', and 'whose evidence is more readily admissible in court'.

For regulators, law enforcement agencies, and large financial institutions, a system capable of explaining its analytical logic, supporting independent audits, and possessing reproducible verification capabilities is clearly more trustworthy than a 'black-box model' that only outputs results.

What Does This Proposal Mean?

From a longer-term perspective, what Chainalysis released this time is not an ordinary software upgrade, but more akin to an effort to push the blockchain analysis industry from being 'experience-driven' to 'standard-driven'.

If this Ontology is widely adopted by the industry, different analysis firms, exchanges, regulatory bodies, and even judicial authorities could potentially share analysis results under a unified data model, reducing communication costs, improving evidence consistency, and providing a more reliable foundation for cross-border law enforcement, anti-money laundering investigations, and digital asset regulation.

Of course, standards are not established overnight. Balancing commercial secrecy with transparency, encouraging different institutions to adopt unified norms, and continuously refining the evidence model will all require joint exploration by the industry.

However, it is certain that as digital assets become increasingly integrated into the global financial system, the focus of competition in blockchain analysis is changing: what will truly determine the industry's value in the future is not just the accuracy of algorithms, but also the explainability of the analysis process, data quality, and evidence credibility. And this is precisely the new direction that Chainalysis hopes to open up with the Blockchain Tracing Ontology.

Related Questions

Q: According to the article, what is the core problem that Chainalysis's Blockchain Tracing Ontology aims to solve?

A: The article states that a core problem is the lack of unified standards in interpreting public blockchain data. Currently, different analysis companies use different algorithms and rules for 'address clustering,' leading to inconsistent results for the same address across platforms. This lack of standardization creates significant challenges for judicial investigations, asset freezing, and AML compliance, as courts require not just a conclusion but an explanation of 'why' that conclusion was reached.

Q: What is the primary difference between a traditional 'Cluster' and the new 'Wallet Segment' concept proposed in the Ontology?

A: A traditional 'Cluster' groups addresses under a single entity or wallet, which is too simplistic for modern, complex wallet structures like those of large exchanges. The new 'Wallet Segment' concept introduces a more granular, hierarchical model: an Entity can own multiple Wallets, each Wallet can contain multiple Segments (representing different functions like deposits, withdrawals, or cold storage), and each Segment holds specific addresses. This better reflects real-world wallet management.

Q: How does the proposed Ontology shift the focus of blockchain analysis, as described in the article?

A: The Ontology shifts the focus from just presenting final results (like who owns an address) to emphasizing the transparency and verifiability of the analysis process itself. It requires that any conclusion be accompanied by its underlying evidence (e.g., transaction patterns, address relationships), the rules used for inference, references to off-chain information, and a stated confidence level. This moves from 'result trustworthiness' to 'process trustworthiness,' making analysis more explainable and suitable for judicial evidence.

Q: What major legal case is cited as a key motivation for developing this Ontology, and why?

A: The article cites the U.S. Bitcoin Fog money laundering case as a key motivation. In this case, Chainalysis's analysis was subjected to rigorous judicial scrutiny in a Daubert hearing, which tested its scientific reliability and repeatability. While the methodology was accepted, the case highlighted a potential future problem: if different analysis firms use different standards, their evidence could face more challenges in court. The Ontology aims to create a unified evidence framework to address this.

Q: What crucial limitation of blockchain analysis does the article highlight, even with the new Ontology?

A: The article emphasizes that blockchain analysis, even with the proposed Ontology, cannot directly identify the real-world identity of a person behind an address. It can only reveal relationships between addresses and trace fund flows. Determining the actual controller of an address still typically requires combining this on-chain analysis with off-chain evidence, such as exchange KYC data, server logs obtained by law enforcement, or other investigative information.
