The DATA Foundation Launches to Tackle AI’s Multi-Billion Dollar Training Data Bottleneck

TheNewsCrypto发布于2026-06-25更新于2026-06-25

文章摘要

Story has rebranded as The DATA Foundation and launched the DATA Network alongside Trace, a public audit layer for AI training data provenance and licensing. The launch includes a flagship integration with Kled, the world's largest opt-in human data marketplace, registering 1.5 billion user-contributed records. This move addresses the AI industry's multi-billion dollar bottleneck in sourcing high-quality, legally compliant training data, as the supply of scrapable public web data is exhausted. Trace provides immutable receipts for each data contribution, enabling verification of consent, licensing, and provenance. The ecosystem also includes Poseidon's data processing and the contributor app Numo. The foundation's thesis is that proving data origin and ensuring fair compensation is critical for advancing trusted AI. The existing $IP token will migrate to $DATA on a one-to-one basis.

Palo Alto, United States, June 25th, 2026, Chainwire

Story rebrands as The DATA Foundation, launches DATA Network with flagship Kled AI integration, registering 1.5 billion user-contributed records on the platform

The Foundation also introduces Trace, the first public audit layer for consent, licensing, and data provenance at scale

Today, Story announces a strategic transition to become The DATA Foundation (“DATA”) and launches Trace, an onchain registry for AI training data provenance and licensing. The launch includes a flagship integration with Kled, the world’s largest opt-in human data marketplace, registering 1.5 billion user-contributed records on the Network. Andrea Muttoni becomes CEO of The DATA Foundation, and Kled’s founder, Avi Patel, joins in an advisor position as the Chief Data Officer.

AI’s Training Data Has Hit a Bottleneck

The shift to DATA reflects where the market is pulling hardest. AI training data has emerged as the most valuable and least solved category of IP. Frontier AI labs have hit a multi-billion-dollar data bottleneck, where the internet has been effectively exhausted for scraping. The remaining supply is either expensive and bespoke or legally undocumented, leaving labs without a way to source data at scale, prove its provenance, or guarantee its quality.

The legal stakes are rising, as frontier labs stake out market-defining products on data sourced through opaque networks, often without clear records of consent or jurisdiction. Scraped and undocumented data is no longer an option for enterprise-grade AI.

“The challenge in AI has shifted from compute and architecture to sourcing and provenance. As the scrapable web fractures, the question for labs now is who is keeping the receipts,” said Andrea Muttoni, CEO of The DATA Foundation. “With Kled, we combine full data transparency and auditability with the largest pool of AI training data on the planet.”

Building the Infrastructure for Trusted AI Data

DATA builds on the original mission to deliver a data and intellectual property (IP) layer for the internet, recognizing that the form of data and IP that is most critical in this era is AI training data. DATA Network brings essential infrastructure for training AI, anchored by a flagship integration with Kled. Starting today, Kled’s licensing rails and contributor receipts run on DATA Network with added support for stable coin payouts, which involves registering a staggering 1.5 billion user-contributed records with programmatic legal safeguards.

“Frontier labs have exhausted the supply of high-quality, human-generated public text available on the open web. Suppliers showing data-sourcing provenance will win the next decade of deals, and that’s our bet,” said Avi Patel, CEO and founder of Kled and part-time advisory CDO of The DATA Foundation. “Instead of sourcing data blindly, Kled’s data marketplace and DATA’s auditable chain of custody converge on what labs actually need to license data with confidence and transparency.”

Trace Launches as the Public Audit Layer for AI Training Data

Trace, The DATA Foundation’s public audit and search platform, also launches today alongside the Kled integration. Trace generates immutable, confidential receipts for every contribution, allowing labs to verify the legitimacy of datasets in seconds. For every single record uploaded by users worldwide, a receipt on DATA will be generated, enabling upstream compensation for contributors’ data and intellectual property. This addresses an urgent need for a verifiable and compliant AI training data market, which has become a legal and operational minefield.

A Wider Contributor Network

DATA’s thesis was validated by Poseidon, the AI data processing project incubated by Story, which cleans, normalizes, and scores raw human data for authenticity and quality, ensuring every record that reaches a buyer is model-ready. Poseidon’s early traction with frontier labs proved the AI training data opportunity. Backed by a16z and now running entirely on DATA, its contributor app Numo is live today, bringing thousands of contributors into the AI economy in exchange for real-time payouts.

“We started Story to build an IP layer for the internet, and the most important IP of this era is the data you can’t scrape: how a surgeon’s hands move, how a robot grips, how people speak, drive, and work in the real world,” said SY Lee, CEO of PIP Labs and strategic adviser to The DATA Foundation. “DATA is where that conviction goes next: an end-to-end network that proves real-world data’s origin, licenses it, and pays the people who made it. “

Token Migration and Ecosystem Continuity

The $IP token migrates to $DATA one-to-one with no action required from existing holders. Migration guidance, exchange timing, and an FAQ are available here.

About The DATA Foundation

Data is the biggest bottleneck in frontier AI. The data models need most either sits siloed with people and companies, or doesn’t exist yet, and won’t, until incentives are aligned to create it. DATA Network is the world’s AI audit rails built to answer the three questions every lab asks: can you source data at scale, prove where it came from, and guarantee its quality? Contributor apps including Numo and Kled supply opt-in human data; Trace gives every record a public, tamper-proof receipt; Poseidon turns it into model-ready datasets, so frontier AI can keep advancing on a foundation it can trust. $IP is now $DATA. More information available at datafdn.org.

Contact

HV
henri.vies@piplabs.xyz

热门币种推荐

相关问答

QWhat is the main purpose of The DATA Foundation's launch and its DATA Network?

AThe main purpose of The DATA Foundation's launch and its DATA Network is to tackle the multi-billion dollar AI training data bottleneck by providing infrastructure for sourcing, proving provenance, and guaranteeing the quality of AI training data at scale. It focuses on opt-in, legally compliant data to replace scraped and undocumented sources.

QWhat key problem does Trace, the new platform launched by The DATA Foundation, aim to solve?

ATrace aims to solve the problem of data provenance and licensing transparency for AI training data. It acts as a public audit layer by generating immutable, confidential receipts for every data contribution, allowing AI labs to verify dataset legitimacy, consent, and licensing in seconds, thereby creating a verifiable and compliant AI training data market.

QHow does the integration with Kled contribute to The DATA Foundation's network?

AThe integration with Kled contributes 1.5 billion user-contributed records to The DATA Network. It brings Kled's licensing rails and contributor receipts onto the network, adding support for stablecoin payouts. This combines Kled's large pool of opt-in human data with DATA's infrastructure for full transparency and auditability, providing AI labs with a trusted source of scalable, documented data.

QAccording to the article, why has scraped data become a problematic source for enterprise AI development?

AScraped data has become problematic because the internet has been effectively exhausted as a high-quality source, frontier AI labs face a data bottleneck, and such data often lacks clear records of consent, jurisdiction, or legal documentation. Using opaque, undocumented data poses significant legal risks and is no longer viable for building enterprise-grade AI products that require provenance and compliance.

QWhat changes are occurring with the $IP token as part of The DATA Foundation's launch?

AThe $IP token is migrating to the new $DATA token on a one-to-one basis. Existing holders do not need to take any action for this migration. Guidance on the migration process, exchange timing, and FAQs are provided by the foundation.

你可能也喜欢

Baillie Gifford 代币化债券基金加入 Solana 和以太坊 RWA 竞赛

据报道,英国资产管理公司Baillie Gifford正计划推出一个受监管的代币化债券基金,该基金将利用Solana和以太坊等公共区块链基础设施,并由纽约梅隆银行提供托管支持。此举使又一家传统大型资管机构加入了现实世界资产的代币化竞争。 代币化基金已成为传统金融与加密基础设施融合最清晰的领域之一。与投机性的代币发行不同,代币化债券和货币市场产品直接对接了机构对收益、结算效率和可编程分发的现有需求。 债券因其本身就存在于复杂的结算和托管体系中,天然适合代币化。代币化基金份额可以简化转让流程、提高透明度并支持更自动化的抵押品使用。以太坊和Solana等公链正竞相成为此类产品的承载平台,前者拥有机构熟悉度和深厚工具生态,后者则以速度和低成本见长。 现实世界资产叙事之所以强于其他加密叙事,是因为它紧密关联实用的金融基础设施。代币化国债、私募信贷、债券和基金份额都指向同一趋势:传统资产正逐渐与区块链结算兼容。尽管市场仍在争论哪条区块链将胜出,但受监管的代币化基金这一更广泛的趋势正在持续加强。 这一进展应被置于更广阔的市场结构中看待。当前市场正同时面对流动性减弱、政策问题趋严、机构产品推出以及高波动性代币再次承压等多重因素。因此,此类消息的价值在于揭示了资本、监管和基础设施的流动方向,而非必然成为短期的价格催化剂。

bitcoinist3小时前

Baillie Gifford 代币化债券基金加入 Solana 和以太坊 RWA 竞赛

bitcoinist3小时前

交易

现货
合约

热门文章

如何购买S

欢迎来到HTX.com!我们已经让购买Sonic(S)变得简单而便捷。跟随我们的逐步指南,放心开始您的加密货币之旅。第一步:创建您的HTX账户使用您的电子邮件、手机号码注册一个免费账户在HTX上。体验无忧的注册过程并解锁所有平台功能。立即注册第二步:前往买币页面,选择您的支付方式信用卡/借记卡购买:使用您的Visa或Mastercard即时购买Sonic(S)。余额购买:使用您HTX账户余额中的资金进行无缝交易。第三方购买:探索诸如Google Pay或Apple Pay等流行支付方法以增加便利性。C2C购买:在HTX平台上直接与其他用户交易。HTX场外交易台(OTC)购买:为大量交易者提供个性化服务和竞争性汇率。第三步:存储您的Sonic(S)购买完您的Sonic(S)后,将其存储在您的HTX账户钱包中。您也可以通过区块链转账将其发送到其他地方或者用于交易其他加密货币。第四步:交易Sonic(S)在HTX的现货市场轻松交易Sonic(S)。访问您的账户,选择您的交易对,执行您的交易,并实时监控。HTX为初学者和经验丰富的交易者提供了友好的用户体验。

2.6k人学过发布于 2025.01.15更新于 2026.06.02

如何购买S

相关讨论

欢迎来到HTX社区。在这里,您可以了解最新的平台发展动态并获得专业的市场意见。以下是用户对S(S)币价的意见。

活动图片