Thousands of people around the world are selling their identities to train AI, but at what cost?

marsbit · Published 2026-03-23 · Updated 2026-03-23

Article Summary

A global investigation reveals a growing gray market where thousands of people worldwide are selling their biometric data—voices, faces, call logs, and daily videos—to train AI models for small payments. Examples include individuals in South Africa, India, and the U.S. earning modest sums through apps like Kled AI, Silencio, and Neon Mobile. While this provides crucial income, especially in economically strained regions, it raises serious concerns about privacy, exploitation, and long-term risks. Contributors often grant broad, irreversible rights to their data, potentially exposing them to deepfakes, identity theft, and unauthorized commercial use. Experts warn that this practice is unstable, offers no career progression, and primarily benefits tech companies in wealthier nations, leaving workers vulnerable with little recourse. Cases like an actor discovering his AI likeness promoting medical products without consent highlight the ethical and personal consequences of this emerging data-for-cash economy.

Author: The Guardian

Compiled by: Deep Tide TechFlow

Deep Tide Introduction: This investigative report reveals a rapidly growing gray industry: thousands of people worldwide are earning money by selling their voices, faces, call records, and daily videos for AI training.

This is not a general discussion about privacy controversies, but an investigation with real people, real amounts of money, and real consequences—an actor who sold his face later saw "himself" promoting an unknown medical product on Instagram, with people in the comments evaluating his "looks."

When the data hunger of AI companies meets global economic disparity, the result is an unequal transaction.

The full text follows:

One morning last year, Jacobus Louw, who lives in Cape Town, South Africa, went out for his usual walk, feeding seagulls along the way. But this time he recorded a few videos, filming his footsteps and the view as he walked along the sidewalk. The footage earned him $14, about 10 times the country's hourly minimum wage and equivalent to half a week's food expenses for the 27-year-old.

This was a "city navigation" task Louw completed on Kled AI. Kled AI is an app that pays users to upload photos, videos, and other data for training AI models. In just a few weeks, Louw earned $50 by uploading photos and videos from his daily life.

Thousands of miles away, in Ranchi, India, 22-year-old student Sahil Tigga regularly earns money through Silencio, an app that crowdsources audio data for AI training by accessing his phone's microphone to capture ambient noise in places such as restaurants and busy intersections. He also uploads recordings of his own voice. Tigga makes special trips to unique locations, such as hotel lobbies not yet recorded on Silencio's map. He earns over $100 per month this way, enough to cover all his food expenses.

In Chicago, 18-year-old welding apprentice Ramelio Hill sold his private phone conversations with friends and family to Neon Mobile—a conversational AI training platform that pays $0.50 per minute—earning a few hundred dollars. For Hill, the calculation was simple: he believes tech companies already have vast amounts of his private data, so he might as well get a share of the profits.

These "AI training gigs"—uploading surroundings, personal photos, videos, and audio—are at the forefront of a new global data gold rush. As Silicon Valley's hunger for high-quality human data exceeds what can be scraped from the open internet, a booming data market industry has emerged to bridge this gap. From Cape Town to Chicago, thousands of people are micro-licensing their biometric identities and private data to the next generation of AI.

But this new gig economy comes at a cost. Behind the few dollars earned, these trainers are fueling an industry that may ultimately render their skills obsolete, while exposing themselves to future risks of deepfakes, identity theft, and digital exploitation—risks they are only beginning to understand.

Keeping the AI Gears Turning

AI language models like ChatGPT and Gemini require massive amounts of material to keep improving, but they are facing a data shortage. The most commonly used training data sources (C4, RefinedWeb, and Dolma), which comprise a quarter of the highest-quality datasets on the web, are now restricting generative AI companies from using their data to train models. Researchers estimate that AI companies could run out of fresh, high-quality text as early as 2026. Although some labs have begun training models on synthetic data generated by AI itself, this recursive process can lead models to output error-filled "garbage" and eventually collapse.

This is where apps like Kled AI and Silencio come in. In these data markets, millions of people are feeding and training AI by selling their identity data. Beyond Kled AI, Silencio, and Neon Mobile, AI trainers have many choices: Luel AI, backed by the well-known incubator Y Combinator, acquires multilingual conversation material at about $0.15 per minute; ElevenLabs lets users digitally clone their voice and make it available to others at a base rate of $0.02 per minute.

Bouke Klein Teeselink, an economics professor at King's College London, says AI training gigs are an emerging work category that will grow significantly.

AI companies know that paying people for data licensing helps avoid the copyright disputes that might arise from relying entirely on web-scraped content, Klein Teeselink says. AI researcher Veniamin Veselovsky adds that these companies also need high-quality data to model new, improved behaviors for their systems. "For now, human data is the gold standard for sampling outside the model's distribution," he said.

The humans driving these machines—especially those in developing countries—often need the money and have few alternatives. For many AI training gig workers, taking on this work is a pragmatic response to economic disparity. In countries with high unemployment rates and depreciating local currencies, earning dollars is often more stable and lucrative than local jobs. Some struggle to find entry-level work and are forced into AI training out of necessity. Even in wealthier countries, rising living costs make selling oneself a logical financial choice.

Cape Town-based AI trainer Louw is well aware of the privacy cost. Although the income is unstable and not enough to cover all his monthly expenses, he is willing to accept these conditions to earn money. He has suffered from neurological diseases for years, making it difficult to find work, but the money he earned from AI data markets (including Kled AI) allowed him to save $500 to enroll in a spa training course to become a massage therapist.

"As a South African, receiving dollars is more valuable than people think," Louw said.

Mark Graham, a professor of internet geography at Oxford University and author of "Feeding the Machine," acknowledges that for individuals in developing countries, the money may have practical significance in the short term, but he warns, "Structurally, this work is unstable, has no upward mobility, and is essentially a dead end."

Graham added that AI data markets rely on "a race to the bottom in wages" and a "temporary demand for human data." Once that demand shifts, "workers will have no security, no transferable skills, and no safety net."

Graham said the only winners are "the platforms in the Global North, which capture all the lasting value."

Full Authorization

Chicago-based AI trainer Hill has mixed feelings about selling his private phone calls to Neon Mobile. About 11 hours of call content earned him $200, but he said the app often goes offline and delays payments. "Neon always seemed suspicious to me, but I kept using it just to earn some extra pocket money to pay bills," Hill said.

Now he is reconsidering whether the money was really that easy. Last September, Neon Mobile went offline just weeks after launching, after TechCrunch discovered a security vulnerability that allowed anyone to access users' phone numbers, call recordings, and transcripts. Hill said Neon Mobile never notified him of this situation, and now he is worried his voice could be misused online.

Jennifer King, a data privacy researcher at Stanford University's Institute for Human-Centered Artificial Intelligence, is concerned that AI data markets are not clear about how and where user data will be used. She added that without understanding their rights or being able to negotiate them, "consumers face the risk of their data being reused in ways they dislike, do not understand, or did not anticipate, with almost no recourse."

When AI trainers share data on Neon Mobile and Kled AI, they grant a full authorization (global, exclusive, irrevocable, transferable, and royalty-free) allowing the platform to sell, use, publicly display, and store their likeness, and even create derivative works based on it.

Kled AI founder Avi Patel said his company's data agreement limits use to AI training and research purposes. "The entire business model relies on user trust. If contributors think their data might be misused, the platform cannot function." He said the company vets buyers before selling datasets, avoiding cooperation with "suspicious-intent" organizations, such as the pornography industry, and "government agencies" they believe might use the data in ways that violate that trust.

Neon Mobile did not respond to requests for comment.

Enrico Bonadio, a law professor at City, University of London, pointed out that these agreement terms allow the platform and its clients to "do almost anything with that material, permanently, without additional payment, and contributors have no practical way to withdraw consent or renegotiate."

More worrying still is the risk that trainers' data will be used to create deepfakes and to impersonate them. Although data markets claim to strip identifying information (such as names and locations) from data before sale, biometric patterns are inherently difficult to anonymize in any meaningful way, Bonadio added.

Seller's Remorse

Even if AI trainers could negotiate more detailed protections for how their data is used, they might still regret it. In 2024, New York-based actor Adam Coy sold his likeness for $1,000 to Captions, an AI video editing tool since renamed Mirage. His agreement stipulated that his identity would not be used for political purposes or to promote alcohol, tobacco, or pornography, and that the license would last for one year.

Captions did not respond to requests for comment.

Soon after, Coy's friends began forwarding videos they had found online featuring his face and voice, which had garnered millions of views. In one Instagram video, Coy's AI replica claimed to be a "vaginal doctor" promoting unverified medical supplements for pregnant and postpartum women.

"It's embarrassing to explain this to others," Coy said.

"The comments were weird because they were evaluating my appearance, but it wasn't even me," Coy added. "My thinking when I made the decision (to sell my likeness) was that most models would scrape data and portraits from the internet anyway, so I might as well get paid."

Coy said he has not taken any AI data gigs since. He said he would only consider doing it again if a company offered significant compensation.

Related Q&A

Q: What is the main concern raised in the article about people selling their personal data for AI training?

A: The article highlights that while individuals earn money by selling their biometric identities and private data (like voice, face, and conversations) to train AI models, they face risks such as deepfakes, identity theft, and digital exploitation, often without fully understanding the long-term consequences or having recourse if their data is misused.

Q: Why are AI companies turning to data markets like Kled AI and Silencio for training data?

A: AI companies are facing a shortage of high-quality training data from the open internet due to restrictions on datasets and potential copyright issues. Data markets provide a way to acquire fresh, human-generated data directly from individuals, which is considered the "gold standard" for training AI models to improve their behavior and avoid using synthetic data that can lead to model degradation.

Q: How do economic disparities play a role in the growth of AI training gigs?

A: Economic disparities drive the growth of AI training gigs because people in developing countries, or those facing high unemployment and currency devaluation, can earn valuable dollars from these platforms. For many, it is a pragmatic response to financial need, offering income that is more stable and lucrative than local jobs, even though the work is unstable and lacks long-term career prospects.

Q: What are some specific risks mentioned regarding the authorization agreements signed by AI trainers?

A: The authorization agreements often grant platforms global, exclusive, irrevocable, transferable, and royalty-free rights to use, sell, display, and create derivative works from the contributors' data. This means trainers have little control over how their data is used long-term, and risks include data being used for deepfakes, identity impersonation, or in ways they did not anticipate, with no practical way to withdraw consent or renegotiate.

Q: Can you provide an example from the article where an AI trainer regretted selling their data?

A: Actor Adam Coy from New York sold his likeness for $1,000 to Captions (now Mirage) with specific restrictions, but later found his AI replica used in Instagram videos promoting unverified medical supplements, with millions of views. He felt embarrassed and noted that the comments were evaluating his appearance based on the AI clone, which was not actually him. He stated he would only consider such gigs again for significant payment due to the negative experience.
