Thousands of people around the world are selling their identities to train AI, but at what cost?

marsbit, published 2026-03-23, updated 2026-03-23

Article Summary

A global investigation reveals a growing gray market where thousands of people worldwide are selling their biometric data—voices, faces, call logs, and daily videos—to train AI models for small payments. Examples include individuals in South Africa, India, and the U.S. earning modest sums through apps like Kled AI, Silencio, and Neon Mobile. While this provides crucial income, especially in economically strained regions, it raises serious concerns about privacy, exploitation, and long-term risks. Contributors often grant broad, irreversible rights to their data, potentially exposing them to deepfakes, identity theft, and unauthorized commercial use. Experts warn that this practice is unstable, offers no career progression, and primarily benefits tech companies in wealthier nations, leaving workers vulnerable with little recourse. Cases like an actor discovering his AI likeness promoting medical products without consent highlight the ethical and personal consequences of this emerging data-for-cash economy.

Author: The Guardian

Compiled by: Deep Tide TechFlow

Deep Tide Introduction: This investigative report reveals a rapidly growing gray industry: thousands of people worldwide are earning money by selling their voices, faces, call records, and daily videos for AI training.

This is not a general discussion about privacy controversies, but an investigation with real people, real amounts of money, and real consequences—an actor who sold his face later saw "himself" promoting an unknown medical product on Instagram, with people in the comments evaluating his "looks."

When the data hunger of AI companies meets global economic disparity, the result is an unequal transaction.

Full text as follows:

One morning last year, Jacobus Louw, who lives in Cape Town, South Africa, went out for his usual walk, feeding seagulls along the way. This time, though, he recorded a few videos of his footsteps and his view as he walked along the sidewalk. The footage earned him $14, about 10 times the country's minimum wage and equivalent to half a week's food expenses for the 27-year-old.

This was a "city navigation" task Louw completed on Kled AI. Kled AI is an app that pays users to upload photos, videos, and other data for training AI models. In just a few weeks, Louw earned $50 by uploading photos and videos from his daily life.

Thousands of miles away, in Ranchi, India, 22-year-old student Sahil Tigga regularly earns money through Silencio, an app that crowdsources audio data for AI training by accessing his phone's microphone to capture ambient noise, such as the sounds inside restaurants or at busy intersections. He also uploads recordings of his own voice. Tigga makes special trips to locations not yet recorded on Silencio's map, such as hotel lobbies. He earns over $100 per month this way, enough to cover all his food expenses.

In Chicago, 18-year-old welding apprentice Ramelio Hill sold his private phone conversations with friends and family to Neon Mobile—a conversational AI training platform that pays $0.50 per minute—earning a few hundred dollars. For Hill, the calculation was simple: he believes tech companies already have vast amounts of his private data, so he might as well get a share of the profits.

These "AI training gigs"—uploading surroundings, personal photos, videos, and audio—are at the forefront of a new global data gold rush. As Silicon Valley's hunger for high-quality human data exceeds what can be scraped from the open internet, a booming data market industry has emerged to bridge this gap. From Cape Town to Chicago, thousands of people are micro-licensing their biometric identities and private data to the next generation of AI.

But this new gig economy comes at a cost. Behind the few dollars earned, these trainers are fueling an industry that may ultimately render their skills obsolete, while exposing themselves to future risks of deepfakes, identity theft, and digital exploitation—risks they are only beginning to understand.

Keeping the AI Gears Turning

AI language models like ChatGPT and Gemini require massive amounts of material to keep improving, but they are facing a data shortage. In the most commonly used training datasets (C4, RefinedWeb, and Dolma), about a quarter of the data from the highest-quality web sources is now restricted from use by generative AI companies. Researchers estimate that AI companies could run out of fresh, high-quality text as early as 2026. Although some labs have begun training models on synthetic data generated by AI itself, this recursive process leads to models outputting error-filled "garbage" and, eventually, to collapse.

This is where apps like Kled AI and Silencio come in. In these data markets, millions of people are feeding and training AI by selling their identity data. Beyond Kled AI, Silencio, and Neon Mobile, AI trainers have many choices: Luel AI, backed by the well-known incubator Y Combinator, acquires multilingual conversation material at about $0.15 per minute; ElevenLabs lets users digitally clone their voice and make it available for others to use at a base rate of $0.02 per minute.

Bouke Klein Teeselink, an economics professor at King's College London, says AI training gigs are an emerging work category that will grow significantly.

AI companies know that paying people to license their data helps avoid the copyright disputes that can arise from relying entirely on web-scraped content, Teeselink says. AI researcher Veniamin Veselovsky adds that these companies also need high-quality data to model new, improved behaviors for their systems. "For now, human data is the gold standard for sampling outside the model's distribution," he said.

The humans driving these machines—especially those in developing countries—often need the money and have few alternatives. For many AI training gig workers, taking on this work is a pragmatic response to economic disparity. In countries with high unemployment rates and depreciating local currencies, earning dollars is often more stable and lucrative than local jobs. Some struggle to find entry-level work and are forced into AI training out of necessity. Even in wealthier countries, rising living costs make selling oneself a logical financial choice.

Cape Town-based AI trainer Louw is well aware of the privacy cost. Although the income is unstable and not enough to cover all his monthly expenses, he is willing to accept these conditions to earn money. He has suffered from neurological diseases for years, making it difficult to find work, but the money he earned from AI data markets (including Kled AI) allowed him to save $500 to enroll in a spa training course to become a massage therapist.

"As a South African, receiving dollars is more valuable than people think," Louw said.

Mark Graham, a professor of internet geography at Oxford University and author of "Feeding the Machine," acknowledges that for individuals in developing countries, the money may have practical significance in the short term, but he warns, "Structurally, this work is unstable, has no upward mobility, and is essentially a dead end."

Graham added that AI data markets rely on "a race to the bottom in wages" and a "temporary demand for human data." Once that demand shifts, "workers will have no security, no transferable skills, and no safety net."

Graham said the only winners are "the platforms in the Global North, which capture all the lasting value."

Full Authorization

Chicago-based AI trainer Hill has mixed feelings about selling his private phone calls to Neon Mobile. About 11 hours of call content earned him $200, but he said the app often goes offline and delays payments. "Neon always seemed suspicious to me, but I kept using it just to earn some extra pocket money to pay bills," Hill said.

Now he is reconsidering whether the money was really that easy. Last September, Neon Mobile went offline just weeks after launching, after TechCrunch discovered a security vulnerability that allowed anyone to access users' phone numbers, call recordings, and transcripts. Hill said Neon Mobile never notified him of this situation, and now he is worried his voice could be misused online.

Jennifer King, a data privacy researcher at Stanford University's Institute for Human-Centered Artificial Intelligence, is concerned that AI data markets are not clear about how and where user data will be used. She added that without understanding their rights or being able to negotiate them, "consumers face the risk of their data being reused in ways they dislike, do not understand, or did not anticipate, with almost no recourse."

When AI trainers share data on Neon Mobile and Kled AI, they grant a full authorization (global, exclusive, irrevocable, transferable, and royalty-free) allowing the platform to sell, use, publicly display, and store their likeness, and even create derivative works based on it.

Kled AI founder Avi Patel said his company's data agreement limits use to AI training and research purposes. "The entire business model relies on user trust. If contributors think their data might be misused, the platform cannot function." He said the company vets buyers before selling datasets, avoiding cooperation with "suspicious-intent" organizations, such as the pornography industry, and "government agencies" they believe might use the data in ways that violate that trust.

Neon Mobile did not respond to requests for comment.

Enrico Bonadio, a law professor at City, University of London, pointed out that these agreement terms allow the platform and its clients to "do almost anything with that material, permanently, without additional payment, and contributors have no practical way to withdraw consent or renegotiate."

A more worrying risk is that trainers' data could be used to create deepfakes and to impersonate them. Although data markets claim to strip identifying information (such as names and locations) from data before sale, biometric patterns are inherently difficult to anonymize in any meaningful way, Bonadio added.

Seller's Remorse

Even if AI trainers could negotiate more detailed protections for how their data is used, they might still come to regret the sale. In 2024, New York-based actor Adam Coy sold his likeness for $1,000 to Captions, an AI video editing tool since renamed Mirage. His agreement stipulated that his identity would not be used for political purposes or to promote alcohol, tobacco, or pornography, and that the license would last one year.

Captions did not respond to requests for comment.

Soon after, Coy's friends began forwarding videos they had found online featuring his face and voice, which had garnered millions of views. In one Instagram video, Coy's AI replica claimed to be a "vaginal doctor" promoting unverified medical supplements for pregnant and postpartum women.

"It's embarrassing to explain this to others," Coy said.

"The comments were weird because they were evaluating my appearance, but it wasn't even me," Coy added. "My thinking when I made the decision (to sell my likeness) was that most models would scrape data and portraits from the internet anyway, so I might as well get paid."

Coy said he has not taken any AI data gigs since. He said he would only consider doing it again if a company offered significant compensation.
