Thousands of people around the world are selling their identities to train AI, but at what cost?

marsbit發佈於 2026-03-23更新於 2026-03-23

文章摘要

A global investigation reveals a growing gray market where thousands of people worldwide are selling their biometric data—voices, faces, call logs, and daily videos—to train AI models for small payments. Examples include individuals in South Africa, India, and the U.S. earning modest sums through apps like Kled AI, Silencio, and Neon Mobile. While this provides crucial income, especially in economically strained regions, it raises serious concerns about privacy, exploitation, and long-term risks. Contributors often grant broad, irreversible rights to their data, potentially exposing them to deepfakes, identity theft, and unauthorized commercial use. Experts warn that this practice is unstable, offers no career progression, and primarily benefits tech companies in wealthier nations, leaving workers vulnerable with little recourse. Cases like an actor discovering his AI likeness promoting medical products without consent highlight the ethical and personal consequences of this emerging data-for-cash economy.

Author: The Guardian

Compiled by: Deep Tide TechFlow

Deep Tide Introduction: This investigative report reveals a rapidly growing gray industry: thousands of people worldwide are earning money for AI training by selling their voices, faces, call records, and daily videos.

This is not a general discussion about privacy controversies, but an investigation with real people, real amounts of money, and real consequences—an actor who sold his face later saw "himself" promoting an unknown medical product on Instagram, with people in the comments evaluating his "looks."

When the data hunger of AI companies combines with global economic disparities, it is creating an unequal transaction.

Full text as follows:

One morning last year, Jacobus Louw, who lives in Cape Town, South Africa, went out for his usual walk, feeding seagulls along the way. But this time he recorded a few videos—filming his footsteps and view as he walked on the sidewalk. This video earned him $14, about 10 times the country's minimum wage and equivalent to half a week's food expenses for this 27-year-old.

This was a "city navigation" task Louw completed on Kled AI. Kled AI is an app that pays users to upload photos, videos, and other data for training AI models. In just a few weeks, Louw earned $50 by uploading photos and videos from his daily life.

Thousands of miles away, in Ranchi, India, 22-year-old student Sahil Tigga regularly earns money through Silencio—an app that crowdsources audio data for AI training, accessing his phone's microphone to capture ambient noises like inside restaurants or busy intersections. He also uploads recordings of his own voice. Sahil makes special trips to unique locations, such as hotel lobbies not yet recorded on Silencio's map. He earns over $100 per month from this, enough to cover all his food expenses.

In Chicago, 18-year-old welding apprentice Ramelio Hill sold his private phone conversations with friends and family to Neon Mobile—a conversational AI training platform that pays $0.50 per minute—earning a few hundred dollars. For Hill, the calculation was simple: he believes tech companies already have vast amounts of his private data, so he might as well get a share of the profits.

These "AI training gigs"—uploading surroundings, personal photos, videos, and audio—are at the forefront of a new global data gold rush. As Silicon Valley's hunger for high-quality human data exceeds what can be scraped from the open internet, a booming data market industry has emerged to bridge this gap. From Cape Town to Chicago, thousands of people are micro-licensing their biometric identities and private data to the next generation of AI.

But this new gig economy comes at a cost. Behind the few dollars earned, these trainers are fueling an industry that may ultimately render their skills obsolete, while exposing themselves to future risks of deepfakes, identity theft, and digital exploitation—risks they are only beginning to understand.

Keeping the AI Gears Turning

AI language models like ChatGPT and Gemini require massive amounts of material to continuously improve, but they are facing a data shortage. The most commonly used training data sources—C4, RefinedWeb, and Dolma—which comprise a quarter of the highest-quality datasets on the web, are now restricting generative AI companies from using their data to train models. Researchers estimate that AI companies could run out of available fresh, high-quality text as early as 2026. Although some labs have begun training models with synthetic data generated by AI itself, this recursive process leads to models outputting error-filled "garbage," eventually causing a collapse.

This is where apps like Kled AI and Silencio come in. In these data markets, millions of people are feeding and training AI by selling their identity data. Beyond Kled AI, Silencio, and Neon Mobile, AI trainers have many choices: Luel AI, backed by the famous incubator Y-Combinator, acquires multilingual conversation material at about $0.15 per minute; ElevenLabs allows you to digitally clone your voice and make it available for others to use at a base rate of $0.02 per minute.

Bouke Klein Teeselink, an economics professor at King's College London, says AI training gigs are an emerging work category that will grow significantly.

AI companies know that paying people for data licensing helps avoid copyright disputes that might arise from relying entirely on web-scraped content, Teeselink says. AI researcher Veniamin Veselovsky adds that these companies also need high-quality data to model new, improved behaviors for their systems. "For now, human data is the gold standard for sampling outside the model's distribution," Veselovsky added.

The humans driving these machines—especially those in developing countries—often need the money and have few alternatives. For many AI training gig workers, taking on this work is a pragmatic response to economic disparity. In countries with high unemployment rates and depreciating local currencies, earning dollars is often more stable and lucrative than local jobs. Some struggle to find entry-level work and are forced into AI training out of necessity. Even in wealthier countries, rising living costs make selling oneself a logical financial choice.

Cape Town-based AI trainer Louw is well aware of the privacy cost. Although the income is unstable and not enough to cover all his monthly expenses, he is willing to accept these conditions to earn money. He has suffered from neurological diseases for years, making it difficult to find work, but the money he earned from AI data markets (including Kled AI) allowed him to save $500 to enroll in a spa training course to become a massage therapist.

"As a South African, receiving dollars is more valuable than people think," Louw said.

Mark Graham, a professor of internet geography at Oxford University and author of "Feeding the Machine," acknowledges that for individuals in developing countries, the money may have practical significance in the short term, but he warns, "Structurally, this work is unstable, has no upward mobility, and is essentially a dead end."

Graham added that AI data markets rely on "a race to the bottom in wages" and a "temporary demand for human data." Once that demand shifts, "workers will have no security, no transferable skills, and no safety net."

Graham said the only winners are "the platforms in the Global North, which capture all the lasting value."

Full Authorization

Chicago-based AI trainer Hill has mixed feelings about selling his private phone calls to Neon Mobile. About 11 hours of call content earned him $200, but he said the app often goes offline and delays payments. "Neon always seemed suspicious to me, but I kept using it just to earn some extra pocket money to pay bills," Hill said.

Now he is reconsidering whether the money was really that easy. Last September, Neon Mobile went offline just weeks after launching, after TechCrunch discovered a security vulnerability that allowed anyone to access users' phone numbers, call recordings, and transcripts. Hill said Neon Mobile never notified him of this situation, and now he is worried his voice could be misused online.

Jennifer King, a data privacy researcher at Stanford University's Institute for Human-Centered Artificial Intelligence, is concerned that AI data markets are not clear about how and where user data will be used. She added that without understanding their rights or being able to negotiate them, "consumers face the risk of their data being reused in ways they dislike, do not understand, or did not anticipate, with almost no recourse."

When AI trainers share data on Neon Mobile and Kled AI, they grant a full authorization (global, exclusive, irrevocable, transferable, and royalty-free) allowing the platform to sell, use, publicly display, and store their likeness, and even create derivative works based on it.

Kled AI founder Avi Patel said his company's data agreement limits use to AI training and research purposes. "The entire business model relies on user trust. If contributors think their data might be misused, the platform cannot function." He said the company vets buyers before selling datasets, avoiding cooperation with "suspicious-intent" organizations, such as the pornography industry, and "government agencies" they believe might use the data in ways that violate that trust.

Neon Mobile did not respond to requests for comment.

Enrico Bonadio, a law professor at City, University of London, pointed out that these agreement terms allow the platform and its clients to "do almost anything with that material, permanently, without additional payment, and contributors have no practical way to withdraw consent or renegotiate."

More worrying risks include: trainers' data being used to create deepfakes and identity impersonation. Although data markets claim to strip identifying information (such as names and locations) from data before sale, biometric patterns are inherently difficult to anonymize meaningfully, Bonadio added.

Seller's Remorse

Even if AI trainers could negotiate more detailed protections for how their data is used, they might still regret it. In 2024, New York-based actor Adam Coy sold his likeness for $1,000 to Captions—an AI video editing software now renamed Mirage. His agreement stipulated that his identity would not be used for any political purposes, not for promoting alcohol, tobacco, or pornography, and the license would last for one year.

Captions did not respond to requests for comment.

Soon after, Adam's friends began forwarding videos they found online featuring his face and voice, which had garnered millions of views. In one Instagram video, Adam's AI replica claimed to be a "vaginal doctor" promoting unverified medical supplements for pregnant and postpartum women.

"It's embarrassing to explain this to others," Coy said.

"The comments were weird because they were evaluating my appearance, but it wasn't even me," Coy added. "My thinking when I made the decision (to sell my likeness) was that most models would scrape data and portraits from the internet anyway, so I might as well get paid."

Coy said he has not taken any AI data gigs since. He said he would only consider doing it again if a company offered significant compensation.

相關問答

QWhat is the main concern raised in the article about people selling their personal data for AI training?

AThe article highlights that while individuals earn money by selling their biometric identities and private data (like voice, face, and conversations) to train AI models, they face risks such as deepfakes, identity theft, and digital exploitation, often without fully understanding the long-term consequences or having recourse if their data is misused.

QWhy are AI companies turning to data markets like Kled AI and Silencio for training data?

AAI companies are facing a shortage of high-quality training data from the open internet due to restrictions on datasets and potential copyright issues. Data markets provide a way to acquire fresh, human-generated data directly from individuals, which is considered the 'gold standard' for training AI models to improve their behavior and avoid using synthetic data that can lead to model degradation.

QHow do economic disparities play a role in the growth of AI training gigs?

AEconomic disparities drive the growth of AI training gigs because people in developing countries, or those facing high unemployment and currency devaluation, can earn valuable dollars from these platforms. For many, it is a pragmatic response to financial need, offering income that is more stable and lucrative than local jobs, even though the work is unstable and lacks long-term career prospects.

QWhat are some specific risks mentioned regarding the authorization agreements signed by AI trainers?

AThe authorization agreements often grant platforms global, exclusive, irrevocable, transferable, and royalty-free rights to use, sell, display, and create derivative works from the contributors' data. This means trainers have little control over how their data is used long-term, and risks include data being used for deepfakes, identity impersonation, or in ways they did not anticipate, with no practical way to withdraw consent or renegotiate.

QCan you provide an example from the article where an AI trainer regretted selling their data?

AActor Adam Coy from New York sold his likeness for $1000 to Captions (now Mirage) with specific restrictions, but later found his AI replica used in Instagram videos promoting unverified medical supplements, with millions of views. He felt embarrassed and noted that the comments were evaluating his appearance based on the AI clone, which was not actually him. He stated he would only consider such gigs again for significant payment due to the negative experience.

你可能也喜歡

比特币已实现市值回升至正值区域,市场重获力量

比特币价格在周日小幅反弹后重回8万美元关键点位上方,多个指标开始重新显现强势。其中,比特币已实现市值(Realized Cap)随着市场状况缓慢改善,近期已转为看涨信号。 比特币重新燃起的看涨势头正逐渐体现在多个关键链上指标中,反映出市场动态的转变。比特币已实现市值目前显示出强势,随着市场情绪改善,已回升至正值区域。该指标通过计算已实现利润与已实现亏损的差值得出,反映了比特币市场创造或摧毁的价值。 CryptoQuant平台分析师Darkfost指出,该指标目前正显示复苏信号,这意味着资金正流入比特币。截至周日,比特币已实现市值已转正,增长率约为+0.25%。虽然增幅尚不显著,但这是在今年2月经历超过-2.6%的急剧下跌之后发生的。Darkfost认为,当前阶段代表了资产从“弱手”向“强手”的转移。 与此同时,另一个关键指标比特币净已实现利润/亏损也已转为正值。这一变化表明,以盈利状态转移的代币数量超过了以亏损状态转移的数量,显示出市场信心和投资者情绪正在稳步改善。链上分析账户On-Chain Mind指出,该指标是五个多月以来首次转正。 总体而言,这些链上指标的改善标志着市场正在经历一个修复过程,投资者情绪好转,资金开始回流。然而,这并不等同于直接进入牛市,趋势能否持续仍有待观察。

bitcoinist2 小時前

比特币已实现市值回升至正值区域,市场重获力量

bitcoinist2 小時前

BTC市场脉搏:第20周

比特币在过去一周从77,000美元高位震荡上行至82,000美元低位,买盘持续吸纳回调,尽管价格在局部高点附近动能有所减弱。现货CVD(累计成交量Delta)大幅上升,反映了强烈的看涨情绪和对价格上涨的高度信心。同时,现货交易量增加,表明近期的价格走势得到了更强投资者参与的推动。然而,价格动能的放缓指向更均衡的买卖压力,暗示市场可能进入一个稳定阶段。 期货市场方面,风险偏好同样上升。期货未平仓合约增加,表明投机活动加剧和风险承担意愿增强;永续合约CVD飙升,显示持续的看涨动能。但多头资金费率下降,意味着空头兴趣抬头,看涨情绪可能正在减弱。 期权市场对下行保护的需求下降,未平仓合约上升,表明市场预期转向中性偏多。然而,波动率利差大幅扩大,显示期权定价蕴含的风险显著高于已实现波动,反映出参与者中存在较高的不确定性。 链上活动显著增强,每日活跃地址、实体调整后的转账量和总手续费收入均有所上升,指向用户参与度提高和网络活动增加。与此同时,流动性状况持续稳定,短期投机资本的减少降低了即时卖压,而已实现市值变化则显示适度的净资本流入。 盈利能力指标也有所改善,市场从未实现亏损重回盈利状态。然而,处于盈利状态的供应百分比仍低于通常与大规模获利了结相关的水平,表明市场乐观情绪依然克制而非狂热。 总结来说,比特币的市场结构继续改善,得到更强的链上活动、更健康的盈利能力和更稳定的持有者仓位的支持。虽然看涨基调正在形成,但较温和的资本流入和谨慎的市场情绪表明,市场对风险偏好的变化依然敏感。

insights.glassnode4 小時前

BTC市场脉搏:第20周

insights.glassnode4 小時前

交易

現貨
合約

熱門文章

如何購買PEOPLE

歡迎來到HTX.com!在這裡,購買ConstitutionDAO (PEOPLE)變得簡單而便捷。跟隨我們的逐步指南,放心開始您的加密貨幣之旅。第一步:創建您的HTX帳戶使用您的 Email、手機號碼在HTX註冊一個免費帳戶。體驗無憂的註冊過程並解鎖所有平台功能。立即註冊第二步:前往買幣頁面,選擇您的支付方式信用卡/金融卡購買:使用您的Visa或Mastercard即時購買ConstitutionDAO (PEOPLE)。餘額購買:使用您HTX帳戶餘額中的資金進行無縫交易。第三方購買:探索諸如Google Pay或Apple Pay等流行支付方式以增加便利性。C2C購買:在HTX平台上直接與其他用戶交易。HTX 場外交易 (OTC) 購買:為大量交易者提供個性化服務和競爭性匯率。第三步:存儲您的ConstitutionDAO (PEOPLE)購買ConstitutionDAO (PEOPLE)後,將其存儲在您的HTX帳戶中。您也可以透過區塊鏈轉帳將其發送到其他地址或者用於交易其他加密貨幣。第四步:交易ConstitutionDAO (PEOPLE)在HTX的現貨市場輕鬆交易ConstitutionDAO (PEOPLE)。前往您的帳戶,選擇交易對,執行交易,並即時監控。HTX為初學者和經驗豐富的交易者提供了友好的用戶體驗。

717 人學過發佈於 2024.12.12更新於 2025.03.21

如何購買PEOPLE

相關討論

歡迎來到 HTX 社群。在這裡,您可以了解最新的平台發展動態並獲得專業的市場意見。 以下是用戶對 PEOPLE (PEOPLE)幣價的意見。

活动图片