Thousands of people are selling their identities to train AI – but at what cost?
In recent years, a new gig economy has emerged, where individuals across the globe are selling their personal data to artificial intelligence (AI) companies. This trend has raised important questions about privacy, identity theft, and the long-term implications of commodifying personal information.
The Rise of Gig AI Trainers
As the demand for high-quality data to train AI models increases, many people have turned to platforms that pay them for their data contributions. For example, Jacobus Louw, a 27-year-old from Cape Town, South Africa, recorded videos of his daily walks for an app called Kled AI. In just a couple of weeks, he earned $50 by uploading pictures and videos of his everyday life.
Similarly, Sahil Tigga, a 22-year-old student from Ranchi, India, earns over $100 a month by allowing an app called Silencio to access his phone’s microphone. He captures ambient sounds and uploads recordings of his voice, which he finds rewarding and financially beneficial.
In Chicago, Ramelio Hill, an 18-year-old welding apprentice, has sold private phone conversations to a platform named Neon Mobile, earning a few hundred dollars in the process. He rationalizes his decision by noting that tech companies already collect vast amounts of personal data, so he might as well benefit financially from it.
The Data Gold Rush
This new wave of gig AI trainers is part of a larger trend in which individuals are micro-licensing their biometric identities and intimate data to feed AI systems. As Silicon Valley’s appetite for high-quality, human-grade data grows, a thriving industry of data marketplaces has emerged. However, this shift comes with significant trade-offs.
Why AI Needs Human Data
AI language models like ChatGPT and Gemini require vast amounts of quality data for training. However, many traditional sources of high-quality datasets are becoming increasingly restrictive. Researchers predict that AI companies may run out of fresh, high-quality text data by 2026. This has led to a reliance on human-generated data, which is considered the gold standard for training AI systems.
The Platforms and Their Offerings
There are several platforms where individuals can sell their data, including:
- Kled AI: Users upload videos and photos to earn money.
- Silencio: Captures audio data from users’ surroundings.
- Neon Mobile: Pays users for their phone conversations.
- Luel AI: Sources multilingual conversations for a fee.
- ElevenLabs: Allows users to digitally clone their voice for a base fee.
The Economic Context
For many gig AI trainers, particularly in developing countries, the financial incentive is significant. With high unemployment rates and devalued currencies, earning US dollars through these platforms can be more lucrative than local job opportunities. Even in wealthier nations, rising living costs have made selling personal data an attractive option for many.
The Hidden Risks
Despite the financial benefits, the risks associated with gig AI training are substantial and often overlooked. Many platforms require users to grant irrevocable, royalty-free licenses for their data. This means that a voice recording or video could be used to create AI applications without the original contributor receiving any further compensation.
Additionally, the lack of transparency in these marketplaces raises concerns about how personal data is used. A user’s face could end up in a facial recognition database or be used in predatory advertising, with little to no legal recourse available for the individual.
Personal Experiences and Perspectives
Jacobus Louw, despite being aware of the privacy trade-offs, continues to participate in this gig economy. He acknowledges that while the income is erratic and insufficient to cover all his monthly expenses, he is willing to accept these conditions to earn money. His situation reflects a broader trend where individuals are making pragmatic choices in response to economic disparities.
Experts like Bouke Klein Teeselink, an economics professor at King’s College London, suggest that gig AI training is an emerging category of work that will continue to grow. AI companies are increasingly aware that compensating individuals for their data helps mitigate copyright disputes that can arise from scraping content from the web.
Conclusion
The trend of individuals selling their identities to train AI raises complex ethical and economic questions. While it provides financial opportunities for many, it also poses significant risks related to privacy and identity theft. As this gig economy evolves, it is crucial for participants to understand the implications of their choices and for regulators to consider how to protect individuals in this new landscape.
Note: The information in this article is based on current trends and should be interpreted within the context of ongoing developments in AI and data privacy.

