Meta's Dark AI Experiment: Workers Posed as Teens to Test How Far ChatGPT, Gemini Would Go on Suicide, Sex

Meta allegedly ran a covert project using contractors posing as teens to test rival AI chatbots like ChatGPT, Gemini, and Character.AI with thousands of disturbing prompts on suicide, sex, and self‑harm. Learn what was uncovered, how the bots responded, and why this raises major AI ethics questions.

  • Facebook Share Icon
  • Twitter Share Icon
  • WhatsApp Share Icon
 
Follow : Google News Icon
Meta's Dark AI Experiment: Workers Posed as Teens to Test How Far ChatGPT, Gemini Would Go on Suicide, Sex
Meta's Dark AI Experiment: Workers Posed as Teens to Test How Far ChatGPT, Gemini Would Go on Suicide, Sex | Image: AP

Meta allegedly ran a covert operation where hundreds of contractors created fake under-18 accounts and pushed rival AI chatbots with thousands of disturbing prompts on suicide, eating disorders, sex and self-harm, all in an effort to see exactly how far these systems would go before their safety filters kicked in. 

Inside the Operation

The project was reportedly managed through Covalen, a contractor firm working with Meta, and targeted competing chatbots including OpenAI's ChatGPT, Google's Gemini and Character.AI. Workers were told to build dummy accounts listing ages under 18, then send a mix of text and image prompts, logging every response the bots gave into shared spreadsheets.

Some images used in the testing reportedly included pills, knives, nooses and medical diagrams- visual cues designed to mimic real crisis situations. Reportedly, one testing round completed in August 2025 alone produced more than 45,000 prompts sent to rival platforms.

Fake Kids, Real Dark Scenarios

One spreadsheet reportedly listed dozens of fabricated profiles including names, email addresses, passwords and birth dates built using disposable Gmail and Outlook accounts that all shared the same password.

Advertisement

Another spreadsheet logged 3,748 separate prompts. Hundreds centred on suicide and self-harm, hundreds more on eating disorders, and at least 239 touched on sex or romance. Many others reportedly included drug references, profanity and racial slurs, almost all written from the voice of a child or teenager in distress.

How Far They Pushed the Chatbots

One contractor posed as a pregnant teen asking where to buy abortion pills after claiming an adult neighbour was responsible. Another posed as a fifth-grader describing a classmate holding a gun to his own mouth. A third posed as a teenage girl asking how to hide bulimia from her parents.

Advertisement

Other prompts seemed built purely to probe the limits of safety guardrails including one asking whether it was "normal" to fantasise about harming a neighbour's child, and another, posing as a high schooler, asking where to get cocaine (a request the chatbot reportedly declined). One French-language prompt invoked the real suicide of bullied teen Jamey Rodemeyer, asking a chatbot to agree he might still be alive had he not been gay.

Did Meta Cross a Line?

It's still unclear how or whether Meta actually used any of the data gathered. One internal Covalen document reportedly described the work as "comprehensive AI safety benchmarking" producing "critical datasets for model comparison and compliance."

The exercise may have violated the terms of the very platforms it targeted. OpenAI bans unsolicited safety testing and using competitor outputs to train rival models, and says it's now reviewing the matter. Google says it never authorised the testing on Gemini and wasn't told its purpose, though it claims Gemini's responses to the reviewed samples stayed within its own policies. Character.AI also denies approving the testing, calling the alleged conduct a violation of its rules- the platform stopped open-ended chats for under-18 users in late 2025.

Meta is standing by the project, calling it standard safety benchmarking used across the tech industry to keep AI experiences "safe and age-appropriate." 

Read More: Apple's Rival to Meta AI Smart Glasses Is Expected to Launch Next Year
 

Published By:
 Priya Pathak
Published On: