In its latest effort to address growing concerns about AI’s impact on young people, OpenAI on Thursday updated its guidelines for how its AI models should behave with users under 18, and published new AI literacy resources for teens and parents. Still, questions remain about how consistently such policies will translate into practice.
The updates come as the AI industry generally, and OpenAI in particular, faces increased scrutiny from policymakers, educators, and child-safety advocates after several teenagers allegedly died by suicide following prolonged conversations with AI chatbots.
Gen Z, which includes those born between 1997 and 2012, is the most active group of users of OpenAI’s chatbot. And following OpenAI’s new deal with Disney, more young people may flock to the platform, which lets you do everything from asking for help with homework to generating images and videos on thousands of topics.
Last week, 42 state attorneys general signed a letter to Big Tech companies, urging them to implement safeguards on AI chatbots to protect children and vulnerable people. And as the Trump administration works out what the federal standard on AI regulation might look like, policymakers like Sen. Josh Hawley (R-MO) have introduced legislation that would ban minors from interacting with AI chatbots altogether.
OpenAI’s updated Model Spec, which lays out behavior guidelines for its large language models, builds on existing specifications that prohibit the models from generating sexual content involving minors, or encouraging self-harm, delusions, or mania. This would work in tandem with an upcoming age-prediction model that would identify when an account belongs to a minor and automatically roll out teen safeguards.
Compared with adult users, the models are subject to stricter rules when a teen is using them. Models are instructed to avoid immersive romantic roleplay, first-person intimacy, and first-person sexual or violent roleplay, even when it’s non-graphic. The spec also calls for extra caution around subjects like body image and disordered eating behaviors, instructs the models to prioritize communicating about safety over autonomy when harm is involved, and to avoid advice that would help teens conceal unsafe behavior from caregivers.
OpenAI specifies that these limits should hold even when prompts are framed as “fictional, hypothetical, historical, or educational,” common tactics that rely on role-play or edge-case scenarios to get an AI model to deviate from its guidelines.
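To make the mechanics concrete, here is a minimal sketch of how an age-gated policy switch could work in principle. It is purely illustrative, not OpenAI’s implementation: the `predict_age` stub and the `PolicyProfile` structure are invented for the example.

```python
# Illustrative sketch only, not OpenAI's actual system: a predicted
# age selects the stricter teen policy profile for the model.
from dataclasses import dataclass

@dataclass(frozen=True)
class PolicyProfile:
    allow_romantic_roleplay: bool
    extra_caution_topics: tuple[str, ...]

ADULT_POLICY = PolicyProfile(allow_romantic_roleplay=True, extra_caution_topics=())
TEEN_POLICY = PolicyProfile(
    allow_romantic_roleplay=False,
    extra_caution_topics=("body image", "disordered eating"),
)

def predict_age(account_signals: dict) -> int:
    """Stand-in for an age-prediction classifier. A real system would
    infer age from account signals; this stub reads a declared age."""
    return account_signals.get("declared_age", 18)

def select_policy(account_signals: dict) -> PolicyProfile:
    """Roll out teen safeguards whenever the predictor says the
    account likely belongs to a minor."""
    return TEEN_POLICY if predict_age(account_signals) < 18 else ADULT_POLICY

print(select_policy({"declared_age": 15}).allow_romantic_roleplay)  # False
```

The design point is simply that the age signal, not the user’s self-description in a prompt, decides which rules apply.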
Actions speak louder than words
OpenAI’s model behavior guidelines prohibit first-person romantic role-playing with teens. Image Credits: OpenAI

OpenAI says the core safety practices for teens are underpinned by four principles that guide the models’ approach:
- Put teen safety first, even when other user interests like “maximum intellectual freedom” conflict with safety concerns;
- Promote real-world support by guiding teens toward family, friends, and local professionals for well-being;
- Treat teens like teens by speaking with warmth and respect, not condescension or treating them like adults; and
- Be transparent by explaining what the assistant can and cannot do, and by reminding teens that it is not a human.
The document also shares several examples of the chatbot explaining why it can’t “roleplay as your girlfriend” or “help with extreme appearance changes or risky shortcuts.”
Lily Li, a privacy and AI lawyer and founder of Metaverse Law, said it was encouraging to see OpenAI take steps to have its chatbot decline to engage in such behavior.
Explaining that one of the biggest complaints advocates and parents have about chatbots is that they relentlessly promote ongoing engagement in a way that can be addictive for teens, she said: “I am very happy to see OpenAI say, in some of these responses, we can’t answer your question. The more we see that, I think that would break the cycle that would lead to a lot of inappropriate behavior or self-harm.”
That said, examples are just that: cherry-picked instances of how OpenAI’s safety team would like the models to behave. Sycophancy, or an AI chatbot’s tendency to be overly agreeable with the user, has been listed as a prohibited behavior in previous versions of the Model Spec, but ChatGPT still engaged in it anyway. That was particularly true of GPT-4o, a model that has been associated with several instances of what experts are calling “AI psychosis.”
Robbie Torney, senior director of AI programs at Common Sense Media, a nonprofit dedicated to protecting kids in the digital world, raised concerns about potential conflicts within the Model Spec’s under-18 guidelines. He highlighted tensions between safety-focused provisions and the “no topic is off limits” principle, which directs models to address any topic regardless of sensitivity.
“We have to understand how the different parts of the spec fit together,” he said, noting that certain sections may push systems toward engagement over safety. His organization’s testing revealed that ChatGPT often mirrors users’ energy, sometimes resulting in responses that aren’t contextually appropriate or aligned with user safety, he said.
In the case of Adam Raine, a teen who died by suicide after months of dialogue with ChatGPT, the chatbot engaged in such mirroring, their conversations show. That case also brought to light how OpenAI’s moderation API failed to prevent unsafe and harmful interactions despite flagging more than 1,000 instances of ChatGPT mentioning suicide and 377 messages containing self-harm content. But that wasn’t enough to stop Adam from continuing his conversations with ChatGPT.
In an interview with TechCrunch in September, former OpenAI safety researcher Steven Adler said this was because, historically, OpenAI had run classifiers (the automated systems that label and flag content) in bulk after the fact, not in real time, so they didn’t properly gate the user’s interaction with ChatGPT.
OpenAI now uses automated classifiers to assess text, image, and audio content in real time, according to the company’s updated parental controls document. The systems are designed to detect and block content related to child sexual abuse material, filter sensitive topics, and identify self-harm. If the system flags a prompt that suggests a serious safety concern, a small team of trained people will review the flagged content to determine if there are signs of “acute distress,” and may notify a parent.
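For a sense of what that shift looks like in practice, here is a rough sketch of gating a conversation turn with OpenAI’s publicly documented moderation endpoint before the model responds, rather than labeling conversations in bulk afterward. The escalation step is an invented placeholder; how OpenAI routes flags internally is not public.

```python
# Sketch: check a message *before* the model responds, instead of
# labeling conversations in bulk after the fact. Uses OpenAI's public
# moderation endpoint; the escalation step is an invented placeholder.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def escalate_to_human_review(message: str) -> None:
    # Placeholder: a real pipeline would enqueue the flagged content
    # for trained reviewers, per OpenAI's description.
    print("Flagged for human review:", message[:80])

def gate_message(user_message: str) -> bool:
    """Return True if the message may proceed to the model."""
    result = client.moderations.create(
        model="omni-moderation-latest",
        input=user_message,
    ).results[0]
    if result.categories.self_harm or result.categories.self_harm_intent:
        escalate_to_human_review(user_message)
        return False
    return True
```

Moving the check into the request path is what allows a flagged message to be stopped or escalated before the model answers, rather than discovered in a later batch pass.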
Torney applauded OpenAI’s new steps toward safety, including its transparency in publishing guidelines for users under 18 years old.
“Not all companies are publishing their policy guidelines in the same way,” Torney said, pointing to Meta’s leaked guidelines, which showed that the company let its chatbots engage in sensual and romantic conversations with children. “This is an example of the kind of transparency that can support safety researchers and the general public in understanding how these models actually function and how they’re supposed to function.”
Ultimately, though, it is the actual behavior of an AI system that matters, Adler told TechCrunch on Thursday.
“I appreciate OpenAI being thoughtful about intended behavior, but unless the company measures the actual behaviors, intentions are ultimately just words,” he said.
Put differently: what’s missing from this announcement is evidence that ChatGPT actually follows the guidelines set out in the Model Spec.
A paradigm shift
OpenAI’s Model Spec guides ChatGPT to steer conversations away from encouraging poor self-image. Image Credits: OpenAI

Experts say that with these guidelines, OpenAI appears poised to get ahead of certain legislation, like California’s SB 243, a recently signed bill regulating AI companion chatbots that goes into effect in 2027.
The Model Spec’s new language mirrors some of the law’s main requirements around prohibiting chatbots from engaging in conversations about suicidal ideation, self-harm, or sexually explicit content. The bill also requires platforms to provide alerts every three hours to minors reminding them that they are speaking to a chatbot, not a real person, and that they should take a break.
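The cadence itself is simple bookkeeping. Below is a minimal sketch of that requirement, assuming the platform tracks when the last alert fired for a minor’s session; the names are illustrative.

```python
# Minimal sketch of SB 243's cadence: alert a minor at least every
# three hours of continuous interaction that they are talking to a
# chatbot and should consider a break. Names here are illustrative.
import time

REMINDER_INTERVAL_SECONDS = 3 * 60 * 60  # every three hours

class MinorSession:
    def __init__(self) -> None:
        self.last_reminder = time.monotonic()

    def maybe_remind(self) -> str | None:
        """Called on each message; returns alert text when one is due."""
        if time.monotonic() - self.last_reminder >= REMINDER_INTERVAL_SECONDS:
            self.last_reminder = time.monotonic()
            return ("Reminder: you are talking to a chatbot, not a real "
                    "person. Consider taking a break.")
        return None
```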
When asked how often ChatGPT would remind teens that they’re talking to a chatbot and ask them to take a break, an OpenAI spokesperson did not share details, saying only that the company trains its models to represent themselves as AI and remind users of that, and that it implements break reminders during “long sessions.”
The company also shared two new AI literacy resources for parents and families. The tips include conversation starters and guidance to help parents talk to teens about what AI can and can’t do, build critical thinking, set healthy boundaries, and navigate sensitive topics.
Taken together, the documents formalize an approach that shares responsibility with caretakers: OpenAI spells out what the models should do, and offers families a framework for supervising how it’s used.
The focus on parental responsibility is notable because it mirrors Silicon Valley talking points. In its recommendations for federal AI regulation posted this week, VC firm Andreessen Horowitz suggested more disclosure requirements for child safety, rather than restrictive requirements, and weighted the onus more toward parental responsibility.
Several of OpenAI’s principles (safety first when values conflict, nudging users toward real-world support, reinforcing that the chatbot isn’t a person) are being articulated as teen guardrails. But several adults have died by suicide or suffered life-threatening delusions, which invites an obvious follow-up: Should those defaults apply across the board, or does OpenAI see them as trade-offs it’s only willing to enforce when minors are involved?
An OpenAI spokesperson countered that the company’s safety approach is designed to protect all users, saying the Model Spec is just one component of a multi-layered strategy.
Li says it has been a “bit of a wild west” so far regarding the legal requirements and tech companies’ intentions. But she feels laws like SB 243, which requires tech companies to disclose their safeguards publicly, will change the paradigm.
“The legal risks will show up now for companies if they advertise that they have these safeguards and mechanisms in place on their website, but then don’t follow through with incorporating these safeguards,” Li said. “Because then, from a plaintiff’s point of view, you’re not just looking at the standard litigation or legal complaints; you’re also looking at potential unfair, deceptive advertising complaints.”