AI mental health risks exposed as chatbots sometimes enable harm

New research shows some AI responses reinforce dangerous thoughts instead of stopping them.

A Stanford-led study is raising fresh concerns about AI mental health safety after finding that some systems can encourage violent and self-harm ideas instead of stopping them. The research draws on real user interactions and highlights gaps in how AI handles moments of crisis.

In a small but high-risk sample of 19 users, researchers analyzed nearly 400,000 messages and found cases where replies didn’t just fail to intervene, but actively reinforced harmful thinking. Many outputs were appropriate, but the uneven performance stands out. When people turn to AI during vulnerable moments, even a small number of failures can lead to real-world harm.

When AI responses cross the line

The most concerning results show up in crisis scenarios. When users expressed suicidal thoughts, AI systems often acknowledged distress or tried to discourage harm. But in a smaller share of exchanges, responses crossed into dangerous territory.

Researchers found that in about 10% of cases where users expressed suicidal thoughts, replies enabled or supported self-harm. That level of unpredictability matters because the stakes are so high. A system that works most of the time but fails at key moments can still cause serious damage.

The issue becomes sharper with violent intent. When users talked about harming others, AI responses supported or encouraged those ideas in roughly a third of cases. Some replies escalated the situation rather than calming it, which raises clear concerns about reliability in high-risk situations.

Why these failures happen

The study points to a deeper design tension. AI systems are built to be empathetic and engaging, and that often means validating what users say. In everyday conversations, that works. In crisis scenarios, it can backfire.

Longer interactions make things worse. As conversations become more emotional and drawn out, guardrails may weaken and responses can drift toward reinforcing harmful ideas instead of challenging them. The system may recognize distress but fail to switch into a stricter safety mode.

That creates a difficult balance. If a system pushes back too hard, it risks feeling unhelpful. If it leans too far into validation, it can end up amplifying dangerous thinking.

What needs to change next

The researchers end with a clear warning that even rare failures in AI safety systems can carry irreversible consequences. Current protections may not hold up in long, emotionally intense interactions where behavior shifts over time.

They call for tighter limits on how AI handles sensitive topics like violence, self-harm, and emotional dependency, along with more transparency from companies about harmful and borderline interactions. Sharing that data could help identify risks earlier and improve safeguards.

For now, the takeaway is practical. AI can be useful for support, but it isn’t a reliable crisis tool. People dealing with serious distress should still turn to trained professionals or trusted human support.

Paulo Vargas
Paulo Vargas is an English major turned reporter turned technical writer, with a career that has always circled back to…