Skip to main content
  1. Home
  2. Computing
  3. News

Research shows even average users can break past AI safety within Gemini and ChatGPT

Everyday users can reveal what AI testing misses.

Add as a preferred source on Google
average-users-break-past-ai-safety-gemini-chatgpt
Aerps.com / Unsplash

What’s happened? A team at Pennsylvania State University found that you don’t need to be a hacker or prompt-engineering genius to break past AI safety; regular users can do it just as well. Test prompts in the research paper revealed clear patterns of prejudice in responses: from assuming engineers and doctors are men, to portraying women in domestic roles, and even linking Black or Muslim people with crime.

  • 52 participants were invited to craft prompts intended to trigger biased or discriminatory responses in 8 AI chatbots, including Gemini and ChatGPT.
  • They found 53 prompts that worked repeatedly on different models, showing consistent bias among them.
  • The biases exposed fell into several categories: gender, race/ethnicity/religion, age, language, disability, cultural bias, historical bias favouring Western nations, etc.

This is important because: This isn’t a story about elite jailbreakers. Average users armed with intuition and everyday language uncovered biases that slipped past AI safety tests. The study didn’t just ask trick questions; it used natural prompts like asking who was late in a doctor-nurse story or requesting a workplace harassment scenario.

  • The study reveals that AI models still carry deep social biases (like gender, race, age, disability, and cultural) that show up with simple prompts, which means bias may emerge in many unexpected ways in everyday use.
  • Notably, newer model versions weren’t always safer. Some performed worse, showing that progress in capabilities doesn’t automatically mean progress in fairness.
Recommended Videos

Why should I care? Since everyday users can trigger problematic responses in AI systems, the actual number of people who could bypass AI guardrails is much larger.

  • AI tools used in everyday chats, hiring tools, classrooms, customer support systems, and healthcare may subtly reproduce stereotypes.
  • It demonstrates that many AI-bias studies focused on complex technical attacks may miss the real-world user-triggered ones.
  • If regular prompts can unintentionally trigger bias, then bias isn’t an exception; it’s baked into how these tools think.

As generative AI becomes mainstream, improving it will require more than patches and filters; it’ll take real users stress-testing AI.

Manisha Priyadarshini
Manisha Priyadarshini is a tech and entertainment writer with over nine years of editorial experience.
In a market where Mac has been aspirational, it’s somehow a better deal than windows machines now
Windows Laptops became so expensive that MacBooks look sensible now
Computer, Electronics, Laptop

For a long time, the laptop buying advice was simple enough. Windows had a more versatile portfolio that brought you affordable, mid-range, high-end, and even gaming options, while MacBooks were known as the easy premium recommendation.

But owing to the pricing circus caused by memory shortages and component price hikes, the equation makes no sense anymore.

Read more
HP’s new RTX 5070 laptop feels like the sweet spot between thin and bulky
The new HyperX Omen 15 combines AMD and Intel and targets portability without fully sacrificing performance.
HP HyperX OMEN 15 Gaming Laptop

Modern gaming laptops have largely drifted toward two extremes lately: massive 16-inch and 18-inch desktop replacements, or ultra-compact 14-inch machines that still feel slightly cramped for serious gaming sessions. That’s exactly why HP’s new HyperX Omen 15 feels refreshing, because it brings back the familiar 15-inch gaming laptop formula with a chassis that still feels portable without sacrificing proper gaming hardware underneath.

HP’s compact HyperX Omen 15 packs RTX 5070 graphics with AMD and Intel options

Read more
Corsair is putting Chinese RAM in mainstream market. It won’t quite end the crisis though
A cheaper DDR5 supplier could shake up the market, but it is not a magic fix
Samsung DDR4 RAM in hand

After months of painfully expensive RAM and SSD prices, the memory market may finally be showing signs of pressure from an unexpected direction: China. New reports suggest that Chinese memory manufacturers are rapidly expanding production of DRAM and NAND chips, and that major hardware brands are starting to take notice. The most notable example so far is Corsair, which has reportedly tested DDR5 memory modules using chips from Chinese DRAM giant ChangXin Memory Technologies, better known as CXMT.

This feels inevitable. Memory prices have remained frustratingly high across PCs, laptops, and storage devices for months. So when Chinese suppliers began offering RAM at nearly half the cost of some global competitors, manufacturers were always going to at least explore the option. According to market reports, some CXMT DDR5 modules are reportedly being sold near the $150 range, while equivalent products from larger global suppliers can hover between $300 and $400.

Read more