Research shows even average users can break past AI safety within Gemini and ChatGPT

Everyday users can reveal what AI testing misses.


What’s happened? A team at Pennsylvania State University found that you don’t need to be a hacker or a prompt-engineering genius to get past AI safety guardrails; regular users can do it just as well. Prompts tested in the study drew responses with clear patterns of prejudice, from assuming engineers and doctors are men, to portraying women in domestic roles, to linking Black or Muslim people with crime.

  • 52 participants were invited to craft prompts intended to trigger biased or discriminatory responses in 8 AI chatbots, including Gemini and ChatGPT.
  • The researchers identified 53 prompts that worked repeatedly across different models, pointing to biases the models share.
  • The biases exposed fell into several categories, including gender, race/ethnicity/religion, age, language, disability, cultural bias, and historical bias favouring Western nations.

This is important because: This isn’t a story about elite jailbreakers. Average users armed with intuition and everyday language uncovered biases that slipped past AI safety tests. Participants didn’t rely on trick questions; they used natural prompts, such as asking who was late in a doctor-nurse story or requesting a workplace harassment scenario.

  • The study shows that AI models still carry deep social biases (around gender, race, age, disability, and culture) that surface with simple prompts, which means bias may emerge in unexpected ways in everyday use.
  • Notably, newer model versions weren’t always safer. Some performed worse, showing that progress in capabilities doesn’t automatically mean progress in fairness.

Why should I care? If everyday users can trigger problematic responses in AI systems, the number of people able to bypass AI guardrails is far larger than the small pool of skilled attackers.

  • AI systems used in everyday chats, hiring, classrooms, customer support, and healthcare may subtly reproduce stereotypes.
  • The findings suggest that AI-bias studies focused on complex technical attacks may miss the biases that ordinary users trigger in real-world use.
  • If regular prompts can unintentionally trigger bias, then bias isn’t an exception; it’s baked into how these tools think.

As generative AI becomes mainstream, improving it will require more than patches and filters; it’ll take real users stress-testing AI.

Manisha Priyadarshini
Manisha Priyadarshini is a tech and entertainment writer with over nine years of editorial experience.