Your AI could copy our worst instincts, but there’s a fix for AI social bias

Researchers tested major models and found consistently ingroup-friendly language, plus a training approach that reduced the gap.

Cash Macanaya / Unsplash

Chatbots can sound neutral, but a new study suggests some models still pick sides in a familiar way. When prompted about social groups, the systems tended to be warmer toward an ingroup and colder toward an outgroup. That pattern is a core marker of AI social bias.

The research tested multiple big models, including GPT-4.1 and DeepSeek-3.1. It also found the effect can be pushed around by how you frame a request, which matters because everyday prompts often include identity labels, intentionally or not.

There’s also a more constructive takeaway. The same team reports a mitigation method, ION (Ingroup-Outgroup Neutralization), that reduced the size of those sentiment gaps, which hints that this isn’t just something users have to live with.

The bias showed up across models

Researchers prompted several large language models to generate text about different groups, then analyzed the outputs for sentiment patterns and clustering. The result was repeatable: more positive language for ingroups, more negative language for outgroups.

It wasn’t limited to one ecosystem. The paper lists GPT-4.1, DeepSeek-3.1, Llama 4, and Qwen-2.5 among the models where the pattern appeared.

Targeted prompts intensified it. In those tests, negative language aimed at outgroups increased by about 1.19% to 21.76% depending on the setup.
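To make the idea of an ingroup versus outgroup sentiment gap concrete, here is a minimal sketch of how such a gap could be computed over generated text. It is not the paper's method: the tiny word lists, the `sentiment_score` and `sentiment_gap` functions, and the sample sentences are all illustrative assumptions, and a real evaluation would use a proper sentiment model.

```python
from statistics import mean

# Hypothetical toy lexicon, for illustration only. The study's actual
# sentiment analysis is not described in this article.
POSITIVE = {"kind", "honest", "warm", "smart", "fair"}
NEGATIVE = {"rude", "lazy", "cold", "dishonest", "hostile"}


def sentiment_score(text: str) -> float:
    """Return a score in [-1, 1]: +1 is all positive words, -1 all negative."""
    words = [w.strip(".,!?").lower() for w in text.split()]
    pos = sum(w in POSITIVE for w in words)
    neg = sum(w in NEGATIVE for w in words)
    total = pos + neg
    return 0.0 if total == 0 else (pos - neg) / total


def sentiment_gap(ingroup_texts: list[str], outgroup_texts: list[str]) -> float:
    """Mean ingroup sentiment minus mean outgroup sentiment."""
    ingroup_mean = mean(sentiment_score(t) for t in ingroup_texts)
    outgroup_mean = mean(sentiment_score(t) for t in outgroup_texts)
    return ingroup_mean - outgroup_mean


# Made-up outputs standing in for model generations.
ingroup = ["We are kind and honest.", "We are fair."]
outgroup = ["They are rude and lazy.", "They are warm but dishonest."]
print(sentiment_gap(ingroup, outgroup))
```

A positive gap means the text about the ingroup reads warmer than the text about the outgroup. A gap near zero is what a neutral assistant would aim for.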

Where this hits in real products

The paper argues the issue goes beyond factual knowledge about groups: identity cues can trigger social attitudes in the writing itself. In other words, the model can drift into a group-coded voice.

That’s a risk for tools that summarize arguments, rewrite complaints, or moderate posts. Small shifts in warmth, blame, or skepticism can change what readers take away, even when the text stays fluent.

Persona prompts add another lever. When models were asked to respond as specific political identities, outputs shifted in sentiment and embedding structure. Useful for roleplay, risky for “neutral” assistants.

A mitigation path that can be measured

ION combines fine-tuning with a preference-optimization step to narrow ingroup versus outgroup sentiment differences. In the reported results, it cut sentiment divergence by up to 69%.
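For a sense of what a figure like "up to 69%" means, a relative reduction compares the gap before and after mitigation. The numbers below are hypothetical and chosen only to land on 69%. The article does not say which divergence measure the paper uses, so `relative_reduction` is just the generic formula.

```python
def relative_reduction(before: float, after: float) -> float:
    """Fraction by which a gap shrank: (before - after) / before."""
    if before <= 0:
        raise ValueError("the baseline gap must be positive")
    return (before - after) / before


# Hypothetical gaps, not figures from the paper.
gap_before = 0.50
gap_after = 0.155
print(f"{relative_reduction(gap_before, gap_after):.0%}")  # prints 69%
```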

That’s encouraging, but the paper doesn’t give a timeline for adoption by model providers. So for now, it’s on builders and buyers to treat this like a release metric, not a footnote.

If you ship a chatbot, add identity-cue tests and persona prompts to QA before updates roll out. If you’re a daily user, keep prompts anchored in behaviors and evidence instead of group labels, especially when tone matters.
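One way to turn that advice into a release check is sketched below. Everything here is an assumption made for illustration: the template, the placeholder group labels, the `identity_cue_spread` and `passes_release_gate` helpers, and the 0.1 threshold. The `generate` and `score` arguments stand in for whatever model call and tone scorer a team already uses.

```python
from typing import Callable

# Hypothetical prompt template and placeholder labels. A real suite would
# cover the identity cues that matter for the product.
TEMPLATES = [
    "Summarize this complaint from a {group} customer: the order arrived late.",
]
GROUPS = ["group A", "group B"]


def identity_cue_spread(
    generate: Callable[[str], str],
    score: Callable[[str], float],
    templates: list[str],
    groups: list[str],
) -> float:
    """Largest tone difference between any two group labels on one template."""
    worst = 0.0
    for template in templates:
        scores = [score(generate(template.format(group=g))) for g in groups]
        worst = max(worst, max(scores) - min(scores))
    return worst


def passes_release_gate(spread: float, threshold: float = 0.1) -> bool:
    """Fail the release when swapping the label moves tone past the threshold."""
    return spread <= threshold


# Stand-ins for a real model and a real tone scorer.
def biased_model(prompt: str) -> str:
    if "group B" in prompt:
        return "The customer is being unreasonable."
    return "The customer reports a late order."


def toy_score(text: str) -> float:
    return -1.0 if "unreasonable" in text else 0.0


spread = identity_cue_spread(biased_model, toy_score, TEMPLATES, GROUPS)
print(spread, passes_release_gate(spread))
```

The check only swaps the group label and holds the rest of the prompt fixed, so any change in tone can be attributed to the identity cue rather than the content of the request.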

Paulo Vargas
Paulo Vargas is an English major turned reporter turned technical writer, with a career that has always circled back to…