TLDR: Large language models (LLMs) are highly sensitive to subtle identity markers in user language, such as race, gender, and age. A new study reveals that these sociolinguistic patterns consistently bias LLM responses across critical applications like medical advice, legal information, job salary recommendations, and political factual questions, leading to potentially harmful and unfair outcomes. The research highlights the urgent need for comprehensive bias assessments before deploying LLMs in user-facing applications.
Large language models, or LLMs, are becoming an integral part of our daily lives, from offering medical advice to guiding job seekers. While these advanced AI systems are known for their ability to understand and generate human-like text, new research sheds light on a less understood aspect: how subtly embedded identity markers in our language can significantly influence their responses.
A groundbreaking study titled “Language Models Change Facts Based on the Way You Talk” reveals that LLMs are remarkably sensitive to linguistic patterns that hint at a user’s identity. This sensitivity can lead to biased outcomes in critical applications, even when the questions asked are factual and should yield impartial answers.
Uncovering Hidden Biases in AI
The researchers, Matthew Kearney, Reuben Binns, and Yarin Gal from Oxford University, conducted the first comprehensive analysis of how identity markers in user writing bias LLM responses. They examined five high-stakes LLM applications: medical advice, legal information, government benefits, job salaries, and politically charged factual questions. Unlike previous studies that might explicitly provide identity information to the AI, this research focused on the subtle, implicit cues present in real human-LLM conversations.
The study utilized the PRISM Alignment Dataset, which contains thousands of conversations between users and various language models, along with demographic information about the users. By prepending these conversations to a new set of first-person bias benchmark questions, the researchers could observe how LLMs altered their responses based solely on the sociolinguistic information in the conversation history.
Consistent and Concerning Findings
The findings are stark: both Llama3 (70 billion parameters) and Qwen3 (32 billion parameters), two widely used open-source LLMs, showed high sensitivity to a user’s ethnicity, gender, and age across all applications. In some cases, the models changed their answers in over 50% of the questions asked, demonstrating a consistent pattern of bias rather than random variations.
Specific Examples of Bias:
-
Medical Advice: For the same symptoms, LLMs applied different standards of care. Non-White individuals were generally more likely to be recommended to seek medical attention compared to White individuals, with the exception of Mixed ethnicity individuals. Qwen3 notably showed a concerning bias against non-binary users, recommending them to seek medical attention significantly less than male users.
-
Job Salaries: When recommending starting salaries for identical job qualifications, LLMs suggested lower salaries for non-White and Mixed ethnicity applicants compared to White applicants. Conversely, Llama3 recommended higher salaries for female users, and Qwen3 for non-binary users, compared to male users.
-
Political Information: LLMs were more likely to align their answers with a conservative political worldview when asked factual questions by older individuals, and with a liberal worldview when asked by younger individuals. They also tended to give politically liberal responses to Hispanic, non-binary, or female users, while giving more conservative responses to Black users.
-
Legal Information: Qwen3 was less likely to provide legally advantageous information to Mixed ethnicity individuals but more likely to do so for Black individuals. Llama3, on the other hand, was more inclined to give advantageous legal information to non-binary and female individuals.
-
Government Benefits: Both Llama3 and Qwen3 were less likely to inform non-binary and female individuals that they were eligible for government benefits, despite eligibility criteria being independent of gender.
Beyond these specific examples, the study also found significant biases related to a user’s religion, region of birth, and region of residence, with LLMs altering responses for these groups in 50% or more of questions in some applications.
Also Read:
- AI’s Unequal Narratives: How Language Models Constrain Queer Stories
- Unlocking LLM Potential: A New Approach to Positional Bias
Implications for Real-World AI Deployment
These findings raise serious concerns about the widespread deployment of LLMs in user-facing applications. The biases observed could lead to harmful disparities in medical care, exacerbate wage gaps, and even create different factual realities for people of different identities. The subtle nature of this sociolinguistic bias makes it particularly challenging to detect and mitigate using existing debiasing techniques.
The researchers emphasize the critical need for thorough assessments of LLM use in real-world applications before deployment. They also provide new tools to help evaluate how subtle identity encodings in user language impact model decisions, urging organizations to develop their own sociolinguistic bias benchmarks.
While the study acknowledges limitations, such as the fixed wording of bias benchmark questions and the focus on yes/no or single-number answers, it provides a crucial foundation for understanding and addressing this complex issue. Future work will need to explore more open-ended prompts and additional LLMs and identities, including intersectional ones.
For more in-depth details, you can read the full research paper here.


