Research & Papers

New study finds conversation topics bias LLM advice more than demographics

Your conversation topic, not your identity, may skew AI advice

Deep Dive

A new study from researchers including Vera Neplenbroek, Gabriele Sarti, Arianna Bisazza, and Raquel Fernández investigates how conversational context affects LLM outputs, especially in high-stakes scenarios like legal, medical, and financial advice. The team analyzed whether LLMs can infer user sociodemographics from a single conversation history and compared that to the impact of psycholinguistic features such as conversation topic, emotions, and readability. They found that LLMs actually struggle to infer demographic traits from one conversation, and while some outcome disparities exist between groups, the magnitude is minimal.

The key driver of disparities turned out to be conversation topics. These topics function as proxies for sociodemographic groups, and they often influence AI advice in unpredictable ways. For example, a user discussing financial planning may receive different advice than one discussing health, even if their demographic background is identical. The paper calls for future research to better understand and mitigate these context-driven biases before deploying LLMs in critical domains.

Key Points
  • LLMs cannot reliably infer user sociodemographics from a single conversation history
  • Conversation topics are the most predictive feature of LLM-generated advice, often acting as proxies for demographics
  • Topics affect advice in unpredictable ways, raising concerns for fair and consistent AI in legal, medical, and financial settings

Why It Matters

Professionals relying on LLMs for high-stakes advice must account for topic-driven bias, not just demographic fairness.