What type of study is this?

This is a Quantitative Study study.

October 5, 2025Open Access

BiasCause: Evaluate Socially Biased Causal Reasoning of Large Language Models

Key Points

Majority of tested language models exhibited biased causal reasoning across multiple scenarios.
The evaluation revealed a total of 4,135 biased causal graphs generated by the models.
A novel framework for classifying causal reasoning processes in LLMs was proposed and validated.
3 strategies for reducing biased reasoning were identified, emphasizing the need for bias-free outputs.

Abstract

While large language models (LLMs) already play significant roles in society, research has shown that LLMs still generate content including social bias against certain sensitive groups. While existing benchmarks have effectively identified social biases in LLMs, a critical gap remains in our understanding of the underlying reasoning that leads to these biased outputs. This paper goes one step further to evaluate the causal reasoning process of LLMs when they answer questions eliciting social biases. We first propose a novel conceptual framework to classify the causal reasoning produced by LLMs. Next, we use LLMs to synthesize 1788 questions covering 8 sensitive attributes and manually validate them. The questions can test different kinds of causal reasoning by letting LLMs disclose their reasoning process with causal graphs. We then test 4 state-of-the-art LLMs. All models answer the majority of questions with biased causal reasoning, resulting in a total of 4135 biased causal graphs. Meanwhile, we discover 3 strategies for LLMs to avoid biased causal reasoning by analyzing the "bias-free" cases. Finally, we reveal that LLMs are also prone to "mistaken-biased" causal reasoning, where they first confuse correlation with causality to infer specific sensitive group names and then incorporate biased causal reasoning.

Read Full Paperexternally

Demander à l'IA

Bookmark

View Full Paper

Cite This Study

Xie et al. (Tue,) studied this question.

synapsesocial.com/papers/68e24e59d6d66a53c2472eb3 https://doi.org/https://doi.org/10.48550/arxiv.2504.07997

Demander à l'IA

Bookmark

View Full Paper