What question did this study set out to answer?

This research aims to examine the presence of gender-based bias in AI models' sentencing recommendations for various crimes.

March 28, 2026Open Access

Virtual Judgments: Testing the Chivalry Hypothesis and Attribution Theory in AI Models

Key Points

This research aims to examine the presence of gender-based bias in AI models' sentencing recommendations for various crimes.
Utilized experimental vignette design to assess AI responses to manipulated gender scenarios.
Evaluated six large language models for sentencing recommendations and stigmatization ratings.
Analyzed within-model consistency and variation across different AI systems and offense types.
Significant gender disparities appeared in intimate partner violence scenarios, favoring leniency towards female perpetrators.
Weak or no gender differences were observed in robbery and financial fraud cases.
AI models differed significantly in their gender-based sentencing recommendations and levels of stigmatization.

Abstract

Drawing on the chivalry hypothesis and attribution theory, this study examines whether large language models (LLMs) exhibit gender-based differentiation in sentencing recommendations and stigmatizing evaluations across intimate partner violence, robbery, and financial fraud. Using an experimental vignette design, six AI models evaluated gender-manipulated criminal scenarios, providing sentencing recommendations and stigmatization ratings. Repeated prompts were used to examine both within-model consistency and between-model variation, revealing offense-specific patterns. Within models, significant gender disparities emerged in intimate partner violence scenarios, with more lenient sentencing and lower stigmatization of female perpetrators, whereas such differences were weak or absent in robbery and financial fraud. In addition, within-model analyses revealed that the strength and consistency of the association between stigmatizing evaluations and sentencing severity varied across AI systems. Between-model analyses revealed substantial heterogeneity across offense types, with AI systems differing not only in their gender-based sentencing recommendations, but also in overall levels of stigmatizing evaluations in response to gender manipulations. Overall, the results suggest that AI systems neither uniformly replicate nor fully transcend human gender biases, underscoring the need for cautious deployment of AI tools in legal contexts.

Bookmark

View Full Paper

Cite This Study

Lam et al. (Thu,) studied this question.

synapsesocial.com/papers/69c772818bbfbc51511e30ac https://doi.org/https://doi.org/10.1007/s10506-026-09504-x

Bookmark

View Full Paper