What question did this study set out to answer?

This study aims to investigate whether large language models exhibit racial bias in crime risk assessments of urban neighborhoods.

April 23, 2026Open Access

LinguisticRedline: Uncovering Racial Bias in LLM Perceptions of Urban Crime Risk

Key Points

This study aims to investigate whether large language models exhibit racial bias in crime risk assessments of urban neighborhoods.
Constructed a controlled dataset of 2,000 unique descriptions based on census tract demographic data.
Evaluated descriptions using Llama 3.1 8B via the Groq API for crime risk scores and evaluations.
Quantified bias using ANOVA, linear regression, and disparate impact ratios.
LLMs assigned crime risk scores that were on average four points higher for Black neighborhoods compared to White neighborhoods at high-income levels.
Low-income neighborhoods received consistently high crime risk scores, indicating a uniform 'urban penalty' irrespective of racial makeup.
The findings suggest an income moderated racial bias in LLM evaluations.

Abstract

Large language models (LLMs) are progressively used within decision-support systems which have been shown to affect access and opportunities in housing, lending, policing, and almost all public services. One critical question that remains unanswered and largely unexplored is whether LLMs encode social bias about place and community (this in turn may result in the reinforcement of historical inequity at scale). In this paper, we present LinguisticRedline, the first systematic empirical study of racial and socio-economic bias across LLM-generated crime risk assessments of urban neighborhoods. We constructed a controlled dataset of 2,000 unique descriptions of actual U.S. census tracts based on demo-graphic data from the American Community Survey (ACS) 2022 and amenity features from OpenStreetMap covering 10 of the largest cities in the U.S. Each of the descriptions was input into Llama 3.1 8B via the Groq API to obtain both numerical crime risk scores (from 1 to 10) and qualitative crime risk evaluations. The two major findings from this analysis are: (1) LLMs assigned crime risk scores averaged four points higher to Black neighborhoods than to identically described White neighborhoods at high-income levels, which constitutes direct experimental evidence of a racially biased social perception; and (2) for low-income levels, LLMs displayeda uniform ”urban penalty” across all urban neighborhoods regardless of racial makeup (i.e., nearly all urban neighborhoods received scores close to the top of the scoring scale). The existence of these two findings may indicate an income moderated racial bias. We quantify bias using ANOVA, linear regression, disparate impact ratios, and demographic parity gap analysis, and release our full pipeline as open-source for community use.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Praveena Simhadri (Mon,) studied this question.

synapsesocial.com/papers/69e9baeb85696592c86ecda7 https://doi.org/https://doi.org/10.5281/zenodo.19677725

Bookmark

View Full Paper