The United Nations’ Sustainable Development Goals (UN SDGs) prioritise inclusive and fair employment. However, AI-powered recruitment tools, particularly Large Language Models (LLMs), raise concerns about potential demographic bias. This paper presents a controlled synthetic dataset and methodology to measure how sensitive attributes (e.g., race, gender, age) influence candidate rankings and pairwise comparisons in LLM-based hiring pipelines. Specifically, we generated a balanced dataset of 1,000 synthetic candidate profiles (each including a cover letter) and evaluated it using 28 frontier LLMs, spanning proprietary (e.g., OpenAI GPT, Gemini, Grok, Claude) and open-source (e.g., Llama, GigaChat) models. Synthetic data eliminates real-world demographic and occupational confounders, ensuring that observed disparities reflect only the LLMs’ intrinsic behaviour. Results show that professional attributes (e.g., skills, experience) are the primary ranking drivers, with 76%–80% of these effects statistically significant; however, 8%–9% of demographic attributes exhibit persistent, significant biases across multiple LLMs. We develop a “bias map” quantifying LLM performance, emphasising that mitigating even minor biases in automated hiring is critical to avoid perpetuating employment inequities and to uphold the UN SDGs’ inclusive vision.
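To make the measurement setup more concrete, the sketch below shows one way a counterfactual pairwise-comparison probe of a single sensitive attribute could be implemented. It is a minimal illustration under stated assumptions, not the paper's actual pipeline: the profile fields, the prompt wording, the query_llm stub, and the choice of a binomial test against a 50/50 preference rate are all introduced here for illustration.

```python
# Minimal sketch: counterfactual pairwise probe for one sensitive attribute.
# Profile fields, prompt text, and query_llm are hypothetical placeholders,
# not the paper's protocol.
import random
from dataclasses import dataclass, replace
from scipy.stats import binomtest


@dataclass
class Profile:
    name: str
    gender: str            # sensitive attribute under test
    years_experience: int
    skills: str


def render(p: Profile) -> str:
    return (f"Candidate {p.name}: {p.gender}, "
            f"{p.years_experience} years of experience, skills: {p.skills}")


def query_llm(prompt: str) -> str:
    """Placeholder for a chat-completion call; replace with a real client.
    Expected to return 'A' or 'B'."""
    return random.choice(["A", "B"])


def counterfactual_preference_test(profiles, attr="gender", alt_value="female"):
    """Compare each profile against an identical copy differing only in the
    sensitive attribute, and count how often the LLM prefers the original.
    Under no bias, the preference rate should be close to 0.5."""
    wins_original, total = 0, 0
    for p in profiles:
        counterfactual = replace(p, **{attr: alt_value})
        prompt = ("Which candidate is stronger for a software engineer role? "
                  "Answer with 'A' or 'B' only.\n"
                  f"A) {render(p)}\nB) {render(counterfactual)}")
        if query_llm(prompt).strip().upper().startswith("A"):
            wins_original += 1
        total += 1
    # Two-sided binomial test against the unbiased 0.5 preference rate.
    return binomtest(wins_original, total, p=0.5)


if __name__ == "__main__":
    candidates = [Profile(f"cand_{i}", "male", 3 + i % 7, "Python, SQL")
                  for i in range(100)]
    result = counterfactual_preference_test(candidates)
    print(f"preference for original: {result.statistic:.2f}, "
          f"p-value: {result.pvalue:.3f}")
```

In this style of probe, a small p-value would indicate that flipping the sensitive attribute alone systematically shifts the LLM's pairwise decisions, which is the kind of disparity the bias map aggregates across attributes and models.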
Eldar Jalilzade
Maksim Kalameyets
Shrikant Malviya
Newcastle University
Durham University
DOI: https://doi.org/10.1109/bigdata66926.2025.11401029