What question did this study set out to answer?

This research aims to evaluate and reduce societal stereotypes present in AI-generated images through a structured bias detection rubric.

May 14, 2026 · Open Access

Social stereotypes in AI text-to-image generation

Key Points

Developed a Social Stereotype Index (SSI) to assess biases in outputs from DALL-E-3, Midjourney-6.1, and Stability AI Core.
Audited the three models with 100 queries spanning geocultural, occupational, and adjectival categories.
Implemented prompt refinement and conducted a user study to assess its impact.
Refinement reduced SSI scores by 58% for geocultural, 66% for occupational, and 53% for adjectival categories.
User feedback indicated tensions between reduced bias and contextual alignment; stereotypical imagery was often perceived as 'expected'.
Highlighted the need for text-to-image (T2I) systems to balance ethical debiasing with contextual relevance.

Abstract

Advances in generative AI have enabled visual content creation through text-to-image (T2I) generation. Despite their creative potential, T2I models often replicate and amplify societal stereotypes related to gender, race, and culture. This paper introduces a theory-driven bias detection rubric and a Social Stereotype Index (SSI) to systematically evaluate bias in T2I outputs. We audited the outputs of three major T2I models (DALL-E-3, Midjourney-6.1, and Stability AI Core) with 100 queries across geocultural, occupational, and adjectival categories. Results show recurring stereotypes, including gendered professions, cultural markers, and Western beauty norms. Using our rubric, we applied prompt refinement, which reduced SSI scores by 58% (geocultural), 66% (occupational), and 53% (adjectival). A complementary user study revealed a tension: while refinement mitigates bias, it may weaken contextual alignment, and participants often viewed stereotypical imagery as more “expected.” We call for T2I systems to balance ethical debiasing with contextual relevance, supporting inclusivity without oversimplifying social realities.
