What does this research mean for the field?

In skin lesion classification models, sex biases arise primarily from training data imbalances, whereas age biases consistently favor younger patient groups regardless of data distribution. Novelty: ClaimNovelty.INCREMENTAL. Consensus alignment: ConsensusAlignment.NEUTRAL.

What question did this study set out to answer?

This study evaluates how demographic bias in training data influences skin lesion classification performance.

June 1, 2026

Effect of Demographic Bias on Skin Lesion Classification

Key Points

This study evaluates how demographic bias in training data influences skin lesion classification performance.
Evaluated ResNet-based convolutional models on skin lesions.
Generated controlled datasets using linear programming for demographic characteristics.
Assessed three learning strategies: single-task, reinforcing multi-task, and adversarial learning.
Sex-specific training datasets improved model performance, particularly for males.
Reinforcing and adversarial learning narrowed bias gaps in balanced datasets but were less effective in male-majority settings.
Younger age groups showed consistently higher performance regardless of training data distribution.

Abstract

The influence of bias in datasets on the fairness of model predictions is a topic of ongoing research in various fields. In this study, we evaluate the performance of skin lesion classification using ResNet-based convolutional models, focusing on the impact of demographic bias in training data, particularly variations in patient sex and age. We use a linear programming method to generate datasets with controlled demographic characteristics, allowing systematic investigation of bias effects. Three distinct learning strategies are evaluated: a single-task model, a reinforcing multi-task model, and an adversarial learning scheme.Our sex-based analysis indicates that sex-specific training datasets optimise model performance. Notably, including male patients in the training data improved performance for the male subgroup, even in female-majority cases. Reinforcing and adversarial learning schemes narrowed or eliminated bias gaps in balanced and female-majority datasets. However, these strategies proved less effective in male-majority settings, where models continued to perform better for males than females. The two learning schemes showed marginal bias reduction compared to the baseline model in predominantly male patient populations.Age-based analysis demonstrates comparable baseline performance across the three model approaches, with per formance declining across age categories. Younger groups consistently achieve the highest performance, regardless of training data distribution. Although balanced training yields optimal results for the youngest age category, performance decreases in older categories.We find that sex biases arise mainly from data imbalances, while age biases consistently favour younger groups regardless of distribution. These distinct mechanisms require targeted mitigation strategies. Our work aims to advance equitable AI in medical imaging by addressing these specific sources of disparity.Additionally, cross-dataset validation on two external datasets revealed that domain shifts notably affect performance and demographic bias patterns. The source code and models are available on GitHub: https://github.com/raumannsr/demographic-fairness-extended

Demander à l'IA

Bookmark

View Full Paper