The influence of bias in datasets on the fairness of model predictions is a topic of ongoing research in various fields. In this study, we evaluate the performance of skin lesion classification using ResNet-based convolutional models, focusing on the impact of demographic bias in training data, particularly variations in patient sex and age. We use a linear programming method to generate datasets with controlled demographic characteristics, allowing systematic investigation of bias effects. Three distinct learning strategies are evaluated: a single-task model, a reinforcing multi-task model, and an adversarial learning scheme.Our sex-based analysis indicates that sex-specific training datasets optimise model performance. Notably, including male patients in the training data improved performance for the male subgroup, even in female-majority cases. Reinforcing and adversarial learning schemes narrowed or eliminated bias gaps in balanced and female-majority datasets. However, these strategies proved less effective in male-majority settings, where models continued to perform better for males than females. The two learning schemes showed marginal bias reduction compared to the baseline model in predominantly male patient populations.Age-based analysis demonstrates comparable baseline performance across the three model approaches, with per formance declining across age categories. Younger groups consistently achieve the highest performance, regardless of training data distribution. Although balanced training yields optimal results for the youngest age category, performance decreases in older categories.We find that sex biases arise mainly from data imbalances, while age biases consistently favour younger groups regardless of distribution. These distinct mechanisms require targeted mitigation strategies. Our work aims to advance equitable AI in medical imaging by addressing these specific sources of disparity.Additionally, cross-dataset validation on two external datasets revealed that domain shifts notably affect performance and demographic bias patterns. The source code and models are available on GitHub: https://github.com/raumannsr/demographic-fairness-extended
Raumanns et al. (Fri,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: