Los puntos clave no están disponibles para este artículo en este momento.
Osteoarthritis (OA) severity grading from hand distal interphalangeal (DIP) joint radiographs using the Kellgren–Lawrence (KL) scale is challenged by severe class imbalance, with higher grades (KL3 and KL4) markedly underrepresented in clinical datasets. To address this limitation, we propose a VGG19-based classification framework that systematically evaluates six training strategies targeting imbalance at the data level, algorithmic level, or in combination. Synthetic images for minority classes were generated using CycleGAN and subsequently filtered through rheumatologist validation. The evaluated strategies include baseline training, rheumatologist-validated synthetic augmentation (SD), oversampling (OS), focal loss (FL) optimization, and multiple combinations of these approaches. The results show that strategies incorporating oversampling demonstrated the most consistent and statistically robust improvements in minority-class performance. Specifically, the combination of synthetic data and oversampling (SD + OS) achieved the highest binary OA sensitivity (96.12%) and significantly improved OA F1 score compared to baseline (0.613 vs. 0.416, p = 0.029). The full combined strategy (SD + OS + FL) yielded the highest KL3 F1 score (0.527 vs. 0.280 baseline, p = 0.048) and significantly improved KL4 F1 score (0.730 vs. 0.570 baseline, p = 0.150). Importantly, all strategies maintained higher or similar overall performance with no significant change in majority-class performance (p > 0.10), indicating that improvements in minority classes were not achieved at the expense of sacrificing majority classes or overall model reliability. These findings suggest that the proposed imbalance-mitigation strategies may improve minority class OA detection, particularly when oversampling and validated synthetic augmentation are combined. It is worth noting that the above results are derived from a held-out test set comprising 1626 samples, among which only 43 are OA-positive due to data imbalance. The results should be treated as preliminary findings subject to change upon validation in larger cohorts of OA patients.
Tank et al. (Wed,) studied this question.