Abstract Background Chronic kidney disease (CKD) is a global health burden characterized by heterogeneous progression trajectories. Without timely and appropriate management, CKD can lead to increased morbidity and mortality and a reduced quality of life. Therefore, early identification of patients at high risk of developing end-stage renal disease (ESRD) or mortality is essential to facilitate timely intervention and improve patient outcomes. Objective This study aimed to develop and validate machine learning models to predict ESRD and all-cause mortality in patients with CKD. Methods We developed and validated machine learning models using data from patients with CKD and an estimated glomerular filtration rate of <60 mL/min/1.73 m 2 , who were treated at Taipei Veterans General Hospital between 2011 and 2021. Predictors included 69 routinely available demographic, clinical, medication, laboratory, and echocardiographic variables. The outcomes were ESRD and all-cause mortality. The cohort was randomly divided into training (n=23,741, 80%) and testing (n=5936, 20%) sets. The evaluated models included extreme gradient boosting, light gradient boosting machine, categorical boosting, random forest, and a stacking classifier. Model performance was assessed using the area under the receiver operating characteristic curve (AUROC), the area under the precision-recall curve, calibration, and decision-curve analysis. Supplementary time-to-event analyses were performed using the kidney failure risk equation and survival-based machine learning models. Results A total of 29,677 patients were included in the study. The median age was 79 (IQR 70.0‐88.0) years, and 16,359 (55.1%) were male. Among these patients, 14,993 (50.5%) had hypertension, 7908 (26.6%) had diabetes mellitus, and 1768 (6%) had cancer. During follow-up, 649 patients (2.2%) developed ESRD and 3631 (12.2%) died. The models demonstrated high predictive performance for ESRD, with AUROCs ranging from 0.839 to 0.894. For all-cause mortality, the predictive performance was more modest, with AUROCs ranging from 0.752 to 0.774. Given the low incidence of ESRD in this cohort, model performance was additionally evaluated using precision-recall curves. The area under the precision-recall curve ranged from 0.172 to 0.216 for ESRD prediction and from 0.330 to 0.356 for all-cause mortality across models. Calibration and decision-curve analyses supported model reliability and clinical utility. Conclusions Machine learning algorithms may serve as useful tools for risk stratification of ESRD and all-cause mortality in patients with CKD, with the potential to support more individualized clinical management.
Chen et al. (Fri,) studied this question.