What question did this study set out to answer?

This study aimed to develop and validate machine learning models for predicting end-stage renal disease and mortality in chronic kidney disease patients.

June 7, 2026Open Access

Predicting End-Stage Renal Disease and Mortality in Chronic Kidney Disease Using Machine Learning: Retrospective Cohort Study

Key Points

This study aimed to develop and validate machine learning models for predicting end-stage renal disease and mortality in chronic kidney disease patients.
Developed models using data from 29,677 CKD patients with eGFR <60 mL/min/1.73 m².
Randomly divided cohort into training (80%) and testing (20%) sets.
Evaluated various machine learning algorithms and assessed performance using AUROC and precision-recall curves.
AUROC for ESRD prediction ranged from 0.839 to 0.894, indicating high predictive performance.
All-cause mortality AUROC ranged from 0.752 to 0.774, showing more modest prediction.
Calibration and decision-curve analysis confirmed the models' reliability and clinical utility.

Abstract

Abstract Background Chronic kidney disease (CKD) is a global health burden characterized by heterogeneous progression trajectories. Without timely and appropriate management, CKD can lead to increased morbidity and mortality and a reduced quality of life. Therefore, early identification of patients at high risk of developing end-stage renal disease (ESRD) or mortality is essential to facilitate timely intervention and improve patient outcomes. Objective This study aimed to develop and validate machine learning models to predict ESRD and all-cause mortality in patients with CKD. Methods We developed and validated machine learning models using data from patients with CKD and an estimated glomerular filtration rate of <60 mL/min/1.73 m 2 , who were treated at Taipei Veterans General Hospital between 2011 and 2021. Predictors included 69 routinely available demographic, clinical, medication, laboratory, and echocardiographic variables. The outcomes were ESRD and all-cause mortality. The cohort was randomly divided into training (n=23,741, 80%) and testing (n=5936, 20%) sets. The evaluated models included extreme gradient boosting, light gradient boosting machine, categorical boosting, random forest, and a stacking classifier. Model performance was assessed using the area under the receiver operating characteristic curve (AUROC), the area under the precision-recall curve, calibration, and decision-curve analysis. Supplementary time-to-event analyses were performed using the kidney failure risk equation and survival-based machine learning models. Results A total of 29,677 patients were included in the study. The median age was 79 (IQR 70.0‐88.0) years, and 16,359 (55.1%) were male. Among these patients, 14,993 (50.5%) had hypertension, 7908 (26.6%) had diabetes mellitus, and 1768 (6%) had cancer. During follow-up, 649 patients (2.2%) developed ESRD and 3631 (12.2%) died. The models demonstrated high predictive performance for ESRD, with AUROCs ranging from 0.839 to 0.894. For all-cause mortality, the predictive performance was more modest, with AUROCs ranging from 0.752 to 0.774. Given the low incidence of ESRD in this cohort, model performance was additionally evaluated using precision-recall curves. The area under the precision-recall curve ranged from 0.172 to 0.216 for ESRD prediction and from 0.330 to 0.356 for all-cause mortality across models. Calibration and decision-curve analyses supported model reliability and clinical utility. Conclusions Machine learning algorithms may serve as useful tools for risk stratification of ESRD and all-cause mortality in patients with CKD, with the potential to support more individualized clinical management.

Read Full Paperexternally

Mark Helpful

Bookmark

Relay

View Full Paper