May 13, 2026Open Access

Interpretable machine learning to predict NOAF in ICU patients with CKD: validation in US and Chinese cohorts

Q: What is the clinical evidence from this study?

Study design: Cohort. Population: Critically ill patients with chronic kidney disease (CKD) at risk for New-Onset Atrial Fibrillation (NOAF) (n=19865). Intervention: Random Forest machine learning predictive model vs. Other machine learning models (Gradient Boosting, XGBoost, Logistic Regression, MLP, LightGBM). Primary outcome: Area under the receiver operating characteristic curve (AUC) for predicting NOAF.

Resultado clave

A Random Forest machine learning model utilizing 12 clinical features accurately predicted new-onset atrial fibrillation in critically ill patients with chronic kidney disease, achieving an AUC of 0.855 in internal validation.

Diseño del estudio

Tipo

Cohort (n=19,865)

Multicéntrico

Sí

PICO estructurado

Can machine learning models accurately predict new-onset atrial fibrillation in critically ill patients with chronic kidney disease?

Población

19,865 critically ill patients with chronic kidney disease (CKD) across three cohorts: MIMIC-IV derivation cohort (n=6,476), US-based eICU-CRD external validation cohort (n=12,509), and Chinese ICU database from Weifang People's Hospital external validation cohort (n=880).

Intervención

Machine learning predictive models (best performing: Random Forest) using 12 clinical features (age, SOFA score, minimum heart rate, congestive heart failure, average heart rate, minimum systolic blood pressure, mechanical ventilation use, minimum oxygen saturation, average respiratory rate, minimum magnesium, SAPS II score, and maximum white blood cell count) interpreted with SHAP.

Resultado

Prediction of New-Onset Atrial Fibrillation (NOAF) measured by Area Under the Curve (AUC)

An interpretable Random Forest machine learning model using 12 routine clinical features can effectively predict new-onset atrial fibrillation in critically ill patients with chronic kidney disease across different international cohorts.

Limitaciones

Retrospective study design
Attenuation in model performance during external validation
Lack of data on potentially important variables such as height and NT-proBNP

Resumen

Objective Critically ill patients with chronic kidney disease (CKD) are at high risk for New-Onset Atrial Fibrillation (NOAF), which significantly increases their risk of adverse events. Early detection of NOAF is crucial for prompt intervention and better outcomes. This study aims to develop and validate predictive models for the early identification and stratification of NOAF risk in this vulnerable population. Methods We developed and validated a predictive model using a cohort of 6,476 critically ill patients with CKD from the Medical Information Mart for Intensive Care-IV (MIMIC-IV) database. After performing feature selection via least absolute shrinkage and selection operator (Lasso) logistic regression, we trained six machine learning (ML) models. These algorithms included: Random Forest, Gradient Boosting, eXtreme Gradient Boosting (XGBoost), Logistic Regression (LR), Multi-layer Perceptron (MLP), and Light Gradient Boosting Machine (LightGBM). The best-performing model was interpreted using SHAP to provide clinical insights. Its robustness and generalizability were confirmed through rigorous external validation on two distinct international cohorts: the US-based eICU-CRD (eICU Collaborative Research Database) ( n = 12,509) and a Chinese ICU database from Weifang People’s Hospital ( n = 880). Results Ultimately, 12 predictive features were ultimately selected: age, SOFA score, minimum heart rate, congestive heart failure, average heart rate, minimum systolic blood pressure (SBP), mechanical ventilation use, minimum oxygen saturation (SpO 2 ), average respiratory rate, minimum magnesium, SAPS II score, and maximum white blood cell (WBC) count. The Random Forest model demonstrated the best overall performance, achieving an area under the receiver operating characteristic curve (AUC) of 0.855 in internal validation. The model’s robustness was confirmed through external validation on two independent cohorts, yielding an AUC of 0.760 on the eICU-CRD and 0.740 on the Weifang People’s Hospital database. According to the SHAP analysis, age, average heart rate, minimum heart rate, SOFA score, and SAPS II were the top five most influential predictors for NOAF. Conclusion We developed an interpretable machine learning model to predict NOAF in critically ill CKD patients, demonstrating strong generalizability through external validation on both a large US cohort (eICU-CRD) and a single-center Chinese cohort (Weifang People’s Hospital). SHAP analysis enhanced model interpretability.

Me gusta

Guardar

Ver artículo completo

Cite This Study

Zhang et al. (Wed,) conducted a cohort in Critically ill patients with chronic kidney disease (CKD) at risk for New-Onset Atrial Fibrillation (NOAF) (n=19,865). Random Forest machine learning predictive model vs. Other machine learning models (Gradient Boosting, XGBoost, Logistic Regression, MLP, LightGBM) was evaluated on Area under the receiver operating characteristic curve (AUC) for predicting NOAF. A Random Forest machine learning model utilizing 12 clinical features accurately predicted new-onset atrial fibrillation in critically ill patients with chronic kidney disease, achieving an AUC of 0.855 in internal validation.

synapsesocial.com/papers/6a0aa7060830b72fa67390df https://doi.org/https://doi.org/10.3389/fmed.2026.1816221

Me gusta

Guardar

Ver artículo completo