An interpretable XGBoost machine learning model using clinical data from the first 6 hours of ICU admission predicted in-hospital mortality in heart failure patients with an AUC of 0.797.
Cohort (n=12,110)
No
Can an interpretable machine learning model using early ICU data accurately predict in-hospital mortality in adult ICU patients with heart failure?
An interpretable XGBoost machine learning model using data from the first 6 hours of ICU admission can accurately predict in-hospital mortality in heart failure patients, potentially aiding early risk stratification.
Estimación del efecto: AUC 0.797 (95% CI 0.772-0.822)
valor p: p=<0.001
Heart failure (HF) remains a major cause of morbidity and mortality worldwide, and acute decompensation frequently necessitates intensive care. Early identification of high-risk patients is essential, yet traditional HF risk scores were developed largely in chronic or ward-based cohorts and often fail to capture early physiologic deterioration in the ICU. Explainable machine-learning (ML) models may improve early risk stratification. Using the MIMIC-IV v3.1 database, we conducted a retrospective study of adult ICU patients with heart failure identified by ICD-10 codes (I50.x) between October 1, 2015 and 2022. Predictors included demographics, comorbidities, pre-ICU cardiovascular medications, and vital signs and laboratory tests obtained within 0–6 h of ICU admission. Implausible values were removed; missing data were imputed using medians or modes. An XGBoost classifier was trained with an 80/20 stratified split and class weighting. Performance in the test cohort was assessed using AUC, average precision, sensitivity, specificity, F1-score, and Brier score. SHapley Additive exPlanations (SHAP) were used for global and feature-level interpretability. A total of 12,110 ICU patients with HF were included, of whom 2,041 (16.9%) died in hospital. In the test cohort (n = 2,422), the model achieved an AUC of 0.797 (95% CI, 0.772–0.822) and an average precision of 0.491. Sensitivity and specificity were 0.632 and 0.793 at the default threshold; using the Youden-optimal threshold increased sensitivity to 0.708. Kaplan–Meier analysis demonstrated significant separation of survival curves across predicted risk quartiles (log-rank P < 0.001), supporting the survival stratification of early risk predictions. SHAP analysis identified FiO₂, age, lactate, Charlson comorbidity index, BUN, and systolic blood pressure as the strongest predictors, revealing clinically coherent non-linear relationships and interactions. These findings suggest that an interpretable model based on routinely available early ICU data may facilitate structured and quantitative risk stratification within the first hours of admission, thereby assisting clinicians in identifying high-risk patients who may benefit from closer monitoring and timely escalation of supportive care, rather than replacing clinical judgment.
Sun et al. (Thu,) conducted a cohort in Acute decompensated heart failure in the ICU (n=12,110). XGBoost machine learning model was evaluated on In-hospital all-cause mortality (AUC 0.797, 95% CI 0.772-0.822, p=<0.001). An interpretable XGBoost machine learning model using clinical data from the first 6 hours of ICU admission predicted in-hospital mortality in heart failure patients with an AUC of 0.797.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: