Aug-SAPS II improved mortality prediction calibration and discrimination with an AUC of 0.866 compared to AUC of 0.841 for SAPS II, and reduced hospital-level random-effects variance by 35.3%.
Cohort
Yes
Does an augmented SAPS II model incorporating comorbidities improve in-hospital mortality prediction and benchmarking in adult ICU patients compared to the original SAPS II?
2,496,734 adult (≥18 years) intensive care unit (ICU) stays in France between 2015 and 2025 (derivation cohort n=2,156,332; validation cohort n=242,465; sensitivity cohort n=97,937), mean age 62.5 years, 36.6% female.
Augmented SAPS II (aug-SAPS II) model incorporating SAPS II score, demographic characteristics, and ICD-10-derived Elixhauser comorbidities
Original SAPS II model and recalibrated SAPS II (r-SAPS II)
In-hospital mortality (assessed via standardized mortality ratios [SMRs], calibration regression, and discrimination [AUC])hard clinical
Augmenting the SAPS II score with routinely collected administrative comorbidity data significantly improves mortality prediction calibration and discrimination while reducing unexplained between-hospital variability for ICU benchmarking.
Severity scoring systems such as the Simplified Acute Physiology Score II (SAPS II) are widely used for benchmarking intensive care unit (ICU) performance, but their calibration deteriorates over time, leading to systematic overestimation of mortality in contemporary populations. We aimed to evaluate recalibrated SAPS II (r-SAPS II) and augmented SAPS II (Aug-SAPS II) models for mortality prediction using routinely collected national data. This nationwide retrospective study used the French national database of hospital stays. Adult ICU stays from 2015 to 2023 constituted the derivation cohort; 2024 data were used for temporal external validation, and 2025 data for sensitivity analyses. Demographic characteristics, SAPS II score, and ICD-10–derived comorbidities defined according to the Elixhauser classification, were used to construct aug-SAPS II model. Model calibration was assessed using Cox calibration regression and standardized mortality ratios (SMRs), defined as the ratio of observed to predicted in-hospital deaths. Discrimination and between-hospital heterogeneity were quantified using the area under the receiver operating characteristic curve (AUC) and random-effects variance in SMRs from mixed-effects Poisson regression. In the derivation cohort (2 156 332 ICU stays), discrimination improved, with the AUC increasing from 0.841 (95% CI, 0.840–0.841) for SAPS II to 0.866 (95% CI, 0.865–0.866) for aug-SAPS II. Calibration improved for both r-SAPS II and aug-SAPS II, with calibration-in-the-large shifting from − 1.10 to approximately 0.00 and calibration slope from 0.72 to approximately 1.00 at both the hospital level and across clinical subgroups. In the validation cohort (242 465 stays), calibration remained close to ideal and SMRs were near 1; AUCs were 0.835 (95% CI, 0.833–0.837) for SAPS II and 0.862 (95% CI, 0.860–0.864) for aug-SAPS II. Hospital-level random-effects variance was unchanged with r-SAPS II but decreased by 28.7% with aug-SAPS II. Findings were consistent in 2025 data. Compared with the original SAPS II, aug-SAPS II improved calibration and discrimination and reduced between-hospital variability without requiring additional data beyond routinely collected national administrative information.
Building similarity graph...
Analyzing shared references across papers
Loading...
Toufik Kamel
Thierry Boulain
Critical Care
Centre hospitalier universitaire d'Orléans
Building similarity graph...
Analyzing shared references across papers
Loading...
Kamel et al. (Tue,) conducted a cohort in ICU stays mortality prediction (n=2,156,332). Augmented Simplified Acute Physiology Score II (Aug-SAPS II) vs. Simplified Acute Physiology Score II (SAPS II) was evaluated on In-hospital mortality prediction (AUC increased from 0.841 to 0.866, 95% CI 0.865-0.866). Aug-SAPS II improved mortality prediction calibration and discrimination with an AUC of 0.866 compared to AUC of 0.841 for SAPS II, and reduced hospital-level random-effects variance by 35.3%.
www.synapsesocial.com/papers/69b3ac4d02a1e69014ccdead — DOI: https://doi.org/10.1186/s13054-026-05948-4