What question did this study set out to answer?

This research aims to enhance the accuracy of mortality predictions in ICU settings using updated scoring models.

March 13, 2026Open Access

Augmenting the Simplified Acute Physiology Score II for intensive care unit mortality prediction and benchmarking: a French nationwide study

Q: What is the clinical evidence from this study?

Study design: Cohort. Population: ICU stays mortality prediction (n=2156332). Intervention: Augmented Simplified Acute Physiology Score II (Aug-SAPS II) vs. Simplified Acute Physiology Score II (SAPS II). Primary outcome: In-hospital mortality prediction (AUC increased from 0.841 to 0.866, 95% CI 0.865-0.866).

Key Result

Aug-SAPS II improved mortality prediction calibration and discrimination with an AUC of 0.866 compared to AUC of 0.841 for SAPS II, and reduced hospital-level random-effects variance by 35.3%.

Key Points

This research aims to enhance the accuracy of mortality predictions in ICU settings using updated scoring models.
Used French national hospital stay database for analysis from 2015 to 2025.
Constructed augmented SAPS II model considering demographic data and comorbidities.
Evaluated calibration using Cox regression and SMRs, and discrimination via AUC and mixed-effects Poisson regression.
Augmented SAPS II showed a significant AUC improvement from 0.841 to 0.866 compared to original SAPS II.
Calibration improved markedly, with calibration slopes nearing ideal values of 1.00.
Reductions in between-hospital variability were observed, enhancing reliability of performance benchmarking.

Study Design

Type

Cohort (n=2,156,332)

Multicenter

Yes

Structured PICO

Does an augmented SAPS II model incorporating comorbidities improve in-hospital mortality prediction and benchmarking in adult ICU patients compared to the original SAPS II?

Population

2,496,734 adult (≥18 years) intensive care unit (ICU) stays in France between 2015 and 2025 (derivation cohort n=2,156,332; validation cohort n=242,465; sensitivity cohort n=97,937), mean age 62.5 years, 36.6% female.

Intervention

Augmented SAPS II (aug-SAPS II) model incorporating SAPS II score, demographic characteristics, and ICD-10-derived Elixhauser comorbidities

Comparator

Original SAPS II model and recalibrated SAPS II (r-SAPS II)

Outcome

In-hospital mortality (assessed via standardized mortality ratios [SMRs], calibration regression, and discrimination [AUC])hard clinical

Augmenting the SAPS II score with routinely collected administrative comorbidity data significantly improves mortality prediction calibration and discrimination while reducing unexplained between-hospital variability for ICU benchmarking.

Main Result

Effect estimate: AUC increased from 0.841 to 0.866 (95% CI 0.865-0.866)

Absolute Event Rate: 19.8% vs 17.1%

Limitations

Residual miscalibration persists in a few diagnostic subgroups.
The study may not account for all confounding factors influencing outcomes at hospitals.

Abstract

Severity scoring systems such as the Simplified Acute Physiology Score II (SAPS II) are widely used for benchmarking intensive care unit (ICU) performance, but their calibration deteriorates over time, leading to systematic overestimation of mortality in contemporary populations. We aimed to evaluate recalibrated SAPS II (r-SAPS II) and augmented SAPS II (Aug-SAPS II) models for mortality prediction using routinely collected national data. This nationwide retrospective study used the French national database of hospital stays. Adult ICU stays from 2015 to 2023 constituted the derivation cohort; 2024 data were used for temporal external validation, and 2025 data for sensitivity analyses. Demographic characteristics, SAPS II score, and ICD-10–derived comorbidities defined according to the Elixhauser classification, were used to construct aug-SAPS II model. Model calibration was assessed using Cox calibration regression and standardized mortality ratios (SMRs), defined as the ratio of observed to predicted in-hospital deaths. Discrimination and between-hospital heterogeneity were quantified using the area under the receiver operating characteristic curve (AUC) and random-effects variance in SMRs from mixed-effects Poisson regression. In the derivation cohort (2 156 332 ICU stays), discrimination improved, with the AUC increasing from 0.841 (95% CI, 0.840–0.841) for SAPS II to 0.866 (95% CI, 0.865–0.866) for aug-SAPS II. Calibration improved for both r-SAPS II and aug-SAPS II, with calibration-in-the-large shifting from − 1.10 to approximately 0.00 and calibration slope from 0.72 to approximately 1.00 at both the hospital level and across clinical subgroups. In the validation cohort (242 465 stays), calibration remained close to ideal and SMRs were near 1; AUCs were 0.835 (95% CI, 0.833–0.837) for SAPS II and 0.862 (95% CI, 0.860–0.864) for aug-SAPS II. Hospital-level random-effects variance was unchanged with r-SAPS II but decreased by 28.7% with aug-SAPS II. Findings were consistent in 2025 data. Compared with the original SAPS II, aug-SAPS II improved calibration and discrimination and reduced between-hospital variability without requiring additional data beyond routinely collected national administrative information.

Bookmark

View Full Paper

Cite This Study

Kamel et al. (Tue,) conducted a cohort in ICU stays mortality prediction (n=2,156,332). Augmented Simplified Acute Physiology Score II (Aug-SAPS II) vs. Simplified Acute Physiology Score II (SAPS II) was evaluated on In-hospital mortality prediction (AUC increased from 0.841 to 0.866, 95% CI 0.865-0.866). Aug-SAPS II improved mortality prediction calibration and discrimination with an AUC of 0.866 compared to AUC of 0.841 for SAPS II, and reduced hospital-level random-effects variance by 35.3%.

synapsesocial.com/papers/69b3ac4d02a1e69014ccdead https://doi.org/https://doi.org/10.1186/s13054-026-05948-4

Bookmark

View Full Paper