What question did this study set out to answer?

The research aims to evaluate how Shapley values can make AI/ML sepsis diagnostic tools more understandable and useful for clinicians.

February 28, 2026Open Access

Interpretability of an FDA-authorized AI/ML sepsis diagnostic tool improved by SHAP values

Key Points

The research aims to evaluate how Shapley values can make AI/ML sepsis diagnostic tools more understandable and useful for clinicians.
Conducted structured assessments with 30 clinicians including providers and nurses
Evaluated the ability to understand SHAP values through a diagnostic interface
Used de-identified patient data in real clinical scenarios
Clinicians correctly interpreted SHAP values in 98% of assessments
Reported 100% improvement in understanding the diagnostic results with SHAP values
All 30 participants preferred the interface with SHAP values over the one without

Abstract

Abstract Objectives To assess the interpretability and acceptance of Shapley values for making artificial intelligence/machine learning (AI/ML) tools more transparent, interpretable, and useful to clinicians. Materials and Methods Structured assessments were conducted with 30 clinicians (15 providers; 15 nurses; 8 assessments per clinician) to evaluate their ability to understand interventional Shapley Additive exPlanations (SHAP) values, a type of Shapley value that provides individualized variable importance scores and ascertain their perspective on SHAP value utility for the use of an AI/ML sepsis diagnostic. Participants were shown the diagnostic interface for real clinical scenarios with de-identified patient data with and without SHAP values. The primary outcomes were clinician ability to correctly interpret SHAP values and clinician self-reported improvement in their understanding of how the AI/ML algorithm produced its result. Results Participants correctly interpreted SHAP values in 235 of 240 assessments (98%; CI, 95%-99%) and reported SHAP values improved their understanding of how the algorithm produced its result in every case (240/240; 100%; CI, 99%-100%). Participants were unanimous (30/30) in preferring the interface with SHAP values over the interface without. Discussion Clinician participants strongly preferred the device interface with SHAP values, were unanimous in reporting SHAP values improved their understanding of the AI/ML diagnostic, and scored nearly perfectly when asked to interpret SHAP values. Conclusion These results suggest health care providers value transparency into AI/ML algorithms designed for clinical use, and that Shapley values are a useful approach to providing that transparency, which in turn may improve tool adoption and clinical utility.

Interpretability of an FDA-authorized AI/ML sepsis diagnostic tool improved by SHAP values

Key Points

Abstract

Cite This Study