Abstract: While it is common to monitor deployed clinical artificial intelligence (AI) models for performance degradation, it is less common to monitor the input data for data drift, that is, systemic changes to input distributions. However, when real-time evaluation is not practical (e.g., because of labeling costs) or when gold labels are automatically generated, we argue that tracking data drift becomes a vital addition for AI deployments. In this work, we perform empirical experiments on real-world medical imaging to evaluate the ability of three data drift detection methods to detect drift that arises (a) naturally (the emergence of COVID-19 in X-rays) and (b) synthetically. We find that monitoring performance alone is not a good proxy for detecting data drift, and that drift detection depends heavily on sample size and patient features. Our work discusses the need for and utility of data drift detection in various scenarios and highlights gaps in knowledge for the practical application of existing methods.
Ali Kore
Elyar Abbasi Bavil
Vallijah Subasri
Nature Communications
Harvard University
University of Toronto
Massachusetts General Hospital
Kore et al. studied this question.
www.synapsesocial.com/papers/68e76cedb6db6435876e285a — DOI: https://doi.org/10.1038/s41467-024-46142-w