Artificial intelligence (AI) promises to enhance breast cancer screening. Here we evaluated Google's mammography AI system (version 1.2) across two phases: a retrospective study using 115,973 mammograms from five National Health Service screening services with 39-month follow-up and prospective noninterventional feasibility deployment at 12 sites (9,266 cases). The primary endpoint was AI sensitivity and specificity versus first reader using a 5% noninferiority margin. The secondary endpoints were performance versus second or consensus readers and breast-level analyses. Retrospectively, AI achieved superior sensitivity (0.541 versus 0.437 for first reader, P < 0.001) and noninferior specificity (0.943 versus 0.952, P < 0.001). Cancer detection rate increased from 7.54 to 9.33 per 1,000 women, with AI detecting 25.0% of interval cancers. Performance was particularly strong for first screens (39.3% fewer recalls, 8.8% higher detection) and invasive cancers. No systematic demographic disparities were observed. Simulated second-reader replacement reduced reading time by 32% while increasing detection by 17.7%. Prospective deployment confirmed technical feasibility but revealed a distribution shift requiring threshold recalibration. Implementation requires adaptive calibration and continuous monitoring to ensure safety and equity.
Building similarity graph...
Analyzing shared references across papers
Loading...
Christopher Kelly
Marc Wilson
Lucy M. Warren
Nature Cancer
University of Cambridge
Imperial College London
Google (United States)
Building similarity graph...
Analyzing shared references across papers
Loading...
Kelly et al. (Tue,) studied this question.
www.synapsesocial.com/papers/69b3aaa802a1e69014ccb625 — DOI: https://doi.org/10.1038/s43018-026-01127-0