• Marginal conformal coverage can mislead under one-time calibration workflows. • Small calibration sets yield large variability in conditional coverage. • Histology classification shows frequent under-coverage with limited calibration data. • Conditional guarantees need much larger calibration sets to be informative. • Implications for safe deployment and regulation of medical machine learning.
Kladny et al. (Mon,) studied this question.