Clinical utility assessment framework for machine learning-based fetal health classification in cardiotocography: an observational study | Synapse