DeepBeat achieved a sensitivity of 0.98 for atrial fibrillation detection compared to a sensitivity of 0.49 for the single-task model.
Observational (n=107)
No
Does a multitask deep learning model (DeepBeat) improve the detection of atrial fibrillation from wearable photoplethysmography devices compared to single-task models?
A multitask deep learning approach that jointly assesses signal quality and rhythm significantly improves the accuracy of atrial fibrillation detection from wearable PPG devices.
Effect estimate: RR 2.00 (95% CI 1.55-2.50)
Absolute Event Rate: 0.98% vs 0.49%
p-value: p=<0.001
Abstract Wearable devices enable theoretically continuous, longitudinal monitoring of physiological measurements such as step count, energy expenditure, and heart rate. Although the classification of abnormal cardiac rhythms such as atrial fibrillation from wearable devices has great potential, commercial algorithms remain proprietary and tend to focus on heart rate variability derived from green spectrum LED sensors placed on the wrist, where noise remains an unsolved problem. Here we develop DeepBeat, a multitask deep learning method to jointly assess signal quality and arrhythmia event detection in wearable photoplethysmography devices for real-time detection of atrial fibrillation. The model is trained on approximately one million simulated unlabeled physiological signals and fine-tuned on a curated dataset of over 500 K labeled signals from over 100 individuals from 3 different wearable devices. We demonstrate that, in comparison with a single-task model, our architecture using unsupervised transfer learning through convolutional denoising autoencoders dramatically improves the performance of atrial fibrillation detection from a F1 score of 0.54 to 0.96. We also include in our evaluation a prospectively derived replication cohort of ambulatory participants where the algorithm performed with high sensitivity (0.98), specificity (0.99), and F1 score (0.93). We show that two-stage training can help address the unbalanced data problem common to biomedical applications, where large-scale well-annotated datasets are hard to generate due to the expense of manual annotation, data acquisition, and participant privacy.
Soto et al. (Wed,) conducted a observational in Atrial Fibrillation (n=107). DeepBeat vs. Single-task model was evaluated on Atrial fibrillation detection (RR 2.00, 95% CI 1.55-2.50, p=<0.001). DeepBeat achieved a sensitivity of 0.98 for atrial fibrillation detection compared to a sensitivity of 0.49 for the single-task model.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: