What is the clinical evidence from this study?

Study design: Other. Population: clinical ECG recordings used for heart rate variability (HRV) analysis (n=21,999). Intervention: Modular deep-learning framework vs. HeartPy and NeuroKit2. Primary outcome: RMSSD mean absolute error (MAE) on the combined test set (p < 10^-300).

What does this research mean for the field?

A modular deep-learning framework significantly improves the accuracy of heart rate variability analysis from ultra-short ECG windows, reducing RMSSD mean absolute error to 10.56 ms, well below the classical baselines HeartPy (45.12 ms) and NeuroKit2 (27.93 ms). Novelty: methodological. Consensus alignment: neutral.

What question did this study set out to answer?

The aim is to develop a deep-learning framework for real-time heart rate variability analysis from short ECG windows.

April 22, 2026 | Open Access

Low-latency HRV analysis from ultra-short ECG windows using a modular deep-learning framework

Q: What are the key findings of this study?

The proposed modular deep-learning framework reduced RMSSD mean absolute error to 10.56 ms compared to 45.12 ms for HeartPy and 27.93 ms for NeuroKit2 on the combined ECG test set.
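To put these errors in relative terms, the arithmetic below converts the three reported MAE values into fractional reductions. The helper name `relative_reduction` is illustrative; only the MAE values come from the study.

```python
def relative_reduction(baseline_mae_ms: float, model_mae_ms: float) -> float:
    """Fractional MAE reduction of the model relative to a baseline."""
    return (baseline_mae_ms - model_mae_ms) / baseline_mae_ms


# MAE values (ms) as reported for the combined ECG test set.
MODEL_MAE = 10.56
BASELINE_MAE = {"HeartPy": 45.12, "NeuroKit2": 27.93}

reductions = {
    name: relative_reduction(mae, MODEL_MAE) for name, mae in BASELINE_MAE.items()
}

for name, value in reductions.items():
    print(f"{name}: {value:.1%} lower MAE")
```

This works out to roughly 77% lower MAE than HeartPy and 62% lower than NeuroKit2.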

Key Points

Aim: develop a deep-learning framework for real-time heart rate variability analysis from short ECG windows.
A convolutional autoencoder, pretrained and then frozen, extracts features from raw ECG windows.
A modular architecture adds a discriminator head for quality screening and a regression head for RMSSD estimation.
Experiments used the LUDB and PTB-XL datasets, plus an Apple Watch subset for out-of-distribution validation.
The discriminator reached 92.12% combined-set accuracy and a 95.43% F1 score, versus 80.54% / 88.82% for HeartPy and 85.58% / 91.99% for NeuroKit2.
RMSSD mean absolute error (MAE) on the combined test set fell to 10.56 ms, versus 45.12 ms for HeartPy and 27.93 ms for NeuroKit2.
Mean latency was 15.0 ms at batch size 1 (66.5 windows/s), with throughput scaling to 4.49k windows/s at batch size 1024 on a single consumer-grade GPU.
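For reference, RMSSD is the root mean square of successive differences between adjacent RR intervals. The sketch below computes it from R-peak timestamps. The function name and the sample peak times are illustrative and not taken from the study, which estimates RMSSD with a learned regression head rather than directly from detected peaks.

```python
import math


def rmssd_ms(r_peak_times_s):
    """RMSSD in milliseconds from R-peak timestamps given in seconds."""
    # RR intervals (ms) between consecutive R peaks.
    rr = [(b - a) * 1000.0 for a, b in zip(r_peak_times_s, r_peak_times_s[1:])]
    # Successive differences between adjacent RR intervals.
    diffs = [b - a for a, b in zip(rr, rr[1:])]
    if not diffs:
        raise ValueError("need at least three R peaks to compute RMSSD")
    return math.sqrt(sum(d * d for d in diffs) / len(diffs))


# Hypothetical R-peak times: RR intervals alternate between 800 ms and 900 ms.
peaks = [0.0, 0.8, 1.7, 2.5, 3.4]
value = rmssd_ms(peaks)
print(f"RMSSD: {value:.1f} ms")
```

A 10-second window contains only a handful of beats, so each RR interval has a large influence on the result. That sensitivity is one reason a single missed or spurious peak can produce the large tail errors reported for peak-based toolkits.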

Structured PICO

Does a modular deep-learning framework improve the accuracy and robustness of RMSSD estimation from ultra-short ECG windows compared to classical algorithmic baselines?

Population

21,999 clinical 10-second 12-lead ECG recordings (200 from LUDB, 21,799 from PTB-XL) and 50 single-lead ECG windows from an Apple Watch.

Intervention

Modular deep-learning framework comprising a pretrained convolutional autoencoder (frozen encoder) and task-specific heads (discriminator for quality screening and regression head for RMSSD estimation) with gated inference.
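The gated routing can be sketched as follows. This is a minimal illustration, not the authors' implementation: `gated_rmssd`, the toy stand-in heads, and the 0.5 threshold are all assumptions, since the summary does not state the threshold value or the head interfaces.

```python
from typing import Callable, Optional, Sequence

Latent = Sequence[float]


def gated_rmssd(
    latent: Latent,
    quality_head: Callable[[Latent], float],
    rmssd_head: Callable[[Latent], float],
    threshold: float = 0.5,  # assumed value; not given in the source
) -> Optional[float]:
    """Return an RMSSD estimate only if the window passes quality screening."""
    quality = quality_head(latent)
    if quality <= threshold:
        return None  # window rejected: no estimate is emitted
    return rmssd_head(latent)


# Toy stand-ins for the learned heads (hypothetical, for illustration only).
def toy_quality(z: Latent) -> float:
    return 0.9 if max(z) < 5.0 else 0.1


def toy_rmssd(z: Latent) -> float:
    return 40.0


accepted = gated_rmssd([0.1, 0.2, 0.3], toy_quality, toy_rmssd)
rejected = gated_rmssd([0.1, 9.0, 0.3], toy_quality, toy_rmssd)
print(accepted, rejected)
```

Returning `None` for rejected windows makes the caller handle the missing estimate explicitly, so a low-quality window never yields a plausible-looking number.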

Comparator

Classical algorithmic toolkits (HeartPy and NeuroKit2).

Outcome

Discriminator accuracy and F1 score for quality screening, and Mean Absolute Error (MAE) for RMSSD estimation (surrogate outcome).
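These outcome metrics can be computed as in the sketch below. It assumes binary validity labels with 1 meaning a usable window, and uses a nearest-rank 95th percentile for tail error; the source does not specify either convention, and all sample data are made up for illustration.

```python
import math


def accuracy_and_f1(y_true, y_pred):
    """Accuracy and F1 for binary validity labels (1 = usable window)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    accuracy = (tp + tn) / len(pairs)
    denom = 2 * tp + fp + fn
    f1 = 2 * tp / denom if denom else 0.0
    return accuracy, f1


def absolute_errors(targets, estimates):
    """Per-window absolute errors between target and estimated RMSSD."""
    return [abs(t - e) for t, e in zip(targets, estimates)]


def p95(values):
    """95th percentile by the nearest-rank method (one of several conventions)."""
    ordered = sorted(values)
    rank = math.ceil(0.95 * len(ordered))
    return ordered[rank - 1]


# Made-up labels and RMSSD values (ms), purely for illustration.
acc, f1 = accuracy_and_f1([1, 1, 1, 0, 0], [1, 1, 0, 0, 1])
errors = absolute_errors([30.0, 40.0, 50.0, 60.0], [32.0, 37.0, 55.0, 60.0])
mae = sum(errors) / len(errors)
tail = p95(errors)
print(acc, f1, mae, tail)
```

Reporting a tail percentile alongside MAE matters here because a few grossly wrong estimates can dominate clinical usefulness even when the average error looks acceptable.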

A novel modular deep-learning framework significantly improves the accuracy and robustness of real-time HRV (RMSSD) estimation from ultra-short ECG windows compared to standard algorithmic toolkits.

Main Result

RMSSD MAE (combined test set): 10.56 ms vs 45.12 ms (HeartPy) and 27.93 ms (NeuroKit2)

p-value: p < 10^-300

Limitations

Weak supervision on PTB-XL RMSSD targets (pseudo-labels)
No prospective clinical validation
Ultra-short window regime estimates are not directly comparable to guideline-oriented short-term HRV
Limited wearable evidence and lack of pathological diversity in the Apple Watch subset
Over-conservative gating in borderline windows

Abstract

We present a universal modular deep-learning framework and demonstrate its application to low-latency, streaming-compatible heart rate variability (HRV) analysis using RMSSD as an exemplar metric. A convolutional autoencoder is first pretrained and then reused as a frozen encoder that maps raw ECG windows to a compact latent sequence. Task-specific heads, each comprising a BiLSTM adapter, a shallow Conv1D refinement, and temporal attention pooling, operate on this shared representation. A discriminator head screens low-quality windows, while a regression head estimates RMSSD; a gated inference block routes outputs so RMSSD is produced only when the discriminator exceeds a threshold, replicating a robust “mask-then-estimate” pipeline in a single deployable graph. Using LUDB and PTB-XL with segmentation-assisted peak extraction for PTB-XL, plus an out-of-distribution Apple Watch subset, we enforce rigorous quality assurance to derive validity labels and RMSSD targets. Compared to two strong classical baselines (HeartPy and NeuroKit2), our discriminator improves combined-set accuracy to 92.12% (vs. 80.54% / 85.58%) with F1 of 95.43% (vs. 88.82% / 91.99%). On RMSSD estimation, the proposed model reduces combined MAE to 10.56 ms (from 45.12 ms / 27.93 ms) and sharply curtails tail errors (P95: 47.00 ms vs. 313.35 ms / 167.84 ms), indicating substantially improved robustness under pathological and noisy ECG. On a small out-of-distribution Apple Watch subset used as a sanity check for acquisition shift, the model attains the lowest MAE (7.57 ms vs. 13.96 ms / 9.61 ms) under a selective gating regime. The end-to-end model is compact (2.62 M parameters; 10.07 MB on disk) and real-time capable, achieving 15.0 ms mean latency at batch size 1 (66.5 windows/s) and scaling to 4.49k windows/s at batch size 1024 on a single consumer-grade GPU.
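The latency and throughput figures in the abstract can be related with simple arithmetic. The sketch below assumes throughput equals batch size divided by per-batch latency; the helper names are illustrative. The small gap between the computed 66.7 windows/s and the reported 66.5 windows/s is presumably due to rounding of the latency figure.

```python
def throughput_windows_per_s(batch_size: int, latency_ms: float) -> float:
    """Windows processed per second, given the latency of one batch."""
    return batch_size * 1000.0 / latency_ms


def batch_latency_ms(batch_size: int, windows_per_s: float) -> float:
    """Implied time to process one batch, given a throughput."""
    return batch_size * 1000.0 / windows_per_s


# Reported figures: 15.0 ms at batch size 1; 4.49k windows/s at batch size 1024.
single = throughput_windows_per_s(1, 15.0)
large_batch_ms = batch_latency_ms(1024, 4490.0)
speedup = 4490.0 / 66.5  # batched vs. single-window throughput, as reported

print(f"{single:.1f} windows/s, {large_batch_ms:.0f} ms per batch, {speedup:.1f}x")
```

Under these assumptions a batch of 1024 windows takes roughly 228 ms, or about 0.22 ms per window, which is a roughly 68-fold throughput gain over batch size 1.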

Cite This Study

Dobrosolski et al. (2026) evaluated a modular deep-learning framework against HeartPy and NeuroKit2 for heart rate variability (HRV) analysis (n=21,999), with RMSSD mean absolute error (MAE) on the combined test set as the primary outcome (p < 10^-300). The proposed framework reduced RMSSD mean absolute error to 10.56 ms compared to 45.12 ms for HeartPy and 27.93 ms for NeuroKit2 on the combined ECG test set.

synapsesocial.com/papers/69e865d76e0dea528ddea44e https://doi.org/10.1038/s41598-026-48463-w
