AI-driven voice analysis classified heart failure admission vs. discharge states with an F1-score of 0.75 in 32 patients, detecting subtle vocal changes even with <2 kg weight loss.
Does AI-driven voice analysis detect fluid overload and distinguish between admission and discharge states in patients with chronic heart failure?
AI-driven voice analysis can distinguish between admission and discharge states in heart failure patients with an F1-score of 0.75, suggesting its potential as a non-invasive biomarker for early fluid congestion.
Absolute Event Rate: 0% vs 0%
Abstract Background Effective fluid status monitoring in chronic heart failure (HF) patients is critical for preventing decompensation and hospitalization. Conventional methods, such as daily weight tracking, often fail to detect early, pre-symptomatic fluid retention. As fluid accumulation affects the vocal tract, subtle voice alterations can serve as a biomarker for congestion. In this work, we present preliminary results from the first 55 patients enrolled at the German Heart Center at Charité (DHZC) within the multicenter VAMP-HF study, assessing the performance of AI-driven voice analysis for non-invasive fluid status monitoring. Methods The study design incorporated a run-in period for the first 20 patients to refine the recording setup and ensure sufficient audio quality for subsequent data collection. An additional three patients were excluded due to clinical deterioration or in-hospital mortality. Participants provided daily voice recordings (sustained vowels, standardized text, and varying sentences) alongside clinical measurements such as NT-proBNP, daily weight, and left-ventricular ejection fraction. A speaker-independent XGBoost classifier was trained on acoustic features to classify patient states (admission, discharge, and intermediate states). Performance was evaluated using a nested leave-one-patient-out approach, ensuring strict training and validation data separation. Results The final analysis included 32 patients (mean age 76.12 ± 12.65 years). At admission, patients had a mean NT-proBNP of 9575.55 ± 8202.87 pg/mL and a mean weight loss of 4.2 ± 4.29 kg until discharge. The trained machine learning model achieved an F1-score of 0.75 (Fig. 1b), demonstrating its ability to distinguish between admission and discharge states, based solely on voice samples. An exemplary prediction for a test patient is shown in Fig. 1a. Notably, the model was also able to detect subtle vocal changes even in patients with 2 kg weight loss. Conclusion Our findings suggest that AI-driven voice analysis is a feasible biomarker for detecting heart failure decompensation. While not directly compared with conventional early warning thresholds, these results support the hypothesis that voice-based biomarkers may serve as an early indicator of fluid congestion. Further validation in larger cohorts is needed to confirm these findings and enable integration into remote HF monitoring programs.
Riehle et al. (Sat,) reported a other. AI-driven voice analysis classified heart failure admission vs. discharge states with an F1-score of 0.75 in 32 patients, detecting subtle vocal changes even with <2 kg weight loss.