What does this research mean for the field?

AI-driven voice analysis can distinguish between admission and discharge stages in heart failure patients based on vocal changes, achieving an F1-score of 0.75. Novelty: ClaimNovelty.NOVEL_FINDING. Consensus alignment: ConsensusAlignment.NEUTRAL.

What question did this study set out to answer?

Evaluate the effectiveness of AI-driven voice analysis as a non-invasive method for monitoring fluid status in heart failure patients.

February 8, 2026

AI-driven voice analysis for early fluid overload detection in heart failure: preliminary results from the DHZC cohort of the VAMP-HF study

Key Result

AI-driven voice analysis classified heart failure admission vs. discharge states with an F1-score of 0.75 in 32 patients, detecting subtle vocal changes even with <2 kg weight loss.

Key Points

Evaluate the effectiveness of AI-driven voice analysis as a non-invasive method for monitoring fluid status in heart failure patients.
Conducted a study with 55 heart failure patients from the VAMP-HF study.
Collected daily voice recordings and clinical measurements including NT-proBNP and weight.
Trained an XGBoost classifier on acoustic features to classify patient states.
Final analysis included 32 patients with a mean age of 76.12 years.
The model achieved an F1-score of 0.75, distinguishing between patient states based on voice samples.
Detected vocal changes in patients, even with weight loss of less than 2 kg.

Structured PICO

Does AI-driven voice analysis detect fluid overload and distinguish between admission and discharge states in patients with chronic heart failure?

Population

32 patients with chronic heart failure, mean age 76.12 ± 12.65 years, from the German Heart Center at Charité (DHZC) cohort of the multicenter VAMP-HF study.

Intervention

AI-driven voice analysis using a speaker-independent XGBoost classifier trained on daily voice recordings (sustained vowels, standardized text, and varying sentences).

Outcome

Classification of patient states (admission vs. discharge) based solely on voice samples.surrogate

AI-driven voice analysis can distinguish between admission and discharge states in heart failure patients with an F1-score of 0.75, suggesting its potential as a non-invasive biomarker for early fluid congestion.

Main Result

Absolute Event Rate: 0% vs 0%

Abstract

Abstract Background Effective fluid status monitoring in chronic heart failure (HF) patients is critical for preventing decompensation and hospitalization. Conventional methods, such as daily weight tracking, often fail to detect early, pre-symptomatic fluid retention. As fluid accumulation affects the vocal tract, subtle voice alterations can serve as a biomarker for congestion. In this work, we present preliminary results from the first 55 patients enrolled at the German Heart Center at Charité (DHZC) within the multicenter VAMP-HF study, assessing the performance of AI-driven voice analysis for non-invasive fluid status monitoring. Methods The study design incorporated a run-in period for the first 20 patients to refine the recording setup and ensure sufficient audio quality for subsequent data collection. An additional three patients were excluded due to clinical deterioration or in-hospital mortality. Participants provided daily voice recordings (sustained vowels, standardized text, and varying sentences) alongside clinical measurements such as NT-proBNP, daily weight, and left-ventricular ejection fraction. A speaker-independent XGBoost classifier was trained on acoustic features to classify patient states (admission, discharge, and intermediate states). Performance was evaluated using a nested leave-one-patient-out approach, ensuring strict training and validation data separation. Results The final analysis included 32 patients (mean age 76.12 ± 12.65 years). At admission, patients had a mean NT-proBNP of 9575.55 ± 8202.87 pg/mL and a mean weight loss of 4.2 ± 4.29 kg until discharge. The trained machine learning model achieved an F1-score of 0.75 (Fig. 1b), demonstrating its ability to distinguish between admission and discharge states, based solely on voice samples. An exemplary prediction for a test patient is shown in Fig. 1a. Notably, the model was also able to detect subtle vocal changes even in patients with 2 kg weight loss. Conclusion Our findings suggest that AI-driven voice analysis is a feasible biomarker for detecting heart failure decompensation. While not directly compared with conventional early warning thresholds, these results support the hypothesis that voice-based biomarkers may serve as an early indicator of fluid congestion. Further validation in larger cohorts is needed to confirm these findings and enable integration into remote HF monitoring programs.

Bookmark