Progressive Wasserstein GAN-synthesized CAD-like heart sounds achieved a Fréchet Audio Distance score of 1.43 and outperformed traditional augmentation methods in classification performance.
PhysioNet/Computing in Cardiology (CinC) Challenge 2016 dataset of heart sound segments
Data augmentation using Progressive Wasserstein GANs to synthesize realistic CAD-like heart sound segments
Traditional augmentation and cost-sensitive learning methods
Classification performance (sensitivity, specificity, precision) and synthetic audio quality (Fréchet Audio Distance)
GAN-based data augmentation of heart sound datasets improves the classification performance of machine learning models for detecting coronary artery disease.
The PhysioNet/Computing in Cardiology (CinC) Challenge 2016 dataset has driven significant advancements in automated heart sound analysis using machine learning (ML) and deep learning (DL). However, these efforts are constrained by the dataset's limited size and severe class imbalance, particularly the underrepresentation of coronary artery disease (CAD) cases. This study addresses these limitations by employing generative adversarial networks (GANs) to synthesize realistic CAD-like heart sound segments, augmenting existing datasets to improve classification performance. A Progressive Wasserstein GAN architecture was implemented to generate high-quality audio segments that accurately capture CAD heart sounds' spectral and temporal characteristics. The quality of synthetic audio was assessed using the Fréchet Audio Distance (FAD), achieving scores of 1.43 and 2.23 when compared to reference CAD and healthy samples, respectively. Novel post-processing steps, including bandpass filtering, further enhanced the fidelity of the synthetic samples. By augmenting the imbalanced heart sound dataset with these samples, we observed substantial improvements in the performance of five classification models. The GAN-augmented training set outperformed traditional augmentation and cost-sensitive learning methods, demonstrating superior sensitivity, specificity, and precision. This study highlights the potential of GAN-based data augmentation to address critical challenges in medical audio datasets. It offers a scalable and cost-effective solution for improving the generalizability and robustness of heart sound classification models, paving the way for enhanced diagnostic tools in biomedical signal processing. • Progressive GANs synthesize high-quality heart sounds resembling CAD features. • Augmented datasets improve CAD heart sound classification performance. • Novel use of Progressive Wasserstein GANs for abnormal heart sounds synthesis. • Synthetic audios achieve FAD scores of 1.43 (CAD) and 2.23 (healthy). • Advanced GAN mechanisms ensure stable training and variety in synthetic samples.
Building similarity graph...
Analyzing shared references across papers
Loading...
Shaunak Chakraborty
Symbiosis International University
Prishita Kochhar
Boston University
Shruti Patil
Defence Institute of Advanced Technology
Computers in Biology and Medicine
Indian Statistical Institute
Monash University Malaysia
Symbiosis International University
Building similarity graph...
Analyzing shared references across papers
Loading...
Chakraborty et al. (Tue,) conducted a other in Coronary artery disease (CAD) / Heart sound abnormalities. Progressive Wasserstein Generative Adversarial Networks (GANs) data augmentation vs. Traditional augmentation and cost-sensitive learning methods was evaluated on Fréchet Audio Distance (FAD) score compared to reference CAD samples. Progressive Wasserstein GAN-synthesized CAD-like heart sounds achieved a Fréchet Audio Distance score of 1.43 and outperformed traditional augmentation methods in classification performance.
synapsesocial.com/papers/6a127fa2e407b26696350705 — DOI: https://doi.org/10.1016/j.compbiomed.2025.110623
Synapse has enriched 3 closely related papers on similar clinical questions. Consider them for comparative context: