Key points are not available for this paper at this time.
Manual annotation of audio material is cumbersome. Active learning aims at minimizing the annotation effort by iteratively selecting an acquisition batch of unlabeled data, asking a human to annotate the selected data and re-training a classifier until an annotation budget is depleted. In this paper we propose the Gaussian-dense active learning (GDAL) algorithm to train a sound event classifier. The classifier is a Bayesian neural network where the weights are normally distributed. This is in contrast to conventional neural networks where weights are not distributed, but have assigned values. The Bayesian nature of the classifier empowers GDAL to select acquisition batches from a set of unlabeled audio clips based on their estimated informativeness. Evaluation results on the UrbanSound8k dataset show that GDAL outperforms a state-of-the-art algorithm based on medoid active learning for all considered annotation budgets and an algorithm based on dropout active learning for sufficiently large annotation budgets.
Building similarity graph...
Analyzing shared references across papers
Loading...
Stepan Shishkin
Fraunhofer Institute for Digital Media Technology
Danilo Hollosi
Fraunhofer Institute for Digital Media Technology
Stefan Goetze
South Westphalia University of Applied Sciences
University of Sheffield
Fraunhofer Institute for Digital Media Technology
Building similarity graph...
Analyzing shared references across papers
Loading...
Shishkin et al. (Mon,) studied this question.
synapsesocial.com/papers/68e7388db6db6435876b1a0b — DOI: https://doi.org/10.1109/icassp48485.2024.10446970