Key points are not available for this paper at this time.
This report introduces a new corpus of music, speech, and noise. This dataset is suitable for training models for voice activity detection (VAD) and music/speech discrimination. Our corpus is released under a flexible Creative Commons license. The dataset consists of music from several genres, speech from twelve languages, and a wide assortment of technical and non-technical noises. We demonstrate use of this corpus for music/speech discrimination on Broadcast news and VAD for speaker identification.
Building similarity graph...
Analyzing shared references across papers
Loading...
David Snyder
ECRI Institute
Guoguo Chen
New England Biolabs (China)
Daniel Povey
Xiaomi (China)
Building similarity graph...
Analyzing shared references across papers
Loading...
Snyder et al. (Thu,) studied this question.
synapsesocial.com/papers/6a08fcfc944076d22073a909 — DOI: https://doi.org/10.48550/arxiv.1510.08484
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: