MUSAN: A Music, Speech, and Noise Corpus

Key Points

Key points are not available for this paper at this time.

Abstract

This report introduces a new corpus of music, speech, and noise. This dataset is suitable for training models for voice activity detection (VAD) and music/speech discrimination. Our corpus is released under a flexible Creative Commons license. The dataset consists of music from several genres, speech from twelve languages, and a wide assortment of technical and non-technical noises. We demonstrate use of this corpus for music/speech discrimination on Broadcast news and VAD for speaker identification.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

References and Citations

Add This Paper to Your Research Feed

Any time a new paper drops it will be there.