September 7, 2018

Front-end processing for the CHiME-5 dinner party scenario

Key Points

Key points are not available for this paper at this time.

Abstract

This contribution presents a speech enhancement system for the CHiME-5 Dinner Party Scenario. The front-end employs multi-channel linear time-variant filtering and achieves its gains without the use of a neural network. We present an adaptation of blind source separation techniques to the CHiME-5 database which we call Guided Source Separation (GSS). Using the baseline acoustic and language model, the combination of Weighted Prediction Error based dereverberation, guided source separation, and beamforming reduces the WER by 10:54% (relative) for the single array track and by 21:12% (relative) on the multiple array track.

Demander à l'IA

Bookmark

Cite This Study

Boeddecker et al. (Fri,) studied this question.

synapsesocial.com/papers/6a1e3abb1af840ad2140753b https://doi.org/https://doi.org/10.21437/chime.2018-8

Also Consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Demander à l'IA

Bookmark