Sparse Overcomplete Decomposition for Single Channel Speaker Separation

Abstract

We present an algorithm for separating multiple speakers from a mixed single-channel recording. The algorithm is based on a model proposed by Raj and Smaragdis (2005). The idea is to extract characteristic spectro-temporal basis functions from training data for each speaker and to decompose the mixed signal as a linear combination of these learned bases. In other words, their model extracts a compact code of basis functions that explains the space spanned by the spectral vectors of a speaker. In our model, we instead generate a sparse-distributed (overcomplete) code, with more basis functions than the dimensionality of the space. We propose a probabilistic framework to achieve sparsity. Experiments show that the resulting sparse code better captures the structure in the data and hence leads to better separation.
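The train-then-decompose pipeline described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract describes a probabilistic framework for sparsity, whereas the sketch below swaps in a related and widely used technique, KL-divergence non-negative matrix factorization with an L1 penalty on the activations. The function names `learn_bases` and `separate`, the `sparsity` weight, and the iteration counts are all assumptions chosen for illustration.

```python
import numpy as np

EPS = 1e-12  # guards against division by zero


def learn_bases(V, n_bases, n_iter=200, sparsity=0.1, seed=0):
    """Learn non-negative spectral bases from one speaker's training data.

    V is a magnitude spectrogram (n_freq x n_frames). Returns W with shape
    (n_freq x n_bases). Hypothetical helper: sparse KL-NMF, used here as a
    stand-in for the paper's probabilistic model.
    """
    rng = np.random.default_rng(seed)
    W = rng.random((V.shape[0], n_bases)) + 1e-3
    W /= W.sum(axis=0, keepdims=True)
    H = rng.random((n_bases, V.shape[1])) + 1e-3
    for _ in range(n_iter):
        # Activation update; the L1 penalty appears as `sparsity` in the denominator.
        R = V / (W @ H + EPS)
        H *= (W.T @ R) / (W.sum(axis=0)[:, None] + sparsity)
        # Basis update, followed by renormalising each basis to sum to one.
        R = V / (W @ H + EPS)
        W *= (R @ H.T) / (H.sum(axis=1) + EPS)
        scale = W.sum(axis=0) + EPS
        W /= scale
        H *= scale[:, None]  # keeps the product W @ H unchanged
    return W


def separate(V_mix, W_list, n_iter=200, sparsity=0.1, seed=0):
    """Split a mixture spectrogram using fixed, pre-trained speaker bases.

    Only the activations are estimated. Each speaker's estimate is the
    mixture weighted by that speaker's share of the reconstruction.
    """
    rng = np.random.default_rng(seed)
    W = np.hstack(W_list)
    H = rng.random((W.shape[1], V_mix.shape[1])) + 1e-3
    for _ in range(n_iter):
        R = V_mix / (W @ H + EPS)
        H *= (W.T @ R) / (W.sum(axis=0)[:, None] + sparsity)
    full = W @ H + EPS
    estimates, start = [], 0
    for Wk in W_list:
        stop = start + Wk.shape[1]
        estimates.append(V_mix * (Wk @ H[start:stop]) / full)
        start = stop
    return estimates
```

Choosing `n_bases` larger than the number of frequency bins gives an overcomplete dictionary in the sense used above; the penalty on the activations is what keeps such a decomposition from becoming trivially ambiguous. In practice the returned magnitude estimates would be combined with the mixture phase and inverted to obtain time-domain signals, a step omitted here.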

Authors

Madhusudana Shashanka

Boston University

Bhiksha Raj

Carnegie Mellon University

Paris Smaragdis

Mitsubishi Electric (United States)

Institutions

Boston University

Mitsubishi Electric (United States)

Cite this study

Shashanka et al. (2007), "Sparse Overcomplete Decomposition for Single Channel Speaker Separation."

synapsesocial.com/papers/6a20849d78c6e96e5b3e8de9 — DOI: https://doi.org/10.1109/icassp.2007.366317

Also consider

Synapse has enriched 5 closely related papers on similar research questions. Consider them for comparative context:

Factorial models and refiltering for speech separation and denoising · 2003 · 173 citations
What Is the Goal of Sensory Coding? · 1994 · 1,225 citations
On the Lambert W function · 1996 · 6,166 citations
Latent variable decomposition of spectrograms for single channel speaker separation · 2006 · 65 citations
Structure Learning in Conditional Probability Models via an Entropic Prior and Parameter Extinction · 1999 · 159 citations
