Spectral Subband Centroids for Robust Speaker Identification Using Marginalization-based Missing Feature Theory

Abstract

Until now, marginalization-based Missing Feature Theory (MFT) for speech classification has been limited to the use of Log Spectral Subband Energies (LSSEs) as features. These features are highly correlated and thus suboptimal for classification with diagonal-covariance Gaussian Mixture Models (GMMs), a common classifier in marginalization-based MFT. In this paper, we propose that Spectral Subband Centroids (SSCs) are more apt for marginalization-based MFT, as they are both decorrelated and spectrally local. Our results show that, in an Automatic Speaker Identification (ASI) system built on marginalization-based MFT and diagonal-covariance GMMs, SSC features are more robust than LSSE features at all tested SNR values under Additive White Gaussian Noise (AWGN). It is also shown that a fully-connected Deep Neural Network (DNN) can accurately estimate the Ideal Binary Mask (IBM) used for MFT.
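
The two ideas the abstract combines can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names `subband_centroids` and `marginal_gmm_loglik` are hypothetical, the subbands are assumed to be rectangular (non-overlapping, unweighted), `gamma` is an assumed exponent applied to the power spectrum, and the mask is assumed to be binary with 1 marking a reliable feature. The first function computes each subband's centroid as the power-weighted mean frequency within the band. The second evaluates a diagonal-covariance GMM on the reliable dimensions only, which is what marginalizing out the unreliable dimensions reduces to when the covariance is diagonal.

```python
import math


def subband_centroids(power, freqs, band_edges, gamma=1.0):
    """Power-weighted mean frequency of each rectangular subband.

    Assumption: bands are the half-open intervals [lo, hi) given by
    consecutive entries of band_edges; no filter shaping is applied.
    """
    centroids = []
    for lo, hi in zip(band_edges[:-1], band_edges[1:]):
        num = 0.0
        den = 0.0
        for f, p in zip(freqs, power):
            if lo <= f < hi:
                w = p ** gamma
                num += f * w
                den += w
        # Fall back to the band midpoint when the band holds no energy.
        centroids.append(num / den if den > 0 else 0.5 * (lo + hi))
    return centroids


def marginal_gmm_loglik(x, mask, weights, means, variances):
    """Log-likelihood of x under a diagonal-covariance GMM,
    using only the dimensions where mask is 1 (reliable).

    Assumption: all mixture weights are strictly positive.
    """
    comp_logs = []
    for w, mu, var in zip(weights, means, variances):
        ll = math.log(w)
        for xd, keep, m, v in zip(x, mask, mu, var):
            if keep:
                ll += -0.5 * (math.log(2 * math.pi * v) + (xd - m) ** 2 / v)
        comp_logs.append(ll)
    # Log-sum-exp over mixture components for numerical stability.
    top = max(comp_logs)
    return top + math.log(sum(math.exp(c - top) for c in comp_logs))
```

In a speaker identification setting, one would typically sum this per-frame log-likelihood over all frames of an utterance for each speaker's GMM and choose the speaker with the largest total. The sketch covers plain marginalization only; variants that bound the unreliable dimensions instead of discarding them are not shown.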

Authors

Aaron Nicolson

Commonwealth Scientific and Industrial Research Organisation

Jack Hanson

Bethel University

James Lyons

Purdue University West Lafayette

Journals

International Journal of Signal Processing Systems

Cite this study

Nicolson et al. (Thu,) studied this question.

synapsesocial.com/papers/69d6cb5e75cae9790bed8be4 — DOI: https://doi.org/10.18178/ijsps.6.1.12-16

Also consider

Synapse has enriched 5 closely related papers. Consider them for comparative context:

Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences · 1980 · 5,303 citations
Greedy Layer-Wise Training of Deep Networks · 2007 · 4,703 citations
Spectral subband centroid features for speech recognition · 2002 · 147 citations
Robust automatic speech recognition with missing and unreliable acoustic data · 2001 · 595 citations
Reconstruction of missing features for robust speech recognition · 2004 · 210 citations
