Key points are not available for this paper at this time.
Until now, marginalization-based Missing Feature Theory (MFT) for speech classification has been limited to the use of Log Spectral Subband Energies (LSSEs) as features. These features are highly correlated, thus suboptimal for classification with diagonal-covariance Gaussian Mixture Models (GMMs), a common classifier in marginalization-based MFT. In this paper, we propose that Spectral Subband Centroids (SSCs) are more apt for marginalization-based MFT, as they are both decorrelated and spectrally local. Our results show that SSCs as features produce a more robust marginalization-based MFT, diagonal-covariance GMM-based, Automatic Speaker Identification (ASI) system than LSSEs as features, for at all tested SNR values (with Additive White Gaussian Noise (AWGN)). It is also shown that a fully-connected Deep Neural Network (DNN) can accurately estimate the Ideal Binary Mask (IBM) used for MFT.
Building similarity graph...
Analyzing shared references across papers
Loading...
Aaron Nicolson
Commonwealth Scientific and Industrial Research Organisation
Jack Hanson
Bethel University
James Lyons
Purdue University West Lafayette
International Journal of Signal Processing Systems
Building similarity graph...
Analyzing shared references across papers
Loading...
Nicolson et al. (Thu,) studied this question.
synapsesocial.com/papers/69d6cb5e75cae9790bed8be4 — DOI: https://doi.org/10.18178/ijsps.6.1.12-16
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: