May 1, 2013

Analyzing noise robustness of MFCC and GFCC features in speaker identification

Key Points

Key points are not available for this paper at this time.

Abstract

Automatic speaker recognition can achieve a high level of performance in matched training and testing conditions. However, such performance drops significantly in mismatched noisy conditions. Recent research indicates that a new speaker feature, gammatone frequency cepstral coefficients (GFCC), exhibits superior noise robustness to commonly used mel-frequency cepstral coefficients (MFCC). To gain a deep understanding of the intrinsic robustness of GFCC relative to MFCC, we design speaker identification experiments to systematically analyze their differences and similarities. This study reveals that the nonlinear rectification accounts for the noise robustness differences primarily. Moreover, this study suggests how to enhance MFCC robustness, and further improve GFCC robustness by adopting a different time-frequency representation.

Bookmark

Cite This Study

Zhao et al. (Wed,) studied this question.

synapsesocial.com/papers/6a1bfa1c00ee29383e9d3f49 https://doi.org/https://doi.org/10.1109/icassp.2013.6639061