What question did this study set out to answer?

The research aims to enhance automatic speaker recognition by using mel-frequency cepstral coefficients and spectrum-based features.

February 6, 2026 | Open Access

Mel-frequency cepstral coefficients and spectrum based additional features in automatic speaker recognition

Key Points

Evaluated closed-set speaker identification on two speech databases: the Solo section of the CHAINS database and the S-ADAPT emotional speech database.
Used 21 mel-frequency cepstral coefficients (MFCCs) in the feature vector.
Incorporated up to three additional features derived from the amplitude spectrum.
Achieved a maximum mean recognition accuracy of 97.11% on the CHAINS database, using 21 MFCCs and two additional features.
Attained maximum mean recognition accuracies of 98.65% on neutral speech and 98.72% on the entire S-ADAPT database, using a feature vector of 21 MFCCs.

Abstract

The efficiency of the proposed automatic speaker recognizer is evaluated using two speech databases. The feature vector consists of 21 mel-frequency cepstral coefficients (MFCCs), along with up to three additional features derived from the amplitude spectrum. The additional features are calculated based on the logarithm of the energy around the appropriate local maximum in the spectrum, the frequency of that maximum, and the logarithm of the energy of the maximum component in the spectrum across all frames of the observed signal. The speaker identification procedure for a closed set of speakers is tested on the Solo section of the CHAINS database and a speech database with expressed emotions, developed within the S-ADAPT project. On the CHAINS database, the maximum mean recognition accuracy is 97.11%, achieved with a feature vector of 21 MFCCs and two additional features. On the S-ADAPT database, the maximum mean recognition accuracies are 98.65% on neutral speech and 98.72% on the entire database, both achieved with a feature vector of 21 MFCCs.
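To make the three additional spectral features more concrete, here is a minimal Python sketch of how they might be computed per frame. It is not the authors' implementation. The abstract does not say how the "appropriate local maximum" is selected or how wide the region "around" it is, so the sketch assumes the largest non-DC bin of each frame and a window of ±`half_width` bins. The function names, `half_width`, and `eps` are hypothetical. MFCC extraction is omitted; in the setup described above, these values would be appended to the 21 MFCCs of each frame.

```python
import cmath
import math


def amplitude_spectrum(frame):
    """Naive DFT magnitudes for bins 0..N/2 (adequate for short demo frames)."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]


def extra_features(frames, sample_rate, half_width=2, eps=1e-12):
    """Return [log_peak_band_energy, peak_freq_hz, log_global_max_energy] per frame.

    Assumptions (not taken from the paper):
      * the local maximum is the largest non-DC bin of the frame's spectrum;
      * "energy around" it is the sum of squared amplitudes within
        +/- half_width bins of that peak;
      * the third feature uses the largest squared amplitude found in any
        frame of the signal, so it is the same value for every frame.
    """
    spectra = [amplitude_spectrum(f) for f in frames]
    global_max_energy = max(a * a for s in spectra for a in s[1:])
    log_global = math.log(global_max_energy + eps)

    features = []
    for frame, spec in zip(frames, spectra):
        n = len(frame)
        peak = max(range(1, len(spec)), key=lambda k: spec[k])
        lo = max(0, peak - half_width)
        hi = min(len(spec), peak + half_width + 1)
        band_energy = sum(a * a for a in spec[lo:hi])
        peak_freq_hz = peak * sample_rate / n
        features.append([math.log(band_energy + eps), peak_freq_hz, log_global])
    return features


# Demo on two synthetic frames (hypothetical data, not from either database).
sr, n = 8000, 64
frames = [
    [math.sin(2 * math.pi * 1000 * t / sr) for t in range(n)],
    [0.5 * math.sin(2 * math.pi * 2000 * t / sr) for t in range(n)],
]
for row in extra_features(frames, sr):
    print(row)
```

In practice a windowed FFT would replace the naive DFT. The naive version is used here only to keep the sketch free of external dependencies.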


Cite This Study

Jokić et al. studied this question.

synapsesocial.com/papers/698586238f7c464f2300a1c9 https://doi.org/10.2298/fuee2504663j
