What does this research mean for the field?

Combining cepstral features (MFCC and LPCC) with Support Vector Machine (SVM) classifiers provides a highly accurate method (96.23%) for identifying native North Indian regional accents from non-native English speech, significantly outperforming Decision Tree classifiers. Novelty: ClaimNovelty.INCREMENTAL. Consensus alignment: ConsensusAlignment.NEUTRAL.

What question did this study set out to answer?

This research aims to develop an automated speaker identification system based on regional accents in North Indian languages.

May 16, 2026Open Access

Feature Extraction and Classification Technique Based Speaker Identification System of Indian Regional Accent

Key Points

This research aims to develop an automated speaker identification system based on regional accents in North Indian languages.
Collected speech data from native speakers of Hindi, Punjabi, and Bengali.
Extracted features using mel-frequency cepstral coefficients (MFCC) and linear predictive cepstral coefficients (LPCC).
Applied machine learning classification techniques such as support vector machines (SVM) and decision trees (DT) to the data.
The SVM classifier achieved an accuracy of 96.23%, outperforming the DT classifier with an accuracy of 93.75%.
Cepstral features combined with SVM classifiers provide a robust approach for identifying speakers with regional accents.
The findings indicate the potential for applications in improving speech recognition and biometric authentication.

Abstract

The speaker identification based on their accent is a challenging task, especially for Indian regional dialects, due to the subtle metric variations and close phonetic similarities. The aim of this research work is to create automated speaker identification (ASI) system that can categorise native speakers based on their regional accents in North Indian languages: Hindi (HIN), Punjabi (PUN), and Bengali (BEN). The proposed method is to collect the speech data from individuals who speak these languages as their native language, thereafter Mel-frequency cepstral coefficients (MFCC) and linear predictive cepstral coefficients (LPCC) are used to extract the features. After feature extraction, machine learning (ML) classification techniques are used, such as support vector machines (SVM) and decision trees (DT), on non-native English speech samples influenced by native accents. Experimental results demonstrate that the SVM classifier significantly outperforms the DT classifier, achieving an accuracy of 96.23% compared to 93.75%. These findings suggest that cepstral features, combined with SVM classifiers, offer a robust approach for identifying native language speakers in regional Indian accents. The proposed system has potential applications in improving speech recognition, language learning, and biometric authentication within multilingual environments.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Singh et al. (Fri,) studied this question.

synapsesocial.com/papers/6a080af2a487c87a6a40d144 https://doi.org/https://doi.org/10.1016/j.fraope.2026.100623

Bookmark

View Full Paper