August 7, 2025Open Access

Enhancing dysarthria severity classification: efficient audio based deep learning models

Key Points

The proposed deep learning method effectively classifies dysarthria severity, achieving accuracy over 98%.
Using advanced features like MFCC and STFT images enhances precision in detecting dysarthria.
Hybrid models, such as CNN-GRU, are utilized for comprehensive analysis and classification of dysarthria.
This approach could lead to personalized therapy strategies based on severity classification, improving patient outcomes.

Abstract

Abstract A complex motor speech disorder, dysarthria makes diagnosis and its severity classification extremely challenging, thereby affecting suitable therapy and intervention strategies. This paper presents a deep learning-based method based on TORGO dataset to overcome these challenges. Moreover, the problem statement focuses on the difficulty of exactly spotting dysarthria and assessing its degree of severity using traditional methods, which usually lack precision and efficiency. This work presents a new method combining advanced acoustic feature extraction techniques, such Mel-frequency cepstral coefficients (MFCC) and spectrogram analysis, with state-of- the-art neural network and its hybrid architectures such convolutional neural networks (CNNs), long- and short-term memory (LSTM) with CNN, and gated recurrent unit (GRU) combined with CNN. It offers an extensive framework for assessing the degree of dysarthria and also uses short-time Fourier transform (STFT) images obtained from a dataset for severity classification. The proposed CNN model obtained an accuracy of 98.2% using Mel-spectrogram for detecting the dysarthria and the hybrid CNN-GRU model reached an accuracy of 97% using the STFT images for classifying dysarthria based on its severity. Moreover, this work highlights the ability of proposed deep learning models to offer tailored therapy approaches depending on degree of severity and automates dysarthria diagnosis process.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Venugopal Koikal Varma

Arun Jana

Arijit Samal

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Enhancing dysarthria severity classification: efficient audio based deep learning models

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study