Key points are not available for this paper at this time.
This paper compares the performance of three popular machine learning algorithms for speech emotion recognition - Multi-Layer Perceptron (MLP), Decision Tree, and Convolutional Neural Network (CNN). The MLP model achieved competitive accuracy of 0.83 while being computationally efficient and easy to train. The Decision Tree algorithm, which is a popular technique for categorization tasks, achieved an accuracy score of 0.65. The CNN model exhibited superior performance compared to the MLP and Decision Tree models, achieving an accuracy score of 0.86 and optimal performance across all emotion classes. The study also found that certain emotions are more difficult to discern than others. While the research highlights the potential of CNN models in speech and audio recognition, there is still scope for improvement, particularly in recognizing challenging emotions. Overall, this paper provides significant insights into the efficiency of several algorithms in speech emotion recognition and recommends future research directions.
Building similarity graph...
Analyzing shared references across papers
Loading...
Ishan Dhakal
Navin Kumar Sah
Nepal Medical College Teaching Hospital
Kwok Wei Ling
Jain University
Building similarity graph...
Analyzing shared references across papers
Loading...
Dhakal et al. (Thu,) studied this question.
synapsesocial.com/papers/6a08043c09b3c820153794e3 — DOI: https://doi.org/10.1109/icccnt56998.2023.10308063