Key points are not available for this paper at this time.
Speech Emotion Recognition (SER) is the most prominent and emerging field in the digital signal processing area. Although plenty of research work has been carried out in this area, emotion recognition in a real-time environment is very challenging. This is because of the increased number of parameters considered for emotion recognition, like the voice, tone, pitch, and quality of the audio signal. The previous research papers do not cover all the deep learning techniques and the feature extraction techniques for speech emotion recognition. In this paper, the techniques used to analyze the emotion along with the recording environments are reviewed. The state-of-the-art deep learning techniques are reviewed and the dataset and feature extraction techniques used for this analysis are also considered in this paper.
Jothimani et al. (Wed,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: