What question did this study set out to answer?

The central aim is to enhance music emotion recognition by developing a novel model that combines CNN and BiGRU with optimization techniques.

March 22, 2026 | Open Access

Construction of music emotion recognition and classification model supported by neural networks

Key Points

Developed the LOCBER model, which employs CNNs and BiGRUs for feature extraction and temporal sequence modeling.
Applied Lion Swarm Optimization to refine model parameters and minimize loss.
Evaluated the model's performance against traditional methods such as Support Vector Machines and k-Nearest Neighbors.
Achieved 97.7% classification accuracy, outperforming conventional methods by 15.3%.
Demonstrated improved convergence and stability through LSO-based optimization.

Abstract

Music Emotion Recognition (MER) is a computational field at the intersection of affective computing and audio signal processing. Earlier work classified emotion in music with traditional machine learning algorithms such as Support Vector Machines and k-Nearest Neighbors, but these methods often struggle with the intricate temporal and spectral structure of audio signals, which limits classification performance. To address this, the work proposes a new model, the Lion-Optimized CNN-BiGRU for Emotion Recognition (LOCBER), which combines Convolutional Neural Networks and Bidirectional Gated Recurrent Units for feature extraction and temporal sequence modeling. LOCBER is further tuned with the Lion Swarm Optimization (LSO) algorithm, which adjusts model parameters to minimize loss and to improve accuracy and convergence. The model achieves 97.7% classification accuracy, outperforming conventional methods by 15.3%, and LSO-based optimization shows better convergence and stability than traditional training processes. Potential applications include emotion-sensitive music recommendation systems, intelligent user interfaces, and music therapy. The experiments indicate that fusing state-of-the-art neural architectures with bio-inspired optimization can substantially improve the performance and validity of MER systems for real-world use.
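The abstract says that LSO adjusts model parameters to minimize loss, but this summary does not give the update rules. As a rough illustration of the general idea only, the sketch below runs a generic population-based search: a set of candidate parameter vectors drifts toward the best one found so far, with random perturbations that shrink over time. This is not the paper's LSO algorithm. The names `swarm_minimize` and `toy_loss` are hypothetical, and the toy quadratic is an assumed stand-in for a network's training loss.

```python
import random


def toy_loss(params):
    """Assumed stand-in for a training loss; its minimum is at (3, -1)."""
    x, y = params
    return (x - 3.0) ** 2 + (y + 1.0) ** 2


def swarm_minimize(loss, dim, bounds, pop_size=20, iterations=200, seed=0):
    """Generic swarm-style search (illustrative only, not the paper's LSO).

    Each agent proposes a move toward the current best agent plus Gaussian
    noise, and keeps the move only if it lowers its own loss.
    """
    rng = random.Random(seed)
    lo, hi = bounds
    # Random initial population inside the search box.
    pop = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(pop_size)]
    best = min(pop, key=loss)
    best_loss = loss(best)
    history = [best_loss]

    for t in range(iterations):
        # Noise scale decays linearly so the search narrows over time.
        sigma = 0.05 * (hi - lo) * (1.0 - t / iterations)
        for i, agent in enumerate(pop):
            candidate = [
                min(hi, max(lo, x + rng.random() * (b - x) + rng.gauss(0, 1) * sigma))
                for x, b in zip(agent, best)
            ]
            # Greedy acceptance: an agent never moves to a worse position.
            if loss(candidate) < loss(agent):
                pop[i] = candidate
        leader = min(pop, key=loss)
        if loss(leader) < best_loss:
            best, best_loss = list(leader), loss(leader)
        history.append(best_loss)

    return best, best_loss, history


if __name__ == "__main__":
    best, best_loss, _ = swarm_minimize(toy_loss, dim=2, bounds=(-5.0, 5.0))
    print("best parameters:", best, "loss:", best_loss)
```

In a real MER setting, `loss` would wrap a training or validation pass of the CNN-BiGRU model, which makes every evaluation far more expensive than in this toy case.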

Cite This Study

Mingming Wu studied this question.

synapsesocial.com/papers/69bf8692f665edcd009e8f2b https://doi.org/10.1186/s13636-026-00453-6
