What question did this study set out to answer?

This research aims to enhance facial emotion recognition by addressing key challenges in model robustness and feature representation.

March 21, 2026Open Access

SiaCon-DetNet with HySHO: a cutting-edge transformer-based deep learning framework for emotion-aware facial recognition

Puntos clave

This research aims to enhance facial emotion recognition by addressing key challenges in model robustness and feature representation.
Integration of SiaCon-DetNet and HySHO algorithms.
Face region detection using a Siamese convolutional network.
Feature enhancement through multi-head self-attention.
Dynamic adjustment of model parameters for improved learning.
Achieved up to 99.20% accuracy on the JAFFE database.
Demonstrated shorter training periods compared to existing methods.
Precision, recall, and F1-score consistently ranged from 98% to 99%.

Resumen

Facial emotion recognition (FER) plays a critical role in most applications in human–computer interaction, psychological analysis, and affective computing to make intelligent systems capable of effectively perceiving the emotions of humans. Current approaches lack sufficient strength against problems like poor feature representation, robustness of facial expression variation, and model generalization. In order to counter such shortcomings, this paper presents a new FER model that integrates the SiaCon-DetNet and HySHO algorithm. The most striking novelty of SiaCon-DetNet is its capacity to combine convolutional feature learning with transformer attention mechanisms in order to make strong detection of fine-grained facial features. Also, the suggested framework are based on its intelligent combination of bio-inspired top optimization and deep learning, resulting in an adaptive and efficient emotion detector. Meanwhile, HySHO dynamically adjusts model parameters to enhance learning convergence and reduce computation overhead. This method in the paper presumes an organized working process with the initial step being face region detection by a Siamese convolutional network and feature enhancement by multi-head self-attention in the detection transformer network. Comparative analysis of its performance indicates the new model shows better performance as compared to all other FER methods with up to 99.20% accuracy on JAFFE database and having very short training periods. Emotion-wise correlation and performance testing also validate the reliability of the proposed framework, with precision, recall, and F1-score consistently between 98–99%.

Me gusta

Guardar

Ver artículo completo