Key points are not available for this paper at this time.
Adult facial expression recognition (FER) is essential for human-computer interaction, mental health assessment, and social robotics applications because it improves user experiences and emotional well-being. This study presents a novel attention mechanism-based transformer approach designed to capture detailed patterns in facial features and dynamically focus on the most relevant regions for enhanced accuracy. Unlike conventional deep learning approaches, our method integrates an adaptive attention mechanism and dynamic token pruning, which optimizes computational efficiency while maintaining high accuracy. The model is evaluated on five widely used datasets: FER2013, CK+, AffectNet, RAF-DB, and AFEW. It achieves state-of-the-art performance, with accuracies of 98.67% on FER2013, 99.52% on CK+, 99.3% on AffectNet, 96.3% on AFEW, and 98.45% on RAF-DB. An ablation study further validates the contribution of each model component, and comparisons with CNN-based and transformer-based approaches confirm the effectiveness of the model. These findings establish the proposed method as a significant advancement in FER, which offers a scalable and efficient solution for real-world applications.
Building similarity graph...
Analyzing shared references across papers
Loading...
Uzma Nawaz
Indian Institute of Technology Madras
Zubair Saeed
Texas A&M University at Qatar
Kamran Atif
Deakin University
IEEE Access
Texas A&M University
Deakin University
National University of Sciences and Technology
Building similarity graph...
Analyzing shared references across papers
Loading...
Nawaz et al. (Wed,) studied this question.
synapsesocial.com/papers/6a08908f113ba5b476de4700 — DOI: https://doi.org/10.1109/access.2025.3555510