What question did this study set out to answer?

This study aims to improve emotion recognition from EEG signals by using the Super-Resolution Superlet Transform and self-attention convolutional neural network.

April 25, 2026Open Access

EEG-based emotion recognition using super-resolution superlet transform and self-attention convolutional neural network

Key Points

This study aims to improve emotion recognition from EEG signals by using the Super-Resolution Superlet Transform and self-attention convolutional neural network.
Evaluated Super-Resolution Superlet Transform against traditional time-frequency methods and adaptive decomposition methods.
Implemented strict trial-level cross-validation to eliminate data leakage.
Tested on DEAP and DREAMER datasets for subject-dependent and subject-independent classifications of emotional arousal and valence.
Subject-dependent classification achieved 74.41% accuracy for arousal and 72.04% for valence on DEAP.
Subject-independent classification reached 70.86% accuracy for arousal and 71.36% for valence on DEAP.
Grad-CAM analysis indicated self-attention focuses on β/γ bands for arousal and α bands for valence.

Abstract

Emotion recognition plays a critical role in mental health monitoring, decision-making, and enhancing human–computer interaction. The complex, non-stationary, and multi-component nature of EEG signals presents significant challenges for accurately detecting emotional states. Traditional time–frequency (TF) methods often fail to capture rapid and transient oscillatory events in EEG signals due to limited resolution. Moreover, the lack of standardized evaluation protocols in the field has led to inflated accuracy estimates, particularly when spectrogram-level data splitting allows correlated electrode samples from the same trial to appear in both training and test sets. To address these limitations, this study presents a rigorous evaluation of the Super-Resolution Superlet Transform (SLT) against both conventional TF methods (STFT, CWT, SPWVD) and adaptive signal decomposition methods (EMD-HHT, VMD-HHT) for EEG-based emotion recognition, using a self-attention convolutional neural network (SA-CNN). All experiments employ strict trial-level cross-validation to eliminate data leakage. The framework is evaluated on the DEAP and DREAMER datasets for subject-dependent (SD) and subject-independent (SI) classifications of arousal and valence states. Under the corrected evaluation protocol, SD classification achieves accuracies of 74.41% (arousal) and 72.04% (valence) on DEAP, and 78.63% (arousal) and 76.82% (valence) on DREAMER. For SI classification, it attains 70.86% (arousal) and 71.36% (valence) on DEAP, and 73.51% (arousal) and 75.46% (valence) on DREAMER. SLT consistently outperforms all five baseline TF methods across both evaluation protocols. Ablation studies confirm the contribution of the self-attention layer, and Grad-CAM analysis demonstrates that self-attention selectively focuses on neuroscience-consistent frequency bands. • First comparison of SLT vs. adaptive decomposition for EEG emotion recognition. • Electrode-level data leakage corrected; trial-level CV reduces SD accuracy 15–23pp. • SLT outperforms all five baseline TFRs under leak-free evaluation on DEAP/DREAMER. • Grad-CAM shows self-attention targets β / γ for arousal and α bands for valence. • Comprehensive SD and SI evaluation with Friedman/Nemenyi statistical testing.

EEG-based emotion recognition using super-resolution superlet transform and self-attention convolutional neural network

Key Points

Abstract

Cite This Study