What question did this study set out to answer?

The research aims to improve EEG classification during rapid visual tasks by exploring different modeling architectures.

March 13, 2026

A Few-Layer Multilayer Perceptron is Worth Attention for EEG Classification in Rapid Serial Visual Presentation Task

Key Points

Developed DisCo-Former, a Transformer-based framework with three prior-guided components.
Used trend-periodicity disentanglement, channel-level embeddings that preserve global temporal patterns, and contrastive learning on target-adjacent non-targets for enhanced decoding.
Introduced DisCo-MLP, a multilayer perceptron variant obtained by removing the Transformer encoder.
DisCo-MLP matched or surpassed the performance of the more complex DisCo-Former.
Mean AUCs in within-subject decoding ranged from approximately 0.94 to 0.98 across two datasets.
Simplicity in design, guided by neurophysiological insights, led to improved decoding performance.
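
For readers unfamiliar with the metric, the sketch below shows one common way to compute an AUC and average it over subjects. It is a minimal illustration on made-up scores, not the study's evaluation code: the function names `roc_auc` and `mean_auc` and the toy data are assumptions introduced here.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of (target, non-target) pairs ranked correctly.

    Ties between a target and a non-target score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one target and one non-target trial")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def mean_auc(per_subject):
    """Average the AUC over subjects; each entry is a (labels, scores) pair."""
    aucs = [roc_auc(labels, scores) for labels, scores in per_subject]
    return sum(aucs) / len(aucs)


# Hypothetical toy data: two subjects, four trials each (1 = target).
subjects = [
    ([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8]),
    ([0, 1, 0, 1], [0.2, 0.9, 0.3, 0.7]),
]
print(mean_auc(subjects))
```

The pairwise form is quadratic in the number of trials, which is fine for illustration; a rank-based implementation would be preferable for large trial counts.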

Abstract

Rapid serial visual presentation (RSVP) enables efficient electroencephalography (EEG)-based brain-computer interfaces, yet single-trial decoding remains difficult due to signal overlap and multi-component entanglement. This work developed DisCo-Former, a Transformer-based framework incorporating three prior-guided components: trend-periodicity disentanglement, channel-level embeddings that preserve global temporal patterns, and contrastive learning that exploits target-adjacent non-targets. Although DisCo-Former surpassed existing approaches, analysis revealed a consistent attention collapse: attention maps became nearly uniform, and value projection weights shrank toward zero. Removing the Transformer encoder yielded DisCo-MLP, a pure multilayer perceptron (MLP) variant that preserves all remaining modules. Across two datasets and three evaluation regimes, DisCo-MLP matched or outperformed its Transformer-based counterpart. In within-subject decoding, mean AUCs ranged from approximately 0.94 to 0.98 across the two datasets, consistently exceeding strong baselines. These results indicate that, for RSVP-EEG decoding, effectiveness stems less from architectural complexity and more from modeling the signal's structure. Simplicity motivated by paradigm-specific neurophysiological priors offers a practical path to state-of-the-art performance in EEG-based interfaces.
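
The "nearly uniform attention maps" observation can be made concrete with a simple diagnostic. The sketch below is one possible way to quantify uniformity, assuming raw attention scores are available as a list of rows; it is not the authors' analysis. The helper names and the example matrices are hypothetical. A normalized entropy close to 1.0 means each query attends almost equally to every position, in which case the attention layer behaves like plain averaging.

```python
import math


def softmax(row):
    """Numerically stable softmax over one row of raw attention scores."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def normalized_entropy(probs):
    """Shannon entropy divided by log(n); 1.0 means exactly uniform."""
    n = len(probs)
    if n < 2:
        raise ValueError("need at least two positions to measure uniformity")
    h = -sum(p * math.log(p) for p in probs if p > 0)
    return h / math.log(n)


def attention_uniformity(scores):
    """Mean normalized entropy over the rows of a raw attention-score matrix."""
    rows = [softmax(r) for r in scores]
    return sum(normalized_entropy(r) for r in rows) / len(rows)


# Hypothetical examples: a collapsed (flat) map and a sharply peaked one.
collapsed = [[0.0, 0.0, 0.0], [1.0, 1.0, 1.0]]
peaked = [[10.0, 0.0, 0.0], [0.0, 10.0, 0.0]]
print(attention_uniformity(collapsed))  # close to 1.0
print(attention_uniformity(peaked))     # close to 0.0
```

A companion check for the second symptom, shrinking value projections, could track the norm of the value weight matrix during training; that is left out here because the source gives no detail on how it was measured.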


Cite This Study

Zhang et al. (2026) studied this question.

synapsesocial.com/papers/69b3abb202a1e69014cccbd8 https://doi.org/10.1142/s0129065726500309
