What question did this study set out to answer?

This study aims to improve automatic sleep staging from polysomnography (PSG) by making multimodal fusion of EEG, EOG, and EMG signals robust to non-stationary cross-channel artifacts.

February 11, 2026 | Open Access

G-CMTF Net: Spectro-Temporal Disentanglement and Reliability-Aware Gated Cross-Modal Temporal Fusion for Robust PSG Sleep Staging

Key Points

Developed G-CMTF Net for end-to-end processing of EEG, EOG, and EMG signals.
Implemented a spectro-temporal disentanglement frontend that learns multi-scale temporal features and incorporates FFT-derived band-power embeddings.
Utilized reliability-aware gating to regulate cross-modal contributions and suppress artifacts.
Employed a convolution-augmented self-attention encoder to model long-range sleep dynamics.
Achieved Macro-F1/accuracy (ACC) of 81.3%/85.5% on Sleep-EDF-20 and 78.2%/83.4% on Sleep-EDF-78.
Maintained high sensitivity and geometric-mean performance on transitional epochs.
Demonstrated effective suppression of artifact-prone auxiliary inputs, limiting noise transfer.
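
The metrics named in the key points can be illustrated with a short sketch. It assumes five sleep stages encoded as integers 0 to 4 and defines the per-class geometric mean as the square root of sensitivity times specificity, which is a common convention but is not confirmed by this summary. The function names `confusion_matrix` and `staging_metrics` are hypothetical and are not taken from the paper.

```python
from math import sqrt


def confusion_matrix(y_true, y_pred, n_classes):
    """Rows are true stages, columns are predicted stages."""
    cm = [[0] * n_classes for _ in range(n_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def staging_metrics(y_true, y_pred, n_classes=5):
    """Accuracy, Macro-F1, per-class sensitivity, and per-class G-mean.

    Assumption: G-mean = sqrt(sensitivity * specificity) per class.
    """
    cm = confusion_matrix(y_true, y_pred, n_classes)
    total = sum(sum(row) for row in cm)
    acc = sum(cm[k][k] for k in range(n_classes)) / total

    f1s, sens, gmeans = [], [], []
    for k in range(n_classes):
        tp = cm[k][k]
        fn = sum(cm[k]) - tp
        fp = sum(cm[r][k] for r in range(n_classes)) - tp
        tn = total - tp - fn - fp
        recall = tp / (tp + fn) if (tp + fn) else 0.0
        spec = tn / (tn + fp) if (tn + fp) else 0.0
        denom = 2 * tp + fp + fn
        f1s.append(2 * tp / denom if denom else 0.0)
        sens.append(recall)
        gmeans.append(sqrt(recall * spec))

    return {
        "acc": acc,
        "macro_f1": sum(f1s) / n_classes,  # unweighted mean over stages
        "sensitivity": sens,
        "gmean": gmeans,
    }
```

Macro-F1 weights every stage equally, so rare stages such as N1 influence it as much as common ones. That is one reason it is usually reported alongside accuracy for sleep staging.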

Abstract

Automatic sleep staging from polysomnography is challenged by marked spectro-temporal heterogeneity and non-stationary cross-channel artifacts, which often undermine naïve multimodal fusion. To address this, a Gated Cross-Modal and Temporal Fusion Network (G-CMTF Net) is proposed as an end-to-end model operating on 30 s EEG epochs and auxiliary EOG and EMG signals, in which cross-modal contributions are regulated through reliability-aware gating. A spectro-temporal disentanglement frontend learns multi-scale temporal features while incorporating FFT-derived band-power embeddings to preserve physiologically meaningful oscillatory cues. At the epoch level, gated fusion suppresses artifact-prone auxiliary inputs, thereby limiting noise transfer into a shared latent space. Long-range sleep dynamics are modeled via a convolution-augmented self-attention encoder that captures both local morphology and transition structure. On Sleep-EDF-20 and Sleep-EDF-78, G-CMTF Net achieves Macro-F1/ACC of 81.3%/85.5% and 78.2%/83.4%, respectively, while maintaining high sensitivity and geometric-mean performance on transitional epochs, consistent with the function of reliability-aware gated fusion under non-stationary auxiliary artifacts. From a symmetry perspective, the proposed framework enforces a structured balance between heterogeneous modalities by promoting representational consistency while adaptively suppressing asymmetric noise contributions.
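
The idea of reliability-aware gated fusion can be sketched in a few lines. This is only a generic illustration and not the authors' architecture: it assumes each auxiliary modality (EOG, EMG) has already been embedded to the same length as the EEG embedding, that a scalar gate per modality is produced by a hypothetical linear scorer followed by a sigmoid, and that fusion is additive. The names `reliability_gate` and `gated_fusion` are invented for this example.

```python
import math


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def reliability_gate(aux_features, weights, bias):
    """Map an auxiliary embedding to a scalar gate in (0, 1).

    Assumption: a linear scorer; in a trained model the weights would be
    learned so that artifact-laden epochs receive gates near 0.
    """
    score = sum(w * x for w, x in zip(weights, aux_features)) + bias
    return sigmoid(score)


def gated_fusion(eeg, aux_list, gates):
    """Add each auxiliary embedding to the EEG embedding, scaled by its gate."""
    fused = list(eeg)
    for aux, g in zip(aux_list, gates):
        for i, v in enumerate(aux):
            fused[i] += g * v
    return fused


# Example: the EOG embedding is fully suppressed, the EMG one is half-weighted.
eeg = [1.0, 2.0]
eog = [10.0, 10.0]
emg = [-4.0, 4.0]
fused = gated_fusion(eeg, [eog, emg], gates=[0.0, 0.5])
```

With a gate of 0 an auxiliary channel contributes nothing to the fused representation, which is the sense in which gating limits noise transfer into the shared latent space.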
