What question did this study set out to answer?

The aim is to enhance the robustness of facial expression recognition under various occlusions by utilizing visibility assessment and adaptive expert fusion.

March 18, 2026Open Access

Visibility-Prior Guided Dual-Stream Mixture-of-Experts for Robust Facial Expression Recognition Under Complex Occlusions

Key Points

The aim is to enhance the robustness of facial expression recognition under various occlusions by utilizing visibility assessment and adaptive expert fusion.
Developed a fixed offline benchmark with diverse synthetic occlusion patterns.
Proposed the Dual-Stream Adaptive Weighting Mixture-of-Experts framework.
Implemented an occlusion-aware weighting network to balance expert contributions dynamically.
Conducted extensive experiments on public datasets and the new occlusion benchmark.
Achieved more stable recognition under complex occlusions.
Observed a smaller and more consistent performance drop compared to existing methods.

Abstract

Facial occlusion induces sample-wise reliability shifts in facial expression recognition (FER), where the usefulness of global context and local discriminative cues varies dramatically with the amount of visible facial information. Existing occlusion-robust FER studies often evaluate under limited or homogeneous occlusion settings and commonly adopt static fusion strategies, which are insufficient for complex and heterogeneous real-world occlusions. In this work, we establish a rigorous occlusion robustness evaluation protocol by constructing a fixed offline test benchmark with diverse synthetic occlusion patterns (e.g., masks, sunglasses, texture blocks, and mixed occlusions) on top of public FER test splits. We further propose a Dual-Stream Adaptive Weighting Mixture-of-Experts framework (DS-AW-MoE) that fuses a global contextual expert and a local discriminative expert via an occlusion-aware weighting network. Crucially, we introduce a facial visibility assessment as a task-agnostic prior to explicitly regulate expert contributions, enabling dynamic re-allocation of model capacity according to input-dependent feature reliability. Extensive experiments on public datasets and the constructed occlusion benchmark demonstrate that DS-AW-MoE achieves more stable recognition under complex occlusions, characterized by a smaller and more consistent performance drop. To support reproducibility under dataset license constraints, we will release an anonymous, fully runnable repository containing the complete occlusion synthesis pipeline, evaluation protocol, and configuration files, allowing researchers to reproduce the benchmark after obtaining the original datasets.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Ma et al. (Mon,) studied this question.

synapsesocial.com/papers/69ba43384e9516ffd37a4388 https://doi.org/https://doi.org/10.3390/electronics15061230

Bookmark

View Full Paper