Facial action units (AUs) encode the activations of facial muscle groups and play a crucial role in expression analysis and facial animation. However, current deep learning methods for AU detection focus primarily on single-image analysis, which prevents them from exploiting the rich temporal context that would make predictions more robust. Moreover, the available datasets remain small, so models trained on them tend to overfit. This paper proposes a novel AU detection method that integrates spatial and temporal information with inter-subject feature reassignment for accurate and robust AU prediction. Our method first extracts regional features from facial images. Then, to capture both temporal context and identity-independent features, we introduce a Temporal feature Combination and Feature Reassignment (TC&FR) module, which transforms single-image features into a cohesive temporal sequence and fuses features across multiple subjects. This transformation encourages the model to rely on identity-independent features and temporal context, which makes its predictions more robust. Experimental results demonstrate the improvements brought by the proposed modules and show that our method achieves state-of-the-art (SOTA) results.
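The abstract does not specify how the TC&FR module is implemented, so the following is only an illustrative sketch of the two ideas it names, not the authors' method. It assumes that "temporal combination" means grouping consecutive per-frame feature vectors into fixed-length sequences, and that "feature reassignment" means drawing each facial region's feature from a randomly chosen subject in the batch. The function names `temporal_combine` and `reassign_regions`, the `window` parameter, and the toy data are all hypothetical.

```python
import random


def temporal_combine(frame_features, window):
    """Group consecutive per-frame features into overlapping sequences.

    Assumption: a sliding window of `window` frames stands in for the
    paper's temporal feature combination step.
    """
    return [frame_features[i:i + window]
            for i in range(len(frame_features) - window + 1)]


def reassign_regions(batch, rng):
    """Mix regional features across subjects (hypothetical interpretation).

    batch[s][r] is the feature of facial region r for subject s.
    For each region, subjects are shuffled independently, so every output
    row combines regions taken from different subjects. `sources[s][r]`
    records which subject each region came from, so that region-level
    AU labels could follow their features.
    """
    n_subjects = len(batch)
    n_regions = len(batch[0])
    mixed = [[None] * n_regions for _ in range(n_subjects)]
    sources = [[None] * n_regions for _ in range(n_subjects)]
    for r in range(n_regions):
        perm = list(range(n_subjects))
        rng.shuffle(perm)
        for s in range(n_subjects):
            mixed[s][r] = batch[perm[s]][r]
            sources[s][r] = perm[s]
    return mixed, sources


# Toy data: 3 subjects, 2 regions, scalar "features" for readability.
toy_batch = [[10, 11], [20, 21], [30, 31]]
mixed, sources = reassign_regions(toy_batch, random.Random(0))

# Four frames grouped into sequences of three consecutive frames.
sequences = temporal_combine(["f0", "f1", "f2", "f3"], window=3)
```

In a real model the features would be tensors and the reassignment would likely happen inside the network during training; the sketch only shows the bookkeeping involved in mixing regions across subjects while tracking where each region came from.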
Sipeng Yang
Zhejiang Institute of Science and Technology Information
Hongyu Huang
Fuzhou University
Ying Sophie Huang
Wuchang University of Technology
Zhejiang University
Zhejiang University of Science and Technology
Yang et al. (Sat,) studied this question.
synapsesocial.com/papers/68e649fbb6db6435875dab4b — DOI: https://doi.org/10.22541/au.171843047.76223460/v1