Multimodal Emotion Recognition With Temporal and Semantic Consistency | Synapse