Sequence Modeling and Feature Fusion for Multimodal Emotion Recognition | Synapse