A multimodal fusion model for real-time environment emotion recognition using audio-visual-textual features | Synapse