Multimodal transformer augmented fusion for speech emotion recognition | Synapse