Combining modality specific deep neural networks for emotion recognition in video | Synapse