Exploiting multimodal video semantic hierarchy for emotion recognition in E-learning | Synapse