Audio-driven facial animation by joint end-to-end learning of pose and emotion | Synapse