Cross-Modal Latent Interaction Network for VQA: Towards Multimodal Reasoning for Interactive English Learning | Synapse