Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding | Synapse