Multiscale Feature Extraction and Fusion of Image and Text in VQA | Synapse