LXMERT: Learning Cross-Modality Encoder Representations from Transformers | Synapse