VinVL: Revisiting Visual Representations in Vision-Language Models | Synapse