Bilingual–Visual Consistency for Multimodal Neural Machine Translation | Synapse