Emergent Semantics from Disjoint Modalities: Unsupervised Cross-Domain Vision-Language Grounding | Synapse