Grounding spatial language for video search | Synapse