CoVR: Learning Composed Video Retrieval from Web Video Captions | Synapse