Multi-Modal Prompting for Open-Vocabulary Video Visual Relationship Detection