CoSTA: End-to-End Comprehensive Space-Time Entanglement for Spatio-Temporal Video Grounding | Synapse