Diving Deep into the Motion Representation of Video-Text Models | Synapse