InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding | Synapse