Learning Video Representations from Large Language Models | Synapse