Less is More: CLIPBERT for Video-and-Language Learning via Sparse Sampling