Masked Autoencoders As Spatiotemporal Learners | Synapse