Efficient Models for Long-Range Video Understanding | Synapse