Spatio-Temporal Self-Attention Network for Video Saliency Prediction | Synapse