End-to-End Dense Video Captioning with Masked Transformer | Synapse