Exploring and Distilling Cross-Modal Information for Image Captioning | Synapse