Dual-level Collaborative Transformer for Image Captioning | Synapse