Tag‐inferring and tag‐guided Transformer for image captioning | Synapse