From higher to lower: A guidance-propagation hierarchical attention for video captioning