DuoFormer: Leveraging Hierarchical Visual Representations by Local and Global Attention | Synapse