DuoFormer: Leveraging Hierarchical Representations by Local and Global Attention Vision Transformer | Synapse