Multimodal Attention-Based Instruction-Following Part-Level Affordance Grounding