Lightening the Load: Lightweighting Multimodal Understanding for Visual Grounding Tasks | Synapse