Diffusion Mask-Driven Visual-language Tracking | Synapse