Leveraging Text-to-Image Diffusion Models for Unsupervised Visual Object Tracking | Synapse