Unifying Visual and Vision-Language Tracking via Contrastive Learning | Synapse