Rethinking CLIP-based Video Learners in Cross-Domain Open-Vocabulary Action Recognition