Audio-Vision Contrastive Learning for Phonological Class Recognition | Synapse