Probing Image-Language Transformers for Verb Understanding | Synapse