Exploring structured representations in vision–language and reasoning | Synapse