Multimodal interpretable image recognition network via language-guided global-local collaboratively alignment | Synapse