Contrastive Pre-training with Multi-level Alignment for Grounded Multimodal Named Entity Recognition | Synapse