Cross-Modal Contextualized Diffusion Models for Text-Guided Visual Generation and Editing | Synapse