MMoFusion: Multi-modal Co-Speech Motion Generation with Diffusion Model