CoVAR: Co-generation of Video and Action for Robotic Manipulation via Multi-Modal Diffusion