mDPO: Conditional Preference Optimization for Multimodal Large Language Models | Synapse