DiFFPO: Training Diffusion LLMs to Reason Fast and Furious via Reinforcement Learning | Synapse