Principled and Tractable RL for Reasoning with Diffusion Language Models | Synapse