Step-Aware Policy Optimization for Reasoning in Diffusion Large Language Models | Synapse