Slow-Fast Policy Optimization: Reposition-Before-Update for LLM Reasoning | Synapse