Non-differentiable Reward Optimization for Diffusion-based Autonomous Motion Planning | Synapse