Reinforcement Learning Stabilization for Quadrotor UAVs via Lipschitz-Constrained Policy Regularization | Synapse