Reparameterization Proximal Policy Optimization | Synapse