Policy Gradient Methods for Robotics | Synapse