Elementary Analysis of Policy Gradient Methods | Synapse