Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies | Synapse