Convergence of Proximal Policy Gradient Method for Problems with Control Dependent Diffusion Coefficients | Synapse