On Stochastic Optimal Control and Reinforcement Learning by Approximate Inference | Synapse