On the policy improvement algorithm in continuous time | Synapse