Reinforcement learning for continuous-time optimal execution: actor–critic algorithm and error analysis | Synapse