Robust Reinforcement Learning via Leveraging Historically Optimal Policy With Regulation of Performance