Design and Application of Deep Reinforcement Learning Algorithms Based on Unbiased Exploration Strategies for Value Functions | Synapse