Reinforcement Learning Maximized-Actor-Critic(MAC) Method Based on Policy-Gradient | Synapse