Two-Layered Reward Reinforcement Learning in Humanoid Robot Motion Tracking | Synapse