Enhancing Reinforcement Learning with Label-Sensitive Reward for Natural Language Understanding | Synapse