Rectifying Reinforcement Learning for Reward Matching | Synapse