Reinforcement Learning from Human Feedback with Active Queries | Synapse