A survey of preference-based reinforcement learning methods | Synapse