Active Preference-Based Learning of Reward Functions | Synapse