Hindsight PRIORs for Reward Learning from Human Preferences | Synapse