Behavior Preference Regression for Offline Reinforcement Learning | Synapse