Online Bandit Learning with Offline Preference Data | Synapse