Dynamic preferences in multi-criteria reinforcement learning | Synapse