Active preference-based Gaussian process regression for reward learning and optimization | Synapse