Dynamic potential-based reward shaping | Synapse