Combining Automated Optimisation of Hyperparameters and Reward Shape | Synapse