Key points are not available for this paper at this time.
Reinforcement learning (RL) techniques optimize the accumulated long-term reward of a suitably chosen reward function. However, designing such a reward function often requires a lot of task-specifi...
WirthChristian et al. (Sun,) studied this question.