January 1, 2017

A survey of preference-based reinforcement learning methods

Key Points

Key points are not available for this paper at this time.

Abstract

Reinforcement learning (RL) techniques optimize the accumulated long-term reward of a suitably chosen reward function. However, designing such a reward function often requires a lot of task-specifi...

AI에게 질문

Bookmark

Cite This Study

WirthChristian et al. (Sun,) studied this question.

synapsesocial.com/papers/6a0ed5789df4132b62f9bff3 https://doi.org/https://doi.org/10.5555/3122009.3208017

AI에게 질문

Bookmark