May 1, 2012

Stunt driving via policy search

Abstract

To explore or exploit? In this paper, we discuss the long-standing exploration-exploitation dilemma in the context of designing a learning controller for stunt-style driving with scarce samples. By making efficient use of a single demonstration by an expert, our algorithm leverages our intuitive understanding of driving to extract a coarse dynamics model from the collected driving data, then formulates the policy search as a gradient update with a specially designed cost function. Both theoretical and empirical results are detailed and discussed.
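
The pipeline summarized in the abstract (fit a coarse dynamics model from one demonstration, then improve a policy by gradient updates on a cost) can be illustrated with a toy sketch. This is not the paper's algorithm, vehicle model, or cost function. The scalar linear system, the linear feedback policy `u = -k * x`, the quadratic cost, the finite-difference gradient, the step-halving rule, and every name and constant below are assumptions chosen only to keep the example small.

```python
import random


def collect_demonstration(steps=50, seed=0):
    """Hypothetical 'expert' run on an assumed true system x' = 1.1*x + 0.5*u."""
    rng = random.Random(seed)
    x, data = 1.0, []
    for _ in range(steps):
        # The expert's feedback plus small input noise, so x and u are not collinear.
        u = -0.8 * x + rng.gauss(0.0, 0.1)
        x_next = 1.1 * x + 0.5 * u
        data.append((x, u, x_next))
        x = x_next
    return data


def fit_linear_model(transitions):
    """Least-squares fit of x_next ~ a*x + b*u via the 2x2 normal equations."""
    sxx = sum(x * x for x, _, _ in transitions)
    sxu = sum(x * u for x, u, _ in transitions)
    suu = sum(u * u for _, u, _ in transitions)
    sxy = sum(x * y for x, _, y in transitions)
    suy = sum(u * y for _, u, y in transitions)
    det = sxx * suu - sxu * sxu
    a = (sxy * suu - sxu * suy) / det
    b = (sxx * suy - sxu * sxy) / det
    return a, b


def rollout_cost(k, a, b, x0=1.0, horizon=20, effort_weight=0.1):
    """Simulate the learned model under u = -k*x and sum an assumed quadratic cost."""
    x, cost = x0, 0.0
    for _ in range(horizon):
        u = -k * x
        cost += x * x + effort_weight * u * u
        x = a * x + b * u
    return cost


def policy_search(a, b, k0=0.0, iters=100, eps=1e-5):
    """Gradient descent on the gain k, using a central finite-difference gradient."""
    k = k0
    for _ in range(iters):
        grad = (rollout_cost(k + eps, a, b) - rollout_cost(k - eps, a, b)) / (2 * eps)
        current = rollout_cost(k, a, b)
        step = 1.0
        # Halve the step until the model-predicted cost actually decreases.
        while step > 1e-12 and rollout_cost(k - step * grad, a, b) >= current:
            step *= 0.5
        if step <= 1e-12:
            break  # no improving step found; treat as converged
        k -= step * grad
    return k


demo = collect_demonstration()
a_hat, b_hat = fit_linear_model(demo)
k_star = policy_search(a_hat, b_hat)
print("model:", a_hat, b_hat, "gain:", k_star)
```

In this sketch all policy improvement happens on the fitted model, so no further samples from the real system are needed after the single demonstration. That is the sample-scarcity trade-off the abstract alludes to: a coarser model costs less data but can mislead the search.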

Cite This Study

Lau et al. (2012) studied this question.

synapsesocial.com/papers/6a1cb88b5b2142ad731d9e05 https://doi.org/10.1109/icra.2012.6225164