Key points are not available for this paper at this time.
Recent research has shown the benefit of framing problems of imitation learning as solutions to Markov Decision Problems. This approach reduces learning to the problem of re- covering a utility function that makes the behavior induced by a near-optimal policy closely mimic demonstrated behavior. In this work, we develop a probabilistic approach based on the principle of maximum entropy. Our approach provides a well-defined, globally normalized distribution over decision sequences, while providing the same performance guarantees as existing methods. We develop our technique in the context of modeling real world navigation and driving behaviors where collected data is inherently noisy and imperfect. Our probabilistic approach enables modeling of route preferences as well as a powerful new approach to inferring destinations and routes based on partial trajectories.
Building similarity graph...
Analyzing shared references across papers
Loading...
Brian D. Ziebart
Andrew L. Maas
J. Andrew Bagnell
Carnegie Mellon University
Building similarity graph...
Analyzing shared references across papers
Loading...
Ziebart et al. (Mon,) studied this question.
www.synapsesocial.com/papers/6a0a51af8e4d6c8168573ee1 — DOI: https://doi.org/10.1184/r1/6555512
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: