July 29, 2008Open Access

Online Planning Algorithms for POMDPs

Key Points

Key points are not available for this paper at this time.

Abstract

Partially Observable Markov Decision Processes (POMDPs) provide a rich framework for sequential decision-making under uncertainty in stochastic domains. However, solving a POMDP is often intractable except for small problems due to their complexity. Here, we focus on online approaches that alleviate the computational complexity by computing good local policies at each decision step during the execution. Online algorithms generally consist of a lookahead search to find the best action to execute at each time step in an environment. Our objectives here are to survey the various existing online POMDP methods, analyze their properties and discuss their advantages and disadvantages; and to thoroughly evaluate these online approaches in different environments under various metrics (return, error bound reduction, lower bound improvement). Our experimental results indicate that state-of-the-art online heuristic search methods can handle large POMDP domains efficiently.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Stéphane Ross

Google (United States)

Joëlle Pineau

Canadian Institute for Advanced Research

S. Paquet

Université de Pau et des Pays de l'Adour

Journals

Journal of Artificial Intelligence Research

Actions

Institutions

McGill University

Université Laval

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Online Planning Algorithms for POMDPs

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study