What question did this study set out to answer?

The aim is to develop an efficient online POMDP solver for improving long-horizon motion planning under uncertainty.

June 28, 2026Open Access

Think fast and far: Long-horizon online POMDP planning via rapid state sampling

Key Points

The aim is to develop an efficient online POMDP solver for improving long-horizon motion planning under uncertainty.
Proposed a new solver called rop-ras3 using rapid state space sampling techniques.
Evaluated on various long-horizon POMDPs with up to 3000 lookahead steps and 35-dimensional state spaces.
Demonstrated capability with physical robot applications.
rop-ras3 outperforms existing methods by multiple folds in success rates for various POMDP scenarios.
Convergence to near-optimal solution is achieved at a rate dependent on sampled actions rather than action space size.

Abstract

Partially observable Markov decision processes ( pomdp s) are a general and principled framework for motion planning under uncertainty. Despite tremendous improvement in the scalability of pomdp solvers, long-horizon pomdp s remain difficult to solve. To alleviate the difficulty, this paper proposes a new approximate online pomdp solver, called reference-based online pomdp planning via rapid state space sampling ( rop-ras3 ). rop-ras3 uses novel extremely fast sampling-based motion planning techniques to sample the state space and generate a diverse set of macro-actions online, which are then used to bias belief-space sampling and infer high-quality policies without requiring exhaustive enumeration of the action space—a fundamental constraint for modern online pomdp solvers. rop-ras3 converges to a near-optimal reference-based solution at a rate that depends on the number of sampled actions, rather than the size of the action space. rop-ras3 is evaluated on various long-horizon pomdp s with up to 3000 lookahead steps and 35-dimensional state spaces, where the state, action and observation spaces can be continuous, discrete, or a hybrid of discrete and continuous. Although the reference-based optimal solution may not be the same as the optimal pomdp solution, empirical results indicate that in all of these problems, in terms of success rate, rop-ras3 outperforms other state-of-the-art methods by up to multiple folds . We also demonstrate the capability of our approach on a physical robot demonstration. This work extends the theory and empirical results of our ISRR24 paper. Code can be found at https://github.com/RDLLab/ROPRAS3 .

Think fast and far: Long-horizon online POMDP planning via rapid state sampling

Key Points

Abstract

Cite This Study

Also Consider

Also Consider