Dynamic programming with incomplete information to overcome navigational uncertainty in POMDPs

Key Points

Navigation policies constructed using dynamic programming with incomplete information show enhanced safety outcomes.
The approach outperforms traditional dynamic programming for Markov decision processes (MDPs) and, combined with controlled sensing, lowers measurement costs.
Incorporating controlled sensing methods contributes to the overall performance and cost-efficiency of the navigation policies.
This approach highlights the potential for better decision-making under uncertainty in navigational tasks.

Abstract

Using a novel, generalizable nautical navigation environment, we show how dynamic programming can be used when only incomplete information about a partially observed Markov decision process (POMDP) is available. By incorporating uncertainty into our model, we show that navigation policies can be constructed that maintain safety and outperform the baseline of traditional dynamic programming for Markov decision processes (MDPs). By adding controlled sensing methods, we show that these policies can also lower measurement costs.
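The abstract does not spell out how uncertainty enters the model. As a rough illustration only, the sketch below shows the standard discrete Bayes belief update that POMDP methods commonly plan over. The function name `belief_update`, the single-action simplification, and the two-state toy matrices are hypothetical and are not taken from the paper.

```python
def belief_update(belief, transition, observation_model, obs):
    """One discrete Bayes-filter step for a fixed action (illustrative sketch).

    belief[s]                 : prior probability of hidden state s
    transition[s][s2]         : assumed P(next state s2 | state s) for the chosen action
    observation_model[s2][o]  : assumed P(observation o | next state s2)
    obs                       : index of the observation actually received
    """
    n = len(belief)
    # Prediction: push the prior belief through the transition model.
    predicted = [
        sum(belief[s] * transition[s][s2] for s in range(n))
        for s2 in range(n)
    ]
    # Correction: weight each predicted state by the observation likelihood.
    weighted = [predicted[s2] * observation_model[s2][obs] for s2 in range(n)]
    total = sum(weighted)
    if total == 0.0:
        raise ValueError("observation has zero probability under the model")
    # Normalize so the posterior belief sums to one.
    return [w / total for w in weighted]


# Toy two-state example (hypothetical numbers, e.g. "safe water" vs "hazard").
toy_transition = [[0.9, 0.1], [0.2, 0.8]]
toy_observation = [[0.8, 0.2], [0.3, 0.7]]
posterior = belief_update([0.5, 0.5], toy_transition, toy_observation, obs=0)
```

In a controlled sensing setting, a policy could additionally decide whether a measurement is worth its cost before calling such an update; when no measurement is taken, only the prediction step would apply.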
