What question did this study set out to answer?

The main goal is to enhance the efficiency of trajectory optimization through probabilistic methods.

February 8, 2026

Leveraging Probabilistic Optimal Control for Efficient Trajectory Optimization

Key Points

The main goal is to enhance the efficiency of trajectory optimization through probabilistic methods.
Reformulated trajectory optimization as risk-sensitive stochastic optimal control.
Introduced probabilistic policies to navigate the optimization landscape.
Used the expectation-maximization algorithm for converging to optimal solutions.
Utilized Gaussian linear affine controllers for approximating probabilistic policies.
Applied sigma-point techniques for uncertainty quantification in system dynamics.
Demonstrated improved numerical stability in algorithm performance.
Achieved accelerated convergence compared to traditional methods.
Validated methods through numerical simulations on various nonlinear systems.

Abstract

ABSTRACT This paper discusses two algorithms tailored to discrete‐time deterministic finite‐horizon nonlinear optimal control problems or so‐called trajectory optimization problems. Our key aim is to probe the optimization landscape more efficiently during iterations than traditional gradient‐based approaches do. This is achieved by first reformulating the problem as a risk‐sensitive stochastic optimal control (RSOC) and introducing probabilistic policies. The problem can then be cast as an instance of probabilistic optimal control. In turn this allows us to address the problem using the expectation‐maximization (EM) algorithm which produces a fixed‐point iteration of probabilistic policies that converge to the original optimum. These manipulations facilitate an alternative manner to search the original optimization space without affecting the outcome. In practice, we approximate the probabilistic policies using Gaussian linear affine controllers and rely on sigma‐point uncertainty quantification methods to propagate uncertainty through the system dynamics. The proposed algorithms are structurally closest related to the differential dynamic programming algorithm and related methods that use sigma‐point methods to avoid direct gradient evaluations. However, instead of establishing an ad hoc numerical iteration, a principled recursion is established that provably converges to the true optimum. The algorithms feature improved numerical stability and accelerated convergence as is demonstrated through numerical simulations on different nonlinear systems.

Bookmark

Leveraging Probabilistic Optimal Control for Efficient Trajectory Optimization

Key Points

Abstract

Cite This Study

Also Consider

Also Consider