What question did this study set out to answer?

This research aims to develop a method for solving online stochastic optimization problems with time-varying distributions.

May 24, 2026Open Access

A Model-Based Stochastic Augmented Lagrangian Method for Online Stochastic Optimization

Key Points

This research aims to develop a method for solving online stochastic optimization problems with time-varying distributions.
Proposed a model-based stochastic augmented Lagrangian method (MSALM) for obtaining decisions in each round.
Constructed model functions for the sample objective and constraint to lower computational complexity.
Evaluated performance using stochastic dynamic regret and constraint violation metrics.
Demonstrated that the algorithm's stochastic dynamic regret and constraint violation have a sublinear bound relative to the total number of slots T.
Showed improved performance through simulation experiments across various applications including adaptive filtering and path planning.
Validated enhanced capabilities of the algorithm when integrated with supervised learning in path planning.

Abstract

In this paper, we focus on online stochastic optimization problems in which random parameters follow time-varying distributions. In each round t, a decision is obtained from solving the current optimization problem. Then samples are drawn from distributions which are updated after obtaining the decision. The objective and constraint are updated in this process, and the updated problem is used to obtain the next decision. To solve the online stochastic optimization problem, we propose a model-based stochastic augmented Lagrangian method, which is referred to as the MSALM. In each round, we construct model functions for the sample objective and constraint functions based on their properties, which reduce computational complexity. The step size is designed in a dynamic way and decreases as t increases to accelerate convergence. Due to the setting of the online stochastic problem, we use stochastic dynamic regret and constraint violation to measure the performance of our algorithm. Under certain assumptions, we prove that our algorithm’s stochastic dynamic regret and constraint violation have a sublinear bound in terms of the total number of slots T. We design simulation experiments to verify the efficiency of our online algorithm. Its performance is evaluated on a range of information and system engineering problems, including adaptive filtering, online logistic regression, time-varying smart grid energy dispatch, online network resource allocation, and path planning. In addition, in the context of the path planning problem, we integrate our algorithm with supervised learning to demonstrate its enhanced capabilities. The experimental results validate the performance of our new algorithm in practical applications.

A Model-Based Stochastic Augmented Lagrangian Method for Online Stochastic Optimization

Key Points

Abstract

Cite This Study