April 11, 2026 · Open Access

Black Box Optimization for Ergodic Systems in Markov Chains

Key Points

Aims to optimize ergodic stochastic systems with a black-box methodology built on scalar measures that reliably indicate progress toward optimality.
Derives a recursive representation, based on a one-step-ahead fixed-local-optimal policy, for a state-value quantity that oscillates and does not converge under standard conditions.
Identifies a Lyapunov-like function whose evolution reflects the long-run behavior of the system without requiring explicit knowledge of its internal dynamics.
Provides a constructive procedure for obtaining the Lyapunov-like function.
Guarantees convergence to an optimal trajectory of the Markov chain whenever such a trajectory exists.
Shows that the Lyapunov-like function is a monotonic indicator, non-increasing over time, for any initial probability distribution.
Validates the methodology through theoretical analysis and numerical simulations.

Abstract

This paper studies a black-box methodology for optimizing ergodic stochastic systems, focusing on the construction of scalar measures that reliably indicate progress toward optimality. Our starting point is a state-value quantity that inherently exhibits oscillatory behavior and does not converge under standard conditions. We show that, despite its fluctuations, this quantity admits a recursive representation derived from a one-step-ahead fixed-local-optimal policy. The approach relies on identifying a Lyapunov-like function whose evolution reflects the long-run behavior of the system without requiring explicit knowledge of its internal dynamics. Such a function provides a monotonic indicator—non-increasing over time—that remains valid for any initial probability distribution. Whenever an optimal trajectory of the Markov chain exists, the proposed method guarantees convergence to it. We also provide a constructive procedure for obtaining the Lyapunov-like function and validate the methodology through theoretical analysis and numerical simulations.
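To make the idea of a monotonic, non-increasing indicator concrete, here is a minimal sketch in Python. It does not reproduce the paper's Lyapunov-like function or its construction, which the abstract does not specify. As a stand-in it uses a textbook fact: for an ergodic Markov chain with stationary distribution pi, the KL divergence D(mu_t || pi) of the state distribution mu_t never increases from one step to the next, whatever the initial distribution. The transition matrix `P` and the helper names `step`, `stationary`, `kl`, and `lyapunov_trace` are hypothetical and chosen only for illustration.

```python
import math

# Hypothetical 3-state ergodic chain (row-stochastic); not taken from the paper.
P = [
    [0.5, 0.4, 0.1],
    [0.2, 0.5, 0.3],
    [0.3, 0.2, 0.5],
]


def step(mu, P):
    """Advance the state distribution one step: mu_{t+1} = mu_t P."""
    n = len(P)
    return [sum(mu[i] * P[i][j] for i in range(n)) for j in range(n)]


def stationary(P, iters=10_000):
    """Approximate the stationary distribution by power iteration."""
    mu = [1.0 / len(P)] * len(P)
    for _ in range(iters):
        mu = step(mu, P)
    return mu


def kl(mu, pi):
    """KL divergence D(mu || pi), with 0 * log 0 taken as 0."""
    return sum(m * math.log(m / p) for m, p in zip(mu, pi) if m > 0)


def lyapunov_trace(mu0, P, steps=30):
    """Return D(mu_t || pi) for t = 0 .. steps - 1 (the stand-in indicator)."""
    pi = stationary(P)
    mu, values = list(mu0), []
    for _ in range(steps):
        values.append(kl(mu, pi))
        mu = step(mu, P)
    return values


if __name__ == "__main__":
    # Start from a point mass on state 0 and print the first few values.
    for t, v in enumerate(lyapunov_trace([1.0, 0.0, 0.0], P)[:5]):
        print(t, v)
```

Calling `lyapunov_trace` with different starting distributions should give a sequence that never increases and approaches zero. This mirrors, in a much simpler setting, the property the abstract attributes to its Lyapunov-like function. The sketch needs direct access to `P`, so it is not black-box in the paper's sense.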
