What question did this study set out to answer?

The study aims to introduce the linear programming approach to approximate dynamic programming, a way of computing approximate solutions to Markov decision processes that are too large to solve exactly.

May 16, 2026 · Open Access

OR for the classroom: the linear programming approach to approximate dynamic programming for Markov decision processes

Key Points

Presents three solution techniques: constraint sampling, constraint generation, and compact reformulation.
Focuses on infinite-horizon problems with a discounted-reward criterion.
Assumes audience familiarity with Markov decision processes, linear programming, and dynamic programming foundations.
Targets settings with a discrete, high-dimensional state space, where the three techniques are most suitable.
Uses a running example to anchor the key ideas and make the methods more accessible.

Abstract

When the curse of dimensionality prevents an exact solution to a Markov decision process, approximate dynamic programming methods seek approximate solutions. This tutorial introduces the linear programming approach to approximate dynamic programming and three typical solution techniques, which are most suited for settings with a discrete and high-dimensional state space: constraint sampling, constraint generation, and compact reformulation. In our presentation, we assume that the target audience is familiar with the foundations of Markov decision processes, dynamic programming, and linear programming. Even though the ideas we present are generally applicable, we explain the core ideas only in the context of infinite-horizon problems with a discounted-reward criterion. A running example is used throughout to anchor the key ideas and make the methods more accessible.
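
As a point of reference, the linear programs that this kind of approach is usually built on can be sketched as follows. This is the standard textbook form for a discounted-reward problem, not necessarily the paper's own formulation. All notation is an assumption introduced here for illustration: states $\mathcal{S}$, actions $\mathcal{A}$, reward $r$, transition probabilities $p$, discount factor $\gamma$, state-relevance weights $\nu$, basis functions $\phi_k$, and weights $\beta_k$.

```latex
% Exact LP for an infinite-horizon, discounted-reward MDP (assumed notation):
% r(s,a) is the one-step reward, p(s' | s,a) the transition probability,
% \gamma \in [0,1) the discount factor, \nu(s) > 0 the state-relevance weights.
\begin{align*}
\min_{V} \quad & \sum_{s \in \mathcal{S}} \nu(s)\, V(s) \\
\text{s.t.} \quad & V(s) \;\ge\; r(s,a) + \gamma \sum_{s' \in \mathcal{S}} p(s' \mid s,a)\, V(s')
  \qquad \forall\, s \in \mathcal{S},\ a \in \mathcal{A}.
\end{align*}

% Approximate LP: restrict V to the span of K basis functions,
% V(s) \approx \sum_{k=1}^{K} \beta_k \phi_k(s), and optimize over \beta.
\begin{align*}
\min_{\beta} \quad & \sum_{s \in \mathcal{S}} \nu(s) \sum_{k=1}^{K} \beta_k\, \phi_k(s) \\
\text{s.t.} \quad & \sum_{k=1}^{K} \beta_k\, \phi_k(s) \;\ge\; r(s,a)
  + \gamma \sum_{s' \in \mathcal{S}} p(s' \mid s,a) \sum_{k=1}^{K} \beta_k\, \phi_k(s')
  \qquad \forall\, s \in \mathcal{S},\ a \in \mathcal{A}.
\end{align*}
```

Under these assumptions, the approximate program has only $K$ variables but still one constraint per state-action pair. That remaining difficulty is what constraint sampling, constraint generation, and compact reformulation each address.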
