April 1, 1979

Linear Programming and Markov Decision Chains

Puntos clave

Los puntos clave no están disponibles para este artículo en este momento.

Resumen

In this paper we show that for a finite Markov decision process an average optimal policy can be found by solving only one linear programming problem. Also the relation between the set of feasible solutions of the linear program and the set of stationary policies is analyzed.

Me gusta

Guardar

Cite This Study

Hordijk et al. (Sun,) studied this question.

synapsesocial.com/papers/6a2263c4ffccceb004b6f8bb https://doi.org/https://doi.org/10.1287/mnsc.25.4.352

Me gusta

Guardar