December 1, 1982

The variance of discounted Markov decision processes

Key Points

Key points are not available for this paper at this time.

Abstract

Formulae are presented for the variance and higher moments of the present value of single-stage rewards in a finite Markov decision process. Similar formulae are exhibited for a semi-Markov decision process. There is a short discussion of the obstacles to using the variance formula in algorithms to maximize the mean minus a multiple of the standard deviation.

Ask AI

Helpful

Bookmark

Cite This Study

Matthew J. Sobel (Wed,) studied this question.

synapsesocial.com/papers/6a16d9f8f3be5e880d6ba009 https://doi.org/https://doi.org/10.2307/3213832

Ask AI

Helpful

Bookmark