January 1, 1987

Discounted MDP’s: Distribution Functions and Exponential Utility Maximization

Key Points

Key points are not available for this paper at this time.

Abstract

The present value of the rewards associated with a discrete-time Markov process has a probability distribution which depends on the initial state. The first part of the paper applies fixed point theory to a system of equations for the distribution functions of the present value. The second part of the paper expands the model to a Markov decision process (MDP) and considers the maximization of the expected utility of the present value when the utility function is exponential.

Discounted MDP’s: Distribution Functions and Exponential Utility Maximization

Key Points

Abstract

Cite This Study