Key points are not available for this paper at this time.
The present value of the rewards associated with a discrete-time Markov process has a probability distribution which depends on the initial state. The first part of the paper applies fixed point theory to a system of equations for the distribution functions of the present value. The second part of the paper expands the model to a Markov decision process (MDP) and considers the maximization of the expected utility of the present value when the utility function is exponential.
Building similarity graph...
Analyzing shared references across papers
Loading...
Kun‐Jen Chung
Chung Yuan Christian University
Matthew J. Sobel
Case Western Reserve University
SIAM Journal on Control and Optimization
Georgia Institute of Technology
Building similarity graph...
Analyzing shared references across papers
Loading...
Chung et al. (Thu,) studied this question.
synapsesocial.com/papers/6a20ae27a5979e09a7d110fc — DOI: https://doi.org/10.1137/0325004
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: