February 1, 2007

Bias and Variance Approximation in Value Function Estimates

Abstract

We consider a finite-state, finite-action, infinite-horizon, discounted reward Markov decision process and study the bias and variance in the value function estimates that result from empirical estimates of the model parameters. We provide closed-form approximations for the bias and variance, which can then be used to derive confidence intervals around the value function estimates. We illustrate and validate our findings using a large database describing the transaction and mailing histories for customers of a mail-order catalog firm.
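To make the effect described above concrete, the following is a minimal simulation sketch, not the paper's closed-form approximations. It assumes a fixed policy (so the process reduces to a Markov reward process), rewards that are known exactly, and transition probabilities estimated from a fixed number of sampled transitions per state. The function names (`evaluate`, `estimate_P`, `bias_and_variance`) and the two-state example are hypothetical and chosen only for illustration.

```python
import random
import statistics


def evaluate(P, r, gamma, tol=1e-10):
    """Value of a fixed policy: iterate V <- r + gamma * P V until convergence."""
    n = len(r)
    V = [0.0] * n
    while True:
        new = [r[i] + gamma * sum(P[i][j] * V[j] for j in range(n))
               for i in range(n)]
        if max(abs(a - b) for a, b in zip(new, V)) < tol:
            return new
        V = new


def estimate_P(P, n_samples, rng):
    """Empirical transition matrix from n_samples sampled transitions per state."""
    n = len(P)
    P_hat = []
    for i in range(n):
        draws = rng.choices(range(n), weights=P[i], k=n_samples)
        P_hat.append([draws.count(j) / n_samples for j in range(n)])
    return P_hat


def bias_and_variance(P, r, gamma, n_samples, n_reps, seed=0):
    """Monte Carlo estimate of the bias and variance of the plug-in value estimate."""
    rng = random.Random(seed)
    V_true = evaluate(P, r, gamma)
    estimates = [evaluate(estimate_P(P, n_samples, rng), r, gamma)
                 for _ in range(n_reps)]
    n = len(r)
    bias = [statistics.fmean(e[i] for e in estimates) - V_true[i]
            for i in range(n)]
    var = [statistics.variance([e[i] for e in estimates]) for i in range(n)]
    return V_true, bias, var


if __name__ == "__main__":
    # Hypothetical two-state chain; the numbers are illustrative only.
    P = [[0.9, 0.1], [0.2, 0.8]]
    r = [1.0, 0.0]
    for n_samples in (20, 200):
        V_true, bias, var = bias_and_variance(P, r, 0.9, n_samples, n_reps=500)
        print(n_samples, V_true, bias, var)
```

Because the value function depends nonlinearly on the transition matrix, an unbiased estimate of the transition probabilities does not in general give an unbiased value estimate. Repeating the experiment with more samples per state shows how the spread of the estimates shrinks as the data grows.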

Cite This Study

Mannor et al. (2007) studied this question.

synapsesocial.com/papers/6a1c69b64defe5c851c3c714 https://doi.org/10.1287/mnsc.1060.0614
