Los puntos clave no están disponibles para este artículo en este momento.
We consider a broad class of stochastic dynamic programming problems that are amenable to relaxation via decomposition. These problems comprise multiple subproblems that are independent of each other except for a collection of coupling constraints on the action space. We fit an additively separable value function approximation using two techniques, namely, Lagrangian relaxation and the linear programming (LP) approach to approximate dynamic programming. We prove various results comparing the relaxations to each other and to the optimal problem value. We also provide a column generation algorithm for solving the LP-based relaxation to any desired optimality tolerance, and we report on numerical experiments on bandit-like problems. Our results provide insight into the complexity versus quality trade-off when choosing which of these relaxations to implement.
Building similarity graph...
Analyzing shared references across papers
Loading...
Daniel Adelman
University of Minnesota
Adam J. Mersereau
University of North Carolina at Chapel Hill
Operations Research
University of Chicago
University of North Carolina at Chapel Hill
Building similarity graph...
Analyzing shared references across papers
Loading...
Adelman et al. (Tue,) studied this question.
synapsesocial.com/papers/6a0f4172a00258d2006cbbe4 — DOI: https://doi.org/10.1287/opre.1070.0445