November 1, 2002

The Complexity of Decentralized Control of Markov Decision Processes

Key Points

Key points are not available for this paper at this time.

Abstract

We consider decentralized control of Markov decision processes and give complexity bounds on the worst-case running time for algorithms that find optimal solutions. Generalizations of both the fully observable case and the partially observable case that allow for decentralized control are described. For even two agents, the finite-horizon problems corresponding to both of these models are hard for nondeterministic exponential time. These complexity results illustrate a fundamental difference between centralized and decentralized control of Markov decision processes. In contrast to the problems involving centralized control, the problems we consider provably do not admit polynomial-time algorithms. Furthermore, assuming EXP ≠ NEXP, the problems require superexponential time to solve in the worst case.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Daniel S. Bernstein

University of Massachusetts Amherst

Robert Givan

John Brown University

Neil Immerman

Tufts University

Journals

Mathematics of Operations Research

Actions

Institutions

Purdue University West Lafayette

University of Massachusetts Amherst

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

The Complexity of Decentralized Control of Markov Decision Processes

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study