☆ 4.4 Article

The complexity of decentralized control of Markov decision processes

MATHEMATICS OF OPERATIONS RESEARCH (2002)

Journal

MATHEMATICS OF OPERATIONS RESEARCH

Volume 27, Issue 4, Pages 819-840

Publisher

INFORMS

DOI: 10.1287/moor.27.4.819.297

Keywords

computational complexity; Markov decision process; decentralized control

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

We consider decentralized control of Markov decision processes and give complexity bounds on the worst-case running time for algorithms that find optimal solutions. Generalizations of both the fully observable case and the partially observable case that allow for decentralized control are described. For even two agents, the finite-horizon problems corresponding to both of these models are hard for nondeterministic exponential time. These complexity results illustrate a fundamental difference between centralized and decentralized control of Markov decision processes. In contrast to the problems involving centralized control, the problems we consider provably do not admit polynomial-time algorithms. Furthermore, assuming EXP not equal NEXP, the problems require superexponential time to solve in the worst case.

The complexity of decentralized control of Markov decision processes

Journal

MATHEMATICS OF OPERATIONS RESEARCH

Publisher

INFORMS

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

The complexity of decentralized control of Markov decision processes

Journal

MATHEMATICS OF OPERATIONS RESEARCH

Publisher

INFORMS

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper