The complexity of decentralized control of Markov decision processes

0.00 Avg rating—0 Votes

Article ID:	iaor2004694
Country:	United States
Volume:	27
Issue:	4
Start Page Number:	819
End Page Number:	840
Publication Date:	Nov 2002
Journal:	Mathematics of Operations Research
Authors:	Bernstein Daniel S., Givan Robert, Immerman Neil, Zilberstein Shlomo

Abstract:

We consider decentralized control of Markov decision processes and give complexity bounds on the worst-case running time for algorithms that find optimal solutions. Generalizations of both the fully observable case and the partially observable case that allow for decentralized control are described. For even two agents, the finite-horizon problems corresponding to both of these models are hard for nondeterministic exponential time. These complexity results illustrate a fundamental difference between centralized and decentralized control of Markov decision processes. In contrast to the problems involving centralized control, the problems provably do not admit polynomial-time algorithms. Futhermore, assuming EXP ≠ NEXP, the problems require suprex potential time to solve in the worst case.

Reviews

Required fields are marked *. Your email address will not be published.