Article ID: | iaor1992275 |
Country: | Switzerland |
Volume: | 32 |
Start Page Number: | 215 |
End Page Number: | 230 |
Publication Date: | Aug 1991 |
Journal: | Annals of Operations Research |
Authors: | White Chelsea C. |
The paper surveys several computational procedures for the partially observed Markov decision process (POMDP) that have been developed since the Monahan survey was published in 1982. The POMDP generalizes the standard, completely observed Markov decision process by permitting the possibility that state observations may be noise-corrupted and/or costly. Several computational procedures presented are convergence accelerating variants of, or approximations to, the Smallwood-Sondik algorithm. Finite-memory suboptimal design results are reported, and new research directions involving heuristic search are discussed.