Article ID: | iaor20032450 |
Country: | United States |
Volume: | 47 |
Issue: | 9 |
Start Page Number: | 1235 |
End Page Number: | 1251 |
Publication Date: | Sep 2001 |
Journal: | Management Science |
Authors: | Lauritzen Steffen L., Nilsson Dennis |
We introduce the notion of a LImited Memory Influence Diagram (LIMID) to describe multistage decision problems in which the traditional assumption of no forgetting is relaxed. This can be relevant in situations with multiple decision makers or when decisions must be prescribed under memory constraints, such as in partially observed Markov decision processes (POMDPs). We give an algorithm for improving any given strategy by local computation of single policy updates and investigate conditions for the resulting strategy to be optimal.
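The idea of improving a strategy one policy at a time can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the article: the toy model (a binary chance variable `X`, a decision `D1` that observes `X`, and a decision `D2` that observes only `D1`), the utility function, and all function names. The sketch evaluates expected utility by brute-force enumeration rather than by the local computation the abstract refers to.

```python
# Hypothetical toy LIMID: chance variable X, decision D1 sees X,
# decision D2 sees only D1 (X itself is not remembered).
P_X = {0: 0.5, 1: 0.5}
ACTIONS = (0, 1)


def utility(x, d1, d2):
    # Assumed utility: reward 1 when the second decision matches X.
    return 1.0 if d2 == x else 0.0


def expected_utility(policies):
    # Brute-force enumeration over X under deterministic policies.
    total = 0.0
    for x, p in P_X.items():
        d1 = policies["D1"][x]    # D1 is a function of X
        d2 = policies["D2"][d1]   # D2 is a function of D1 only
        total += p * utility(x, d1, d2)
    return total


def update_single_policy(policies, name):
    # Re-optimize one decision's policy while all others stay fixed.
    improved = {k: dict(v) for k, v in policies.items()}
    for config in improved[name]:
        best_action = improved[name][config]
        best_value = expected_utility(improved)
        for action in ACTIONS:
            improved[name][config] = action
            value = expected_utility(improved)
            if value > best_value:  # strict improvement only; ties keep the old action
                best_action, best_value = action, value
        improved[name][config] = best_action
    return improved


def single_policy_updating(policies, order=("D2", "D1"), max_sweeps=10):
    # Sweep over the decisions until a full sweep yields no improvement.
    current = policies
    for _ in range(max_sweeps):
        before = expected_utility(current)
        for name in order:
            current = update_single_policy(current, name)
        if expected_utility(current) <= before:
            break
    return current


# Start: D1 copies X, D2 always plays 0.
start = {"D1": {0: 0, 1: 1}, "D2": {0: 0, 1: 0}}
result = single_policy_updating(start)
print(expected_utility(start), expected_utility(result))
```

In this toy model the update to `D2` learns to copy `D1`, raising the expected utility from 0.5 to 1.0. If the starting strategy instead has `D1` ignore `X`, the same loop stays at 0.5, since no single policy change improves the result. This is one way to see why the optimality of the resulting strategy requires separate conditions.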