Article ID: | iaor19941927 |
Country: | Netherlands |
Volume: | 49 |
Issue: | 1 |
Start Page Number: | 159 |
End Page Number: | 177 |
Publication Date: | Jan 1994 |
Journal: | Stochastic Processes and Their Applications |
Authors: | Feinberg Eugene A., Park Haechurl |
This paper deals with a discrete time Markov decision model with a finite state space, arbitrary action space, and bounded reward function under the average reward criteria. The authors consider four average reward criteria and prove the existence of persistently nearly optimal strategies in various classes of strategies for models with complete state information. They show that such strategies exist in any class of strategies satisfying the following condition: along any trajectory at different epochs the controller knowns different information about the past. Though neither optimal nor stationary nearly optimal strategies may exist, the authors show that for some nonempty set of states the described nearly optimal strategies may be chosen either stationary or optimal.