Article ID: | iaor1988282 |
Country: | United States |
Volume: | 13 |
Issue: | 3 |
Start Page Number: | 395 |
End Page Number: | 420 |
Publication Date: | Aug 1988 |
Journal: | Mathematics of Operations Research |
Authors: | Dekker Rommert, Hordijk Arie |
In this paper the authors consider a (discrete-time) Markov decision chain with a denumerable state space and compact action sets and they assume that for all states the rewards and transition probabilities depend continuously on the actions. The first objective of this paper is to develop an analysis for average optimality without assuming a special Markov chain structure. In doing so, the authors present a set of conditions guaranteeing average optimality, which are automatically fulfilled in the finite state and action model. The second objective is to study simultaneously average and discount optimality as Veinott did for the finite state and action model. The authors investigate the concepts of