Article ID: | iaor19911685 |
Country: | Switzerland |
Volume: | 28 |
Start Page Number: | 185 |
End Page Number: | 212 |
Publication Date: | Apr 1991 |
Journal: | Annals of Operations Research |
Authors: | Dekker R., Hordijk A. |
In this paper the authors investigate denumerable state semi-Markov decision chains with small interest rates. They consider average and Blackwell optimality and allow for multiple closed sets and unbounded immediate rewards. The present analysis uses the existence of a Laurent series expansion for the total discounted rewards and the continuity of its terms. The assumptions are expressed in terms of a weighted supremum norm. The present method is based on an algebraic treatment of Laurent series; it constructs an appropriate linear space with a lexicographic ordering. Using two operators and a positiveness property the authors establish the existence of bounded solutions to optimality equations. The theory is illustrated with an example of a