| Article ID: | iaor19911685 |
| Country: | Switzerland |
| Volume: | 28 |
| Start Page Number: | 185 |
| End Page Number: | 212 |
| Publication Date: | Apr 1991 |
| Journal: | Annals of Operations Research |
| Authors: | Dekker R., Hordijk A. |
In this paper the authors investigate denumerable state semi-Markov decision chains with small interest rates. They consider average and Blackwell optimality and allow for multiple closed sets and unbounded immediate rewards. The present analysis uses the existence of a Laurent series expansion for the total discounted rewards and the continuity of its terms. The assumptions are expressed in terms of a weighted supremum norm. The present method is based on an algebraic treatment of Laurent series; it constructs an appropriate linear space with a lexicographic ordering. Using two operators and a positiveness property the authors establish the existence of bounded solutions to optimality equations. The theory is illustrated with an example of a