Article ID: | iaor19911681 |
Country: | Switzerland |
Volume: | 28 |
Start Page Number: | 101 |
End Page Number: | 134 |
Publication Date: | Apr 1991 |
Journal: | Annals of Operations Research |
Authors: | Altman E., Shwartz A. |
The authors consider the constrained optimization of a finite-state, finite action Markov chain. In the adaptive problem, the transition probabilities are assumed to be unknown, and no prior distribution on their values is given. The authors consider constrained optimization problems in terms of several cost criteria which are asymptotic in nature. For these criteria they show that it is possible to achieve the same optimal cost as in the non-adaptive case. The authors first formulate a constrained optimization problem under each of the cost criteria and establish the existence of optimal stationary policies. Since the adaptive problem is inherently non-stationary, they suggest a class of