Article ID: | iaor19952246 |
Country: | Germany |
Volume: | 39 |
Start Page Number: | 257 |
End Page Number: | 288 |
Publication Date: | Jun 1994 |
Journal: | Mathematical Methods of Operations Research (Heidelberg) |
Authors: | Feinberg E.A. |
This paper deals with constrained average reward Semi-Markov Decision Processes with finite state and action sets. The paper considers two average reward criteria. The first criterion is time-average rewards, which equal the lower limits of the expected average rewards per unit time, as the horizon tends to infinity. The second criterion is ratio-average rewards, which equal the lower limits of the ratios of the expected total rewards during the first