Article ID: | iaor20001038 |
Country: | Germany |
Volume: | 49 |
Issue: | 1 |
Start Page Number: | 97 |
End Page Number: | 109 |
Publication Date: | Jan 1999 |
Journal: | Mathematical Methods of Operations Research (Heidelberg) |
Authors: | Filar J.A., Altman Eitan, Avrachenkov K.E. |
In this paper we consider a singularly perturbed Markov decision process with finitely many states and actions and the limiting expected average reward criterion. We make no assumptions about the underlying ergodic structure. We present algorithms for the computation of a uniformly optimal deterministic control, that is, a control which is optimal for all values of the perturbation parameter that are sufficiently small. Our algorithms are based on Jeroslow's Asymptotic Linear Programming.