Article ID: | iaor20105141 |
Volume: | 71 |
Issue: | 3 |
Start Page Number: | 401 |
End Page Number: | 425 |
Publication Date: | Jun 2010 |
Journal: | Mathematical Methods of Operations Research |
Authors: | Iyer Krishnamurthy, Hemachandra Nandyala |
We consider a discrete time Markov Decision Process (MDP) under the discounted payoff criterion in the presence of additional discounted cost constraints. We study the sensitivity of optimal Stationary Randomized (SR) policies in this setting with respect to the upper bound on the discounted cost constraint functionals. We show that such sensitivity analysis leads to an improved version of the Feinberg–Shwartz algorithm (1996) for finding optimal policies that are ultimately stationary and deterministic.