Article ID: | iaor200972009 |
Country: | Germany |
Volume: | 70 |
Issue: | 3 |
Start Page Number: | 527 |
End Page Number: | 540 |
Publication Date: | Dec 2009 |
Journal: | Mathematical Methods of Operations Research |
Authors: | Hernández-Lerma Onésimo, Prieto-Rumeau Tomás |
This paper deals with denumerable-state continuous-time controlled Markov chains with possibly unbounded transition and reward rates. It concerns optimality criteria that refine the usual expected average reward criterion. First, we show the existence of average reward optimal policies with minimal average variance. Then we compare the variance minimization criterion with overtaking optimality. We present an example showing that the two criteria conflict, so they cannot be optimized simultaneously. This leads to a multiobjective problem for which we identify the set of Pareto optimal policies (also known as nondominated policies).
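
The notion of a Pareto optimal (nondominated) policy mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the paper's method: the policy names, the two numeric scores, and the helpers `dominates` and `pareto_front` are all hypothetical, and the sketch assumes a finite list of candidate policies, each with one score to maximize and one variance to minimize.

```python
from typing import List, NamedTuple


class Policy(NamedTuple):
    # Hypothetical stand-ins for the two competing criteria.
    name: str
    reward_score: float  # higher is better
    variance: float      # lower is better


def dominates(a: Policy, b: Policy) -> bool:
    """True if a is at least as good as b on both criteria and strictly better on one."""
    no_worse = a.reward_score >= b.reward_score and a.variance <= b.variance
    strictly_better = a.reward_score > b.reward_score or a.variance < b.variance
    return no_worse and strictly_better


def pareto_front(policies: List[Policy]) -> List[Policy]:
    """Keep the policies that no other candidate dominates."""
    return [p for p in policies if not any(dominates(q, p) for q in policies)]


# Made-up numbers, for illustration only.
candidates = [
    Policy("f1", reward_score=1.0, variance=0.5),
    Policy("f2", reward_score=2.0, variance=1.5),
    Policy("f3", reward_score=1.5, variance=2.0),
]

front = pareto_front(candidates)
print([p.name for p in front])
```

In this toy data, f3 is dominated by f2 (lower score and higher variance), while f1 and f2 trade one criterion against the other, so neither dominates and both remain on the front.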