Average cost Markov decision processes under the hypothesis of Doeblin

Article ID:	iaor19911692
Country:	Switzerland
Volume:	29
Start Page Number:	375
End Page Number:	386
Publication Date:	Apr 1991
Journal:	Annals of Operations Research
Authors:	Kurano M.

Abstract:

Average cost Markov decision processes (MDPs) with compact state and action spaces and bounded lower semicontinuous cost functions are considered. In earlier work, Kurano treated the general case, in which the Markov process induced by any randomized stationary policy may have several ergodic classes and a transient set, under the hypothesis of Doeblin, and showed the existence of a minimum pair of state and policy. This paper considers the same case and proves some new results that yield an existence theorem for an optimal stationary policy under some reasonable conditions.
