Average cost optimal stationary policies in infinite state Markov decision processes with unbounded costs

Average cost optimal stationary policies in infinite state Markov decision processes with unbounded costs

0.00 Avg rating0 Votes
Article ID: iaor1989299
Country: United States
Volume: 37
Issue: 4
Start Page Number: 626
End Page Number: 633
Publication Date: Jul 1989
Journal: Operations Research
Authors:
Keywords: programming: dynamic
Abstract:

This paper deals with infinite state Markov decision processes with unbounded costs. Three simple conditions, based on the optimal discounted value function, guarantee the existence of an expected average cost optimal stationary policy. Sufficient conditions are the existence of a distinguished state of smallest discounted value and a single stationary policy inducing an irreducible, ergodic Markov chain for which the average cost of a first passage from any state to the distinguished state is finite. A result to verify this is also given. Two examples illustrate the ease of applying the criteria.

Reviews

Required fields are marked *. Your email address will not be published.