On the value function in constrained control of Markov chains

On the value function in constrained control of Markov chains

0.00 Avg rating0 Votes
Article ID: iaor19972493
Country: Germany
Volume: 44
Issue: 3
Start Page Number: 387
End Page Number: 399
Publication Date: Nov 1996
Journal: Mathematical Methods of Operations Research (Heidelberg)
Authors: , ,
Abstract:

It is known that the value function in an unconstrained Markov decision process with finitely many states and actions is a piecewise rational function in the discount factor α, and that the value function can be expressed as a Laurent series expansion about α=1 for α close enough to 1. The authors show in this paper that this property also holds for the value function of Markov decision processes with additional constraints. More precisely, they show by a constructive proof that there are numbers 0=α01<ëëë<αmÅ-1m=1 such that for every j=1,2,...,m-1 either the problem is not feasible for all discount factors α in the open interval (αÅjÅ+1j) or the value function is a rational function in α in the closed interval [αÅjÅ-1j]. µAs a consequence, if the constrained problem is feasible in the neighborhood of α=1, then the value function has a Laurent series expansion about α=1. The present proof technique for the constrained case provides also a new proof for the unconstrained case.

Reviews

Required fields are marked *. Your email address will not be published.