Continuity of the optimal average cost in Markov decision chains with small risk-sensitivity

Continuity of the optimal average cost in Markov decision chains with small risk-sensitivity

0.00 Avg rating0 Votes
Article ID: iaor201526335
Volume: 81
Issue: 3
Start Page Number: 269
End Page Number: 298
Publication Date: Jun 2015
Journal: Mathematical Methods of Operations Research
Authors: , ,
Keywords: programming: markov decision, risk, optimization
Abstract:

This note concerns discrete‐time controlled Markov chains driven by a decision maker with constant risk‐sensitivity λ equ1 . Assuming that the system evolves on a denumerable state space and is endowed with a bounded cost function, the paper analyzes the continuity of the optimal average cost with respect to the risk‐sensitivity parameter, a property that is promptly seen to be valid at each no‐null value of λ equ2 . Under standard continuity‐compactness conditions, it is shown that a general form of the simultaneous Doeblin condition allows to establish the continuity of the optimal average cost at λ = 0 , and explicit examples are given to show that, even if every state is positive recurrent under the action of any stationary policy, the above continuity conclusion can not be ensured under weaker recurrence requirements, as the Lyapunov function condition.

Reviews

Required fields are marked *. Your email address will not be published.