The critical discount factor for finite Markovian decision processes with an absorbing set

The critical discount factor for finite Markovian decision processes with an absorbing set

0.00 Avg rating0 Votes
Article ID: iaor20041141
Country: Germany
Volume: 57
Issue: 1
Start Page Number: 1
End Page Number: 19
Publication Date: Jan 2003
Journal: Mathematical Methods of Operations Research (Heidelberg)
Authors: ,
Abstract:

This paper deals with a Markovian decision process with an absorbing set J0. We are interested in the largest number β* ≥ 1, called the critical discount factor, such that for all discount factors β smaller than β* the limit V of the N-stage value function VN for N → ∞ exists and is finite for each choice of the one-stage reward function. Several representations of β* are given. The equality of 1/β* with the maximal Perron/Frobenius eigenvalue of the MDP links our problem and our results to topics studied intensively (mostly for β = 1) in the literature. We derive in a unified way a large number of conditions, some of which are known, which are equivalent either to β < β* or to β* < 1. In particular, the latter is equivalent to transience of the MDP. A few of our findings are extended with the aid of results in Rieder to models with standard Borel state and action space. We also complement an algorithm of policy iteration type, due to Mandl and Seneta, for the computation of β*. Finally we determine β* explicitly in two models with stochastically monotone transition law.

Reviews

Required fields are marked *. Your email address will not be published.