Target-level criterion in Markov decision-processes

Target-level criterion in Markov decision-processes

0.00 Avg rating0 Votes
Article ID: iaor19962233
Country: United States
Volume: 86
Issue: 1
Start Page Number: 1
End Page Number: 15
Publication Date: Jan 1995
Journal: Journal of Optimization Theory and Applications
Authors: ,
Abstract:

The Markov decision process is studied under the maximization of the probability that total discounted rewards exceed a target level. The authors focus on and study the dynamic programming equations of the model. They give various properties of the optimal return operator and, for the infinite planning-horizon model, the authors characterize the optimal value function as a maximal fixed point of the previous operator. Various turnpike results relating the finite and infinite-horizon models are also given.

Reviews

Required fields are marked *. Your email address will not be published.