Markov decision models with weighted discounted criteria

Markov decision models with weighted discounted criteria

0.00 Avg rating0 Votes
Article ID: iaor19941898
Country: United States
Volume: 19
Issue: 1
Start Page Number: 152
End Page Number: 168
Publication Date: Feb 1994
Journal: Mathematics of Operations Research
Authors: ,
Keywords: stochastic processes
Abstract:

The authors consider a discrete time Markov Decision Process with infinite horizon. The criterion to be maximized is the sum of a number of standard discounted rewards, each with a different discount factor. Situations in which such criteria arise include modeling investmnts, production, modeling projects of different durations and systems with multiple criteria, and some axiomatic formulations of multi-attribute preference theory. The authors show that for this criterion for some positive there need not exist an -optimal (randomized) stationary strategy, even when the state and action sets are finite. However, ∈-optimal Markov (nonrandomized) strategies and optimal Markov strategies exist under weak conditions. The authors exhibit ∈-optimal Markov strategies which are stationary from some time onward. When both state and action spaces are finite, there exists an optimal Markov strategy with this property. The authors provide an explicit algorithm for the computation of such strategies and give a description of the set of optimal strategies.

Reviews

Required fields are marked *. Your email address will not be published.