Weighted reward criteria in Competitive Markov Decision Processes

0.00 Avg rating—0 Votes

Article ID:	iaor1993655
Country:	Germany
Volume:	36
Start Page Number:	343
End Page Number:	358
Publication Date:	Jul 1992
Journal:	Mathematical Methods of Operations Research (Heidelberg)
Authors:	Filar J.A., Vrieze O.J.

Abstract:

The authors consider Competitive Markov Decision Processes in which the controllers/players are antagonistic and aggregate their sequences of expected rewards according to ‘weighted’ or ‘horizon-sensitive’ criteria. These are either a convex combination of two discounted objectives, or of one discounted and one limiting average reward objective. In both cases the authors establish the existence of the game-theoretic value vector, and supply a description of •-optimal non-stationary strategies.

Reviews

Required fields are marked *. Your email address will not be published.