Article ID: | iaor1993655 |
Country: | Germany |
Volume: | 36 |
Start Page Number: | 343 |
End Page Number: | 358 |
Publication Date: | Jul 1992 |
Journal: | Mathematical Methods of Operations Research (Heidelberg) |
Authors: | Filar J.A., Vrieze O.J. |
The authors consider Competitive Markov Decision Processes in which the controllers/players are antagonistic and aggregate their sequences of expected rewards according to ‘weighted’ or ‘horizon-sensitive’ criteria. These are either a convex combination of two discounted objectives, or of one discounted and one limiting average reward objective. In both cases the authors establish the existence of the game-theoretic value vector, and supply a description of