Cooperative Markov decision processes: time consistency, greedy players satisfaction, and cooperation maintenance

Article ID: iaor20131247
Volume: 42
Issue: 1
Start Page Number: 239
End Page Number: 262
Publication Date: Feb 2013
Journal: International Journal of Game Theory
Authors: , ,
Keywords: programming: Markov decision
Abstract:

We deal with multi‐agent Markov decision processes (MDPs) in which cooperation among players is allowed. We find a cooperative payoff distribution procedure (MDP‐CPDP) that distributes, over the course of the game, the payoff that players would earn in the long‐run game. We show under which conditions such an MDP‐CPDP fulfills a time consistency property, satisfies greedy players, and strengthens coalition cohesiveness throughout the game. Finally, we refine the concept of the Core for cooperative MDPs.
