Time aggregated Markov decision processes via standard dynamic programming

Article ID:	iaor20115254
Volume:	39
Issue:	3
Start Page Number:	193
End Page Number:	197
Publication Date:	May 2011
Journal:	Operations Research Letters
Authors:	Arruda Edilson F, Fragoso Marcelo D
Keywords:	programming: dynamic

Abstract:

This note addresses the time aggregation approach to ergodic finite-state Markov decision processes (MDPs) with uncontrollable states. We propose using time aggregation as an intermediate step toward constructing a transformed MDP whose state space consists solely of the controllable states. The proposed approach simplifies the iterative search for the optimal solution by eliminating the need to define an equivalent parametric function, and it yields a problem that can be solved by simpler, standard MDP algorithms.
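As a rough illustration of the time aggregation step only, the sketch below collapses the uncontrollable states of a tiny MDP into per-cycle quantities seen from the controllable states: the probability of next landing in each controllable state, the expected cost accumulated on the way, and the expected number of steps taken. It is not the authors' transformation to a standard MDP, which the abstract does not spell out. The function names (`solve`, `aggregate`), the data layout, and the three-state example are all hypothetical, and the sketch assumes that transitions out of uncontrollable states do not depend on the action.

```python
def solve(A, B):
    """Solve A X = B by Gauss-Jordan elimination with partial pivoting."""
    n = len(A)
    M = [list(map(float, A[r])) + list(map(float, B[r])) for r in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        if abs(M[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular")
        M[col], M[pivot] = M[pivot], M[col]
        pv = M[col][col]
        M[col] = [v / pv for v in M[col]]
        for r in range(n):
            if r != col and M[r][col] != 0.0:
                f = M[r][col]
                M[r] = [vr - f * vc for vr, vc in zip(M[r], M[col])]
    return [row[n:] for row in M]


def aggregate(P_ctrl, c_ctrl, P_unc, c_unc, C, U):
    """Per-cycle model on the controllable states C (hypothetical layout).

    P_ctrl[(i, a)] -- transition row over all states, for i in C, action a
    c_ctrl[(i, a)] -- one-step cost of action a in controllable state i
    P_unc[u]       -- transition row over all states, for u in U
    c_unc[u]       -- one-step cost in uncontrollable state u
    Returns {(i, a): (probs over C, expected cycle cost, expected cycle length)}.
    """
    # Solve (I - P_UU) X = [P_UC | c_U | 1] in one pass.
    lhs = [[(1.0 if u == v else 0.0) - P_unc[u][v] for v in U] for u in U]
    rhs = [[P_unc[u][j] for j in C] + [c_unc[u], 1.0] for u in U]
    sol = solve(lhs, rhs)
    n_c = len(C)
    hit = {u: row[:n_c] for u, row in zip(U, sol)}      # first controllable state hit
    cost = {u: row[n_c] for u, row in zip(U, sol)}      # cost until that hit
    steps = {u: row[n_c + 1] for u, row in zip(U, sol)}  # steps until that hit

    model = {}
    for (i, a), p in P_ctrl.items():
        probs = [p[j] + sum(p[u] * hit[u][k] for u in U)
                 for k, j in enumerate(C)]
        cycle_cost = c_ctrl[(i, a)] + sum(p[u] * cost[u] for u in U)
        cycle_len = 1.0 + sum(p[u] * steps[u] for u in U)
        model[(i, a)] = (probs, cycle_cost, cycle_len)
    return model


# Hypothetical example: states 0 and 1 are controllable, state 2 is not.
C, U = [0, 1], [2]
P_ctrl = {(0, "a"): [0.0, 0.5, 0.5], (0, "b"): [0.0, 1.0, 0.0],
          (1, "a"): [1.0, 0.0, 0.0]}
c_ctrl = {(0, "a"): 1.0, (0, "b"): 2.0, (1, "a"): 0.0}
P_unc = {2: [0.5, 0.0, 0.5]}
c_unc = {2: 2.0}

model = aggregate(P_ctrl, c_ctrl, P_unc, c_unc, C, U)
for key in sorted(model):
    print(key, model[key])
```

The linear solve relies on `I - P_UU` being invertible, which holds when every uncontrollable state eventually leads to a controllable one, as the ergodicity assumption in the abstract suggests. Because cycles have different expected lengths, the long-run average cost of a policy on this aggregated model is a ratio of expected cycle cost to expected cycle length rather than a plain per-step average.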
