Finite-horizon variance penalised Markov decision processes

Finite-horizon variance penalised Markov decision processes

0.00 Avg rating0 Votes
Article ID: iaor19972496
Country: Germany
Volume: 19
Issue: 1
Start Page Number: 35
End Page Number: 39
Publication Date: Jan 1997
Journal: OR Spektrum
Authors:
Abstract:

The paper considers a finite horizon Markov decision process with only terminal rewards. It describes a finite algorithm for computing a Markov deterministic policy which maximises the variance penalised reward and the paper outlines a vertex elimination algorithm which can reduce the computation involved.

Reviews

Required fields are marked *. Your email address will not be published.