A Learning Algorithm for Risk-Sensitive Cost

A Learning Algorithm for Risk-Sensitive Cost

0.00 Avg rating0 Votes
Article ID: iaor200954193
Country: United States
Volume: 33
Issue: 4
Start Page Number: 880
End Page Number: 898
Publication Date: Nov 2008
Journal: Mathematics of Operations Research
Authors: , ,
Keywords: learning
Abstract:

A linear function approximation–based reinforcement learning algorithm is proposed for Markov decision processes with infinite horizon risk–sensitive cost. Its convergence is proved using the “ODE” (Ordinary Differential Equation ) method for stochastic approximation. The scheme is also extended to continuous state space processes.

Reviews

Required fields are marked *. Your email address will not be published.