Asynchronous stochastic-approximation and Q-learning

Asynchronous stochastic-approximation and Q-learning

0.00 Avg rating0 Votes
Article ID: iaor1999815
Country: United States
Volume: 16
Issue: 3
Start Page Number: 185
End Page Number: 202
Publication Date: Jul 1994
Journal: Machine Learning
Authors:
Keywords: programming: dynamic, markov processes
Abstract:

We provide some general results on the convergence of a class of stochastic approximation algorithms and their parallel and asynchronous variants. We then use these results to study the Q-learning algorithm, a reinforcement learning method for solving Markov decision problems, and establish its convergence under conditions more general than previously available.

Reviews

Required fields are marked *. Your email address will not be published.