A learning algorithm for discrete-time stochastic control

A learning algorithm for discrete-time stochastic control

0.00 Avg rating0 Votes
Article ID: iaor20018
Country: United States
Volume: 14
Issue: 2
Start Page Number: 243
End Page Number: 258
Publication Date: Jan 2000
Journal: Probability in the Engineering and Informational Sciences
Authors:
Abstract:

A simulation-based algorithm for learning good policies for a discrete-time stochastic control process with unknown transition law is analyzed when the state and action spaces are compact subsets of Euclidean spaces. This extends the Q-learning scheme of discrete state/action problems along the lines of Baker. Almost sure convergence is proved under suitable conditions.

Reviews

Required fields are marked *. Your email address will not be published.