Journal: Machine Learning

Found 4 papers in total
A reinforcement learning algorithm based on policy iteration for average reward: Empirical results with yield management and convergence analysis
2004,
We present a Reinforcement Learning (RL) algorithm based on policy iteration for...
Asynchronous stochastic-approximation and Q-learning
1994,
We provide some general results on the convergence of a class of stochastic...
An upper bound on the loss from approximate optimal-value functions
1994,
Many reinforcement learning approaches can be formulated using the theory of Markov...
Toward efficient agnostic learning
1994,
In this paper we initiate an investigation of generalizations of the Probably...
Papers per page: