Minimizing the learning loss in adaptive control of Markov chains under the weak accessibility condition

Minimizing the learning loss in adaptive control of Markov chains under the weak accessibility condition

0.00 Avg rating0 Votes
Article ID: iaor19932362
Country: Israel
Volume: 28
Issue: 4
Start Page Number: 779
End Page Number: 790
Publication Date: Dec 1991
Journal: Journal of Applied Probability
Authors:
Keywords: control processes, adaptive processes
Abstract:

The paper considers the adaptive control of Markov chains under the weak accessibility condition with a view to minimizing the learning loss. A certainty equivalence control with a forcing scheme is constructed. The paper uses a stationary randomized control scheme for forcing and computes a maximum likelihood estimate of the unknown parameter from the resulting observations. It obtains an exponential upper bound on the rate of decay of the probability of error. This allows the choice of the rate of forcing appropriately, whereby a o(f(n)logn) learning loss for any function f(n)•• as n•• is achieved.

Reviews

Required fields are marked *. Your email address will not be published.