Adaptive policies for time-varying stochastic systems under discounted criterion

Adaptive policies for time-varying stochastic systems under discounted criterion

0.00 Avg rating0 Votes
Article ID: iaor2003711
Country: Germany
Volume: 54
Issue: 3
Start Page Number: 419
End Page Number: 425
Publication Date: Jan 2001
Journal: Mathematical Methods of Operations Research (Heidelberg)
Authors: ,
Abstract:

We consider a class of time-varying stochastic control systems, with Borel state and action spaces, and possibly unbounded costs. The processes evolve according to a discrete-time equation xn+1 = Gn(xn,ann), n = 0,1,..., where the ζn are i.i.d. ℝk-valued random vectors whose common density is unknown, and the Gn are given functions converging, in a restricted way, to some function G as n → ∞. Assuming observability of ζn, we construct an adaptive policy which is asymptotically discounted cost optimal for the limiting control system xn+1 = G(xn,ann).

Reviews

Required fields are marked *. Your email address will not be published.