Adaptive control for discrete-time Markov processes with unbounded costs: Average criterion

Adaptive control for discrete-time Markov processes with unbounded costs: Average criterion

0.00 Avg rating0 Votes
Article ID: iaor19992537
Country: Germany
Volume: 48
Issue: 1
Start Page Number: 37
End Page Number: 55
Publication Date: Jan 1998
Journal: Mathematical Methods of Operations Research (Heidelberg)
Authors: ,
Abstract:

The paper deals with a class of discrete-time Markov control processes with Borel state and action spaces, and possibly unbounded one-stage costs. The processes are given by recurrent equations xt+1 = F(xt,att), t = 1, 2, ... with independent, identically distributed ℜk-valued random vectors ξt whose density p is unknown. Assuming observability of ξt, and taking advantage of the procedure of statistical estimation of p used in a previous work by the authors, we construct an average cost optimal adaptive policy.

Reviews

Required fields are marked *. Your email address will not be published.