Structured policies in the sequential design of experiments

Structured policies in the sequential design of experiments

0.00 Avg rating0 Votes
Article ID: iaor1992382
Country: Switzerland
Volume: 32
Start Page Number: 165
End Page Number: 188
Publication Date: Sep 1991
Journal: Annals of Operations Research
Authors: ,
Keywords: programming: dynamic, statistics: multivariate
Abstract:

A general control model under uncertainty is considered. Using a Bayesian approach and dynamic programming, the authors investigate structural properties of optimal decision rules. In particular, they show the monotonicity of the total expected reward and of the so-called Gittins-Index. The authors extend the stopping rule and the stay-on-a-winner rule, which are well-known in bandit problems. The present approach is based on the multivariate likelihood ratio order and TP2 functions.

Reviews

Required fields are marked *. Your email address will not be published.