Structured policies in the sequential design of experiments

0.00 Avg rating—0 Votes

Article ID:	iaor1992382
Country:	Switzerland
Volume:	32
Start Page Number:	165
End Page Number:	188
Publication Date:	Sep 1991
Journal:	Annals of Operations Research
Authors:	Rieder Ulrich, Wagner Hartmut
Keywords:	programming: dynamic, statistics: multivariate

Abstract:

A general control model under uncertainty is considered. Using a Bayesian approach and dynamic programming, the authors investigate structural properties of optimal decision rules. In particular, they show the monotonicity of the total expected reward and of the so-called Gittins-Index. The authors extend the stopping rule and the stay-on-a-winner rule, which are well-known in bandit problems. The present approach is based on the multivariate likelihood ratio order and TP2 functions.

Reviews

Required fields are marked *. Your email address will not be published.