Article ID: | iaor1992382 |
Country: | Switzerland |
Volume: | 32 |
Start Page Number: | 165 |
End Page Number: | 188 |
Publication Date: | Sep 1991 |
Journal: | Annals of Operations Research |
Authors: | Rieder Ulrich, Wagner Hartmut |
Keywords: | programming: dynamic, statistics: multivariate |
A general control model under uncertainty is considered. Using a Bayesian approach and dynamic programming, the authors investigate structural properties of optimal decision rules. In particular, they show the monotonicity of the total expected reward and of the so-called Gittins-Index. The authors extend the stopping rule and the stay-on-a-winner rule, which are well-known in bandit problems. The present approach is based on the multivariate likelihood ratio order and