A two-armed bandit problem with possibility of no information

A two-armed bandit problem with possibility of no information

0.00 Avg rating0 Votes
Article ID: iaor1989792
Country: India
Volume: 10
Issue: 2
Start Page Number: 281
End Page Number: 302
Publication Date: May 1989
Journal: Journal of Information & Optimization Sciences
Authors:
Keywords: programming: dynamic
Abstract:

A sequential design problem which is also called the ‘two-armed bandit problem’ is considered under the condition that a continuous random variable is obtained from the general one-parameter distribution with probability p and no observation is obtained with probability 1-p. This problem is formulated by the principle of optimality of dynamic programming and some properties of the optimal strategy for this problem is obtained under several conditions. In the case of one arm known, the optimal strategy is derived explicitly by using the critical value function.

Reviews

Required fields are marked *. Your email address will not be published.