On the optimality of the Gittins index rule for multi-armed bandits with multiple plays

On the optimality of the Gittins index rule for multi-armed bandits with multiple plays

0.00 Avg rating0 Votes
Article ID: iaor20003685
Country: Germany
Volume: 50
Issue: 3
Start Page Number: 449
End Page Number: 461
Publication Date: Jan 1999
Journal: Mathematical Methods of Operations Research (Heidelberg)
Authors: ,
Abstract:

We investigate the general multi-armed bandit problem with multiple servers. We determine a condition on the reward processes sufficient to guarantee the optimality of the strategy that operates at each instant of time the projects with the highest Gittins indices. We call this strategy the Gittins index rule for multi-armed bandits with multiple plays, or briefly the Gittins index rule. We show by examples that: (i) the aforementioned sufficient condition is not necessary for the optimality of the Gittins index rule; and (ii) when the sufficient condition is relaxed the Gittins index rule is not necessarily optimal. Finally, we present an application of the general results to the multiserver scheduling of parallel queues without arrivals.

Reviews

Required fields are marked *. Your email address will not be published.