Article ID: | iaor20173082 |
Volume: | 65 |
Issue: | 4 |
Start Page Number: | 1078 |
End Page Number: | 1094 |
Publication Date: | Aug 2017 |
Journal: | Operations Research |
Authors: | Scheller-Wolf Alan, Harchol-Balter Mor, Gardner Kristen, Velednitsky Mark, Zbarsky Samuel |
Keywords: | networks: flow, design, simulation, performance, queues: applications, internet |
Redundancy is an important strategy for reducing response time in multi‐server distributed queueing systems. This strategy has been used in a variety of settings, but only recently have researchers begun analytical studies. The idea behind redundancy is that customers can greatly reduce response time by waiting in multiple queues at the same time, thereby experiencing the minimum time across queues. Redundancy has been shown to produce significant response time improvements in applications ranging from organ transplant waitlists to Google’s BigTable service. However, despite the growing body of theoretical and empirical work on the benefits of redundancy, there is little work addressing the questions of how many copies one needs to make to achieve a response time benefit, and the magnitude of the potential gains. In this paper we propose a theoretical model and dispatching policy to evaluate these questions. Our system consists of