Article ID: | iaor200954165 |
Country: | United States |
Volume: | 33 |
Issue: | 4 |
Start Page Number: | 932 |
End Page Number: | 944 |
Publication Date: | Nov 2008 |
Journal: | Mathematics of Operations Research |
Authors: | Asmussen Sren, Rolski Tomasz, Lipsky Lester, Fiorini Pierre, Sheahan Robert |
Keywords: | project management |
Many processes must complete in the presence of failures. Different systems respond to task failure in different ways. The system may resume a failed task from the failure point (or a saved checkpoint shortly before the failure point), it may give up on the task and select a replacement task from the ready queue, or it may restart the task. The behavior of systems under the first two scenarios is well documented, but the third (