| Article ID: | iaor200954165 |
| Country: | United States |
| Volume: | 33 |
| Issue: | 4 |
| Start Page Number: | 932 |
| End Page Number: | 944 |
| Publication Date: | Nov 2008 |
| Journal: | Mathematics of Operations Research |
| Authors: | Asmussen Søren, Rolski Tomasz, Lipsky Lester, Fiorini Pierre, Sheahan Robert |
| Keywords: | project management |
Many processes must complete in the presence of failures. Different systems respond to task failure in different ways. The system may resume a failed task from the failure point (or a saved checkpoint shortly before the failure point), it may give up on the task and select a replacement task from the ready queue, or it may restart the task. The behavior of systems under the first two scenarios is well documented, but the third (