| Article ID: | iaor2000455 |
| Country: | United Kingdom |
| Volume: | 35 |
| Issue: | 3 |
| Start Page Number: | 589 |
| End Page Number: | 599 |
| Publication Date: | Sep 1998 |
| Journal: | Journal of Applied Probability |
| Authors: | Cooper William L. |
| Keywords: | markov processes |
Given a sequence of random variables (rewards), the Haviv–Puterman differential equation relates the expected infinite-horizon λ-discounted reward and the expected total reward up to a random time that is determined by an independent negative binomial random variable with parameters 2 and λ. This paper provides an interpretation of this proven, but previously unexplained, result. Furthermore, the interpretation is formalized into a new proof, which then yields new results for the general case where the rewards are accumulated up to a time determined by an independent negative binomial random variable with parameters