Article ID: | iaor2000414 |
Country: | Netherlands |
Volume: | 87 |
Issue: | 1 |
Start Page Number: | 263 |
End Page Number: | 272 |
Publication Date: | Apr 1999 |
Journal: | Annals of Operations Research |
Authors: | Cao Xi-Ren, Fu Michael C., Hu Jian-Qiang |
We consider the problem of sample path-based gradient estimation for long-run (steady-state) performance measures defined on discrete-time Markov chains. We show how two estimators – one derived using the likelihood ratio method with conditional Monte Carlo and splitting, and the other derived using performance potentials and perturbation analysis – are related. In particular, one can be expressed as the conditional expectation of a suitably weighted average of the other. This demonstrates yet another connection between the two gradient estimation techniques of perturbation analysis and the likelihood ratio method.