Article ID: | iaor20041742 |
Country: | United States |
Volume: | 20 |
Issue: | 4 |
Start Page Number: | 923 |
End Page Number: | 936 |
Publication Date: | Nov 1995 |
Journal: | Mathematics of Operations Research |
Authors: | Borkar V.S., Bhatnagar S. |
The ergodic control problem for semi-Markov processes is reformulated as an optimization problem over the set of suitably defined ‘ergodic occupation measures’. This set is shown to be closed and convex, with its extreme points corresponding to stationary strategies. This leads to the existence of optimal stationary strategies under additional hypotheses. A pathwise analysis of the joint empirical occupation measures of the state and control processes shows that this optimality is in the strong (i.e., almost sure) sense.