A dynamic programming algorithm for the optimal control of piecewise deterministic Markov processes

A dynamic programming algorithm for the optimal control of piecewise deterministic Markov processes

0.00 Avg rating0 Votes
Article ID: iaor20023729
Country: United States
Volume: 40
Issue: 2
Start Page Number: 525
End Page Number: 539
Publication Date: Aug 2001
Journal: SIAM Journal on Control and Optimization
Authors:
Keywords: markov processes
Abstract:

A piecewise deterministic Markov process is a continuous time Markov process consisting of continuous, deterministic trajectories interrupted by random jumps. The trajectories may be controlled with the object of minimizing the expected costs associated with the process. A method of representing this controlled PDP as a discrete time decision process is presented, allowing the value function for the problem to be expressed as the fixed point of a dynamic programming operator. Decisions take the form of trajectory segments. The expected costs may then be minimized through a dynamic programming algorithm, rather than through the solution of the Bellman Hamilton Jacobi equation, assuming the trajectory segments are numerically tractable. The technique is applied to the optimal capacity expansion problem, that is, the problem of planning the construction of new production facilities to meet rising demand.

Reviews

Required fields are marked *. Your email address will not be published.