Application of dynamic-programming to high-dimensional nonlinear optimal-control problems

0.00 Avg rating—0 Votes

Article ID:	iaor19962231
Country:	United States
Volume:	52
Issue:	1
Start Page Number:	239
End Page Number:	250
Publication Date:	Jul 1990
Journal:	International Journal of Control
Authors:	Luus R.
Keywords:	control, queues: applications

Abstract:

The paper considers the problem of routing N arrivals to M queues in parallel when no information other than the prior statistics are available for the decision process. The work it presents here differs from results in the literature in two significant ways: (a) the paper addresses the nonstationary problem of open-loop routing for N<• arrivals, and (b) it does not decompose the problem into an allocation phase and a routing phase, but look instead for an overall optimal policy. Exhaustive enumeration and dynamic programming turn out to be computationally infeasible approaches for realistic values of the parameters N an M. Examination of the necessary conditions for optimality as given by Pontryagin’s maximum principle leads to the formulation of a policy iteration algorithm. Convergence of the algorithm in a finite number of iterations, as well as monotonicity results, are established. While only convergence to a local minimum is guaranteed, extensive computational experimentation points to its near-optimality. Some variants of the basic policy iteration algorithm are also discussed.

Reviews

Required fields are marked *. Your email address will not be published.