Article ID: | iaor1988236 |
Country: | Netherlands |
Volume: | 42 |
Issue: | 2 |
Start Page Number: | 273 |
End Page Number: | 306 |
Publication Date: | Nov 1988 |
Journal: | Mathematical Programming |
Authors: | Byrd Richard H., Schnabel Robert B., Schultz Gerald A. |
Keywords: | parallel processing |
We discuss methods for solving the unconstrained optimization problem on parallel computers, when the number of variables is sufficiently small that quasi-Newton methods can be used. We concentrate mainly, but not exclusively, on problems where function evaluation is expensive. First we discuss ways to parallelize both the function evaluation costs and the linear algebra calculations in the standard sequential secant method, the BFGS method. Then we discuss new methods that are appropriate when there are enough processors to evaluate the function, gradient, and part but not all of the Hessian at each iteration. We develop new algorithms that utilize this information and analyze their convergence properties. We present computational experiments showing that they are superior to parallelization either the BFGS methods or Newton’s method under our assumptions on the number of processors and cost of functions evaluation. Finally we discuss ways to effectively utilize the gradient values at unsuccessful trial points that are available in our parallel methods and also in some sequential software packages.