Article ID: iaor20002940
Country: Netherlands
Volume: 11
Issue: 1
Start Page Number: 23
End Page Number: 36
Publication Date: Oct 1998
Journal: Computational Optimization and Applications
Authors: Solodov M.V.
Keywords: neural networks
We consider the class of incremental gradient methods for minimizing a sum of continuously differentiable functions. An important novel feature of our analysis is that the stepsizes are kept bounded away from zero. We derive the first convergence results of any kind for this computationally important case. In particular, we show that a certain ϵ-approximate solution can be obtained and establish the linear dependence of ϵ on the stepsize limit. Incremental gradient methods are particularly well-suited for large neural network training problems where obtaining an approximate solution is typically sufficient and is often preferable to computing an exact solution. Thus, in the context of neural networks, the approach presented here is related to the principle of tolerant training. Our results justify numerous stepsize rules that were derived on the basis of extensive numerical experimentation but for which no theoretical analysis was previously available. In addition, convergence to (exact) stationary points is established when the gradient satisfies a certain growth property.
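To make the setting concrete, the following is a minimal sketch (not the paper's algorithm or analysis) of a cyclic incremental gradient iteration with a constant stepsize, applied to an illustrative least-squares sum f(x) = Σ_i ½(aᵢᵀx − bᵢ)². The problem data, the stepsize value, the iteration budget, and the function names are assumptions chosen for illustration only.

```python
# Minimal sketch: cyclic incremental gradient with a constant stepsize,
# on an illustrative least-squares problem f(x) = sum_i 0.5*(a_i^T x - b_i)^2.
# All data, the stepsize, and the pass count are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)
m, n = 200, 5                       # number of component functions, dimension
A = rng.standard_normal((m, n))     # rows a_i define the components f_i
b = rng.standard_normal(m)

def grad_component(x, i):
    """Gradient of the single component f_i(x) = 0.5 * (a_i^T x - b_i)^2."""
    return (A[i] @ x - b[i]) * A[i]

def incremental_gradient(x0, stepsize=1e-2, n_passes=100):
    """Cycle through the components, taking one gradient step per component.

    The stepsize is held constant (bounded away from zero), so the iterates
    are only expected to reach a neighborhood of a stationary point whose
    size scales with the stepsize, in the spirit of the epsilon-approximate
    convergence result described in the abstract.
    """
    x = x0.copy()
    for _ in range(n_passes):
        for i in range(m):
            x -= stepsize * grad_component(x, i)
    return x

x = incremental_gradient(np.zeros(n))
full_grad = A.T @ (A @ x - b)       # gradient of the full sum at the final iterate
print("||grad f(x)|| =", np.linalg.norm(full_grad))
```

Because the stepsize never shrinks, the norm of the full gradient in such a sketch typically plateaus at a level that grows with the stepsize rather than vanishing, which is consistent with the linear dependence of ϵ on the stepsize limit stated above.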