March 30, 2004

Towards stochastic conjugate gradient methods

Abstract

The method of conjugate gradients provides a very effective way to optimize large, deterministic systems by gradient descent. In its standard form, however, it is not amenable to stochastic approximation of the gradient. We explore a number of ways to adopt ideas from conjugate gradient in the stochastic setting, using fast Hessian-vector products to obtain curvature information cheaply. In our benchmark experiments the resulting highly scalable algorithms converge about an order of magnitude faster than ordinary stochastic gradient descent.
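As a rough illustration of how curvature can be obtained through Hessian-vector products alone, the sketch below runs ordinary (deterministic) conjugate gradient on a tiny quadratic without ever forming the Hessian inside the solver. It is not the paper's algorithm: the objective, the matrix A, the vector b, and the function names are hypothetical, and the product H v is approximated by a central difference of gradients rather than by the exact fast product the abstract refers to.

```python
# Illustrative sketch only: deterministic conjugate gradient on a toy quadratic,
# driven purely by Hessian-vector products. All names and values are assumptions.

def grad(w):
    """Gradient of the hypothetical objective f(w) = 0.5 w^T A w - b^T w."""
    A = [[4.0, 1.0], [1.0, 3.0]]
    b = [1.0, 2.0]
    return [sum(A[i][j] * w[j] for j in range(2)) - b[i] for i in range(2)]

def hvp(w, v, eps=1e-4):
    """Approximate H v by a central difference of gradients along v."""
    g_plus = grad([wi + eps * vi for wi, vi in zip(w, v)])
    g_minus = grad([wi - eps * vi for wi, vi in zip(w, v)])
    return [(gp - gm) / (2.0 * eps) for gp, gm in zip(g_plus, g_minus)]

def dot(x, y):
    return sum(xi * yi for xi, yi in zip(x, y))

def conjugate_gradient(w, steps):
    """Linear CG that touches curvature only through hvp()."""
    r = [-g for g in grad(w)]  # residual is the negative gradient
    d = list(r)                # first search direction
    for _ in range(steps):
        rr = dot(r, r)
        if rr < 1e-20:         # already converged
            break
        Hd = hvp(w, d)
        alpha = rr / dot(d, Hd)                          # exact step on a quadratic
        w = [wi + alpha * di for wi, di in zip(w, d)]
        r = [ri - alpha * hi for ri, hi in zip(r, Hd)]   # updated residual
        beta = dot(r, r) / rr
        d = [ri + beta * di for ri, di in zip(r, d)]     # next conjugate direction
    return w

w_star = conjugate_gradient([0.0, 0.0], steps=2)
```

Each iteration costs two extra gradient evaluations for the product, which is the sense in which curvature comes "cheaply". The sketch uses exact gradients throughout; replacing them with noisy mini-batch estimates is where the standard method stops being directly applicable, as the abstract notes.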

Cite This Study

Schraudolph et al., "Towards stochastic conjugate gradient methods."

synapsesocial.com/papers/6a1309f15bb7edc7189ea55f https://doi.org/10.1109/iconip.2002.1198180