January 2, 2003

Learning rate schedules for faster stochastic gradient search

Key Points

Key points are not available for this paper at this time.

Abstract

The authors propose a new methodology for creating the first automatically adapting learning rates that achieve the optimal rate of convergence for stochastic gradient descent. Empirical tests agree with theoretical expectations that drift can be used to determine whether the crucial parameter c is large enough. Using this statistic, it will be possible to produce the first adaptive learning rates which converge at optimal speed.>

Mark Helpful

Bookmark

Relay