May 10, 2024Open Access

Gradient Descent for Noisy Optimization

Key Points

Key points are not available for this paper at this time.

Abstract

We study the use of gradient descent with backtracking line search (GD-BLS) to solve the noisy optimization problem _: =argmin㵧 Ef (, Z), imposing that the function F (): =Ef (, Z) is strictly convex. Assuming that E\|_ f (_, Z) \|²< and that objective function is locally L-smooth, we first prove that GD-BLS allows to estimate _ with an error of size O₏ (B^-0. 25), where B is the available computational budget. We then show that we can improve upon this rate by stopping the optimization process earlier when the gradient of the objective function is sufficiently close to zero, and use the residual computational budget to optimize, again with GD-BLS, a finer approximation of F. By iteratively applying this strategy J times, we establish that we can estimate _ with an error of size O₏ (B^-1{2 (1-^J) }), where (1/2, 1) is a user-specified parameter. More generally, we show that if E\|_ f (_, Z) \|^1+< for some known (0, 1] then this approach allows to learn _ with an error of size O₏ (B^-{1+ (1-^J) }), where (2/ (1+3), 1) is a tuning parameter. Beyond knowing, achieving the aforementioned convergence rates do not require to tune the algorithms parameters according to the specific functions F and f at hand, and we exhibit a simple noisy optimization problem for which stochastic gradient is not guaranteed to converge while the algorithms discussed in this work are.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Hu et al. (Fri,) studied this question.

synapsesocial.com/papers/68e6ac5ab6db64358762ebf9 https://doi.org/https://doi.org/10.48550/arxiv.2405.06539

Bookmark

View Full Paper