What question did this study set out to answer?

The aim is to create a fractional gradient optimizer that utilizes wavelet transforms to improve memory efficiency in neural networks.

February 28, 2026Open Access

FAdamWav: A Fractional Wavelet Gradient Optimizer for Neural Networks

Key Points

The aim is to create a fractional gradient optimizer that utilizes wavelet transforms to improve memory efficiency in neural networks.
Developed FAdamWav combining parametric discrete wavelet transforms and fractional derivatives.
Analyzed three levels of wavelet transformations to reduce gradient memory usage.
Conducted experiments comparing FAdamWav with non-fractional and non-wavelet optimizers.
Demonstrated memory savings of 50%, 75%, or 87.5% theoretically, although actual savings are lower than expected.
Showed competitive performance of FAdamWav compared to traditional optimizers.

Abstract

The optimizer is a critical element of neural networks because it computes their optimal parameters through a training process. The Adam optimizer is considered the state of the art in deep learning. However, a drawback is the cost of storing and computing their gradients. A useful tool for addressing this issue is the application of the wavelet transform, and other relevant tool is the fractional derivative, which can be used to create fractional gradient optimizers. This research combines the wavelet transform and fractional optimizers to propose FAdamWav, a fractional version of Adam that uses (i) a parametric discrete wavelet transform to theoretically save 50%, 75% or 87.5% of gradient’s memory with one, two or three transformation levels, and (ii) a fractional gradient to optimize the neural network parameters. Experiments indicate that the saved memory is lower than the theoretical bounds, but memory is saved and fractional wavelet-based optimizers have competitive performance compared to their non-fractional and non-wavelet counterparts.

Bookmark

View Full Paper

Bookmark

View Full Paper

FAdamWav: A Fractional Wavelet Gradient Optimizer for Neural Networks

Key Points

Abstract

Cite This Study