What does this research mean for the field?

Standard deterministic Sharpness-Aware Minimization (SAM) becomes trapped in a stable period-2 limit cycle on quadratic loss surfaces, but scaling the perturbation with the gradient norm (Gradient-Scaled SAM) restores linear convergence to the exact global optimum. Novelty: novel finding. Consensus alignment: neutral.

What question did this study set out to answer?

This study aims to analyze the convergence properties of Sharpness-Aware Minimization (SAM), focusing on its tendency to get trapped in limit cycles.

June 9, 2026 | Open Access

On the Spectral Dynamics and Eigenspace-Trapped Limit Cycles of Sharpness-Aware Minimization

Key Points

Conducted a mathematical and empirical analysis of SAM convergence on data-driven quadratic loss surfaces.
Proved the presence of a stable period-2 limit cycle under anisotropic Hessian curvature.
Analyzed Gradient-Scaled SAM (GS-SAM) and showed that it restores linear convergence to the exact global optimum.
Demonstrated that standard SAM is trapped in a limit cycle whose exact radius R depends on the learning rate, the perturbation radius, and the maximum eigenvalue of the Hessian.
Proved that perturbations along sub-dominant eigenspaces decay at a rate governed by an analytical amplification factor.
Validated theoretical predictions via numerical simulations.
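
This summary does not reproduce the paper's formulas. As a rough illustration of where a closed-form radius and a transverse decay factor can come from, the sketch below works through the textbook normalized SAM update on a quadratic loss. The update rule, the symbols (eta for the learning rate, rho for the perturbation radius, lambda for the maximum Hessian eigenvalue, mu for a sub-dominant eigenvalue), and the resulting expressions are assumptions of this sketch. The paper's own derivation may be parameterized differently.

```latex
% Assumed setting: L(w) = (1/2) w^T H w with H symmetric positive definite.
\begin{align*}
% Normalized SAM step (textbook form, assumed here)
w_{t+1} &= w_t - \eta H\!\left(w_t + \rho\,\frac{H w_t}{\lVert H w_t\rVert}\right) \\
% Restriction to the dominant eigenvector, coordinate x, eigenvalue \lambda
x_{t+1} &= (1-\eta\lambda)\,x_t - \eta\lambda\rho\,\operatorname{sign}(x_t) \\
% Period-2 condition x_{t+1} = -x_t with |x_t| = R
-R &= (1-\eta\lambda)\,R - \eta\lambda\rho
   \;\Longrightarrow\;
   R = \frac{\eta\rho\lambda}{2-\eta\lambda}, \qquad 0 < \eta\lambda < 2 \\
% Small transverse component y along an eigenvector with eigenvalue \mu < \lambda,
% using \lVert H w_t \rVert \approx \lambda R on the cycle
y_{t+1} &= \left[1 - \eta\mu\left(1 + \frac{\rho\mu}{\lambda R}\right)\right] y_t
         = \left[1 - \eta\mu - \frac{\mu^2}{\lambda^2}\,(2-\eta\lambda)\right] y_t
\end{align*}
```

Under these assumptions the bracketed factor has magnitude below 1 whenever 0 < mu < lambda and 0 < eta·lambda < 2, so transverse components shrink. It reaches exactly -1 at mu = lambda, which matches the undamped oscillation along the dominant direction.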

Abstract

Sharpness-Aware Minimization (SAM) has emerged as a state-of-the-art optimization framework that improves the generalization of deep neural networks by actively seeking flatter regions of the loss landscape. Despite its widespread empirical success, the convergence properties of SAM remain only partially understood. In particular, standard deterministic SAM with a normalized perturbation often exhibits a persistent, non-vanishing steady-state error and fails to converge to the exact minimum even on simple deterministic quadratic objectives. In this paper, we conduct a rigorous mathematical and empirical analysis of this phenomenon. We prove that on data-driven quadratic loss surfaces with anisotropic Hessian curvature, the optimization trajectory of standard SAM is asymptotically trapped in a stable period-2 limit cycle. We derive the exact closed-form radius R of this limit cycle as a function of the learning rate, perturbation radius, and maximum eigenvalue of the Hessian, showing that the trajectory oscillates perpetually along the dominant eigenvector. Furthermore, we analyze the transverse stability of this limit cycle, proving that perturbations along sub-dominant eigenspaces contract at a rate governed by an analytical amplification factor. To resolve this fundamental bottleneck, we analyze Gradient-Scaled SAM (GS-SAM), showing that scaling the perturbation with the gradient norm restores linear convergence to the exact global optimum. Our theoretical predictions are validated to machine precision via numerical simulations.
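
The behavior described in the abstract can be reproduced in a few lines. The following is a minimal sketch, not the authors' code: it assumes the textbook normalized SAM step, takes GS-SAM to mean an unnormalized perturbation `rho * gradient`, and uses an arbitrary 2x2 diagonal Hessian and arbitrary hyperparameters. The function names `sam_step`, `gs_sam_step`, and `run` are hypothetical, and `R_pred` is the radius obtained under those same assumptions, which may differ in form from the paper's expression.

```python
import numpy as np


def sam_step(w, H, lr, rho):
    """One normalized SAM step on L(w) = 0.5 * w^T H w (assumed textbook form)."""
    g = H @ w
    norm = np.linalg.norm(g)
    if norm == 0.0:
        return w  # the exact minimum is a fixed point; avoid dividing by zero
    return w - lr * (H @ (w + rho * g / norm))


def gs_sam_step(w, H, lr, rho):
    """One gradient-scaled step: perturbation rho * g (assumed form of GS-SAM)."""
    g = H @ w
    return w - lr * (H @ (w + rho * g))


def run(step, w0, H, lr, rho, n_steps):
    """Iterate `step` for n_steps and return the final iterate."""
    w = np.array(w0, dtype=float)
    for _ in range(n_steps):
        w = step(w, H, lr, rho)
    return w


# Illustrative anisotropic Hessian and hyperparameters (arbitrary choices).
H = np.diag([4.0, 1.0])
lam_max = 4.0
lr, rho = 0.1, 0.05
# Start mostly along the dominant eigenvector so the transient stays short.
w0 = [1.0, 0.01]

w_sam = run(sam_step, w0, H, lr, rho, 2000)
w_sam_next = sam_step(w_sam, H, lr, rho)  # one more step: should flip sign
w_gs = run(gs_sam_step, w0, H, lr, rho, 2000)

# Radius of the period-2 cycle under the assumed update rule.
R_pred = lr * lam_max * rho / (2.0 - lr * lam_max)

print("SAM iterate norm:    ", np.linalg.norm(w_sam))
print("predicted radius:    ", R_pred)
print("SAM iterate, next:   ", w_sam, w_sam_next)
print("GS-SAM iterate norm: ", np.linalg.norm(w_gs))
```

In this setup the SAM iterate keeps a nonzero norm and flips sign at every step, while the gradient-scaled variant shrinks toward the origin. With the perturbation proportional to the gradient the update is a linear map, which is why its contraction is geometric.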
