What question did this study set out to answer?

May 2, 2026

From Convergence to Generalization: Stability of Stationary-Point Learning Algorithms.

Key Points

Explores the stability and generalization properties of learning algorithms without relying on convexity assumptions.
Establishes stability and generalization bounds that apply to almost any algorithm in nonconvex settings, under a mild differentiability assumption.
Introduces an algorithm-dependent quantity that depends only on the training dataset and the algorithm's output.
Applies the general framework to gradient descent, with implications for linear models and shallow neural networks.
Derives bounds that explicitly involve the optimization error and the algorithm-dependent quantity, capturing the local curvature of the objective function around the learned model.
The analysis remains valid even when the algorithm does not converge to a global or local minimizer.
Empirical studies verify the effectiveness of the stability analyses.

Abstract

Algorithmic stability is a fundamental concept in learning theory for studying the generalization guarantees of learning algorithms. A notable limitation of classical stability analyses is that they often require convexity assumptions to obtain nontrivial bounds. In this paper, we investigate the stability and generalization properties of learning algorithms in nonconvex settings. We introduce an algorithm-dependent quantity that depends only on the training dataset and the algorithm's output. Under a mild differentiability assumption, we establish stability and generalization bounds that apply to almost any algorithm. Our bounds explicitly involve the optimization error and the algorithm-dependent quantity, thereby capturing the local curvature of the objective function around the learned model. A key feature of our analysis is that it remains valid even when the algorithm does not converge to a global or local minimizer. We further apply our general framework to gradient descent and demonstrate its implications for both linear models and shallow neural networks. Empirical studies verify the effectiveness of our stability analyses.
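To make the two ingredients named in the abstract concrete, the sketch below runs gradient descent on a linear least-squares model and reports (a) the gradient norm at the output, used here as a stand-in for the optimization error, and (b) how far the output moves when a single training example is replaced, an empirical proxy for algorithmic stability. This is an illustration under stated assumptions, not the paper's method: the synthetic data, the squared loss, the step size, and the helper names `gradient_descent`, `grad_norm`, and `replace_one_sensitivity` are all hypothetical, and the paper's algorithm-dependent quantity is not reproduced here because the abstract does not define it.

```python
import numpy as np

def gradient_descent(X, y, steps=200, lr=0.1):
    """Full-batch gradient descent on the mean squared loss, started at w = 0."""
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(steps):
        grad = X.T @ (X @ w - y) / n  # gradient of (1 / 2n) * ||Xw - y||^2
        w = w - lr * grad
    return w

def grad_norm(X, y, w):
    """Norm of the training-loss gradient at w (assumed proxy for optimization error)."""
    n = X.shape[0]
    return float(np.linalg.norm(X.T @ (X @ w - y) / n))

def replace_one_sensitivity(X, y, X_alt, y_alt, **kwargs):
    """Largest parameter change when example i is swapped for (X_alt[i], y_alt[i])."""
    w = gradient_descent(X, y, **kwargs)
    gaps = []
    for i in range(X.shape[0]):
        Xi, yi = X.copy(), y.copy()
        Xi[i], yi[i] = X_alt[i], y_alt[i]  # neighbouring dataset differing in one example
        gaps.append(np.linalg.norm(gradient_descent(Xi, yi, **kwargs) - w))
    return w, float(np.max(gaps))

# Synthetic data (assumption): Gaussian features, noisy linear labels.
rng = np.random.default_rng(0)
n, d = 50, 5
w_true = rng.normal(size=d)
X = rng.normal(size=(n, d))
y = X @ w_true + 0.1 * rng.normal(size=n)
X_alt = rng.normal(size=(n, d))
y_alt = X_alt @ w_true + 0.1 * rng.normal(size=n)

w, sens = replace_one_sensitivity(X, y, X_alt, y_alt)
print("gradient norm at output:", grad_norm(X, y, w))
print("replace-one sensitivity:", sens)
```

The replace-one measurement compares outputs on datasets that differ in a single example, which is the comparison stability analyses are built on. Nothing in the measurement requires the output to be a minimizer, so the same check can be run at any iterate, although the squared loss used here is convex and therefore does not exercise the nonconvex regime the paper targets.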
