What question did this study set out to answer?

This research aims to develop a robust learning method using random features that can handle outliers and noise effectively in large datasets.

March 6, 2026

Generalization Properties of Robust Learning With Random Features

Key Points

This research aims to develop a robust learning method using random features that can handle outliers and noise effectively in large datasets.
Proposed a robust and adaptive regularized least squares method with random features (RRLS-RF).
Incorporated response truncation to balance robustness and bias based on sample size.
Established generalization properties under bounded moment conditions for RRLS-RF.
Analyzed learning rates and optimized random features based on data-dependent strategies.
RRLS-RF achieves learning rates of O(|D|^{-δ/(2δ + 2)}) with O(|D|^{δ/(2δ + 2)}log |D|) random features.
Learning rates converge to the optimal rates as δ increases, addressing traditional assumptions.
Demonstrated superior robustness against outliers and noise compared to standard methods in numerical experiments.

Abstract

Random features (RFs) provide an efficient approximation to kernel methods, and allow for scalable learning on large datasets by reducing computational complexity while maintaining strong theoretical guarantees. However, real-world data can often be contaminated by outliers or heavy-tailed noise, which significantly degrades the performance of standard RF algorithms. To address this issue, we propose a robust and adaptive regularized least squares method with random features (RRLS-RF) that incorporates response truncation. The truncation level adaptively balances robustness and bias based on the sample size and moment conditions. We establish the generalization properties of RRLS-RF by assuming only a bounded (1+) -th moment for any 0. Specifically, our analysis shows that RRLS-RF achieves learning rates of O (|D|^- {2 +2}) with only O (|D|^ {2 +2} |D|) random features, where |D| denotes the training sample size. These results converge to the optimal learning rates of O (|D|^-1{2}) as, covering the traditional boundedness or sub-Gaussian assumptions in the regularized least squares method with random features (RLS-RF). Furthermore, we refine our analysis and show that RRLS-RF can achieve even faster learning rates under source and capacity conditions, as well as a smaller number of RFs with data-dependent sampling strategies. The derived sharp learning rates can also cover the mis-specified settings where the true function may not precisely align with the assumed kernel space. We further establish the first minimax lower bound under the weak moment condition, which shows that the RRLS-RF estimator is optimal over a wide range of source conditions. Our numerical experiments and real data analysis verify the theoretical results and demonstrate the superior robustness of RRLS-RF against outliers and heavy-tailed noise compared to standard methods.

Bookmark

Generalization Properties of Robust Learning With Random Features

Key Points

Abstract

Cite This Study