What question did this study set out to answer?

This research aims to improve the balance between accuracy and robustness in adversarial training using an active machine learning approach.

May 1, 2026Open Access

Active machine learning approach to adversarial training improves trade-off between natural accuracy and adversarial robustness

Key Points

This research aims to improve the balance between accuracy and robustness in adversarial training using an active machine learning approach.
Introduced novel data generation techniques for adversarial examples based on classification margin criteria.
Evaluated the method using WRN34-10 and ResNet-18 on datasets like CIFAR-10, CIFAR-100, SVHN, and TinyImagenet-200.
Theoretical contributions were provided to explain the properties of this approach.
The proposed method achieved significant improvements in natural accuracy on CIFAR-10, CIFAR-100, SVHN, and TinyImagenet-200.
Results show enhanced robustness against adversarial attacks compared to existing methods.
Improvements in accuracy and robustness metrics were statistically significant for the studied architectures.

Abstract

Understanding our world which is open and diverse requires foundation models that generalize well while trustworthy. Adversarial training has been considered to be one of the most effective strategies to achieve robust learning systems, yet adversarial training methods have exhibited a trade-off between generalization accuracy and robustness. Motivated by the active machine learning approach to adversarial training, we introduce novel data generation technique acquires adversarial examples based on classification margin criterion, addressing key trade-off between generalization accuracy and adversarial robustness which remains a fundamental challenge in adversarial training. Here, we provide theoretical contribution that sheds light on the properties of the approach which is expected to be beneficial. Likewise, we empirically demonstrate that the proposed method achieves significant improvements in accuracy and robustness with WRN34-10 and ResNet-18 on CIFAR-10, CIFAR-100, SVHN, and TinyImagenet-200. This Article contributes to the broader scientific literature on adversarial training, generalization theory, and robust machine learning.

Mark Helpful

Bookmark

Relay

View Full Paper