What question did this study set out to answer?

This research aims to enhance the transferability of adversarial examples by flattening the input loss landscape.

February 16, 2026

Enhancing Adversarial Transferability with Cost-efficient Landscape Flattening

Key Points

This research aims to enhance the transferability of adversarial examples by flattening the input loss landscape.
Theoretically analyzed adversarial transferability linked to loss landscape characteristics.
Proposed the Cost-efficient Landscape Flattening (CLEF) attack integrating local maxima and minima optimization.
Utilized gradient reuse from previous attack steps to aid current perturbations.
Applied probabilistic modeling to learn perturbation distributions and assist in reaching local minima.
Demonstrated that incorporating local maxima and minima significantly flattens the loss landscape.
Showed improved adversarial transferability for crafted adversarial examples across different models.

Abstract

The transferability of adversarial examples across different models has drawn considerable attention recently, particularly in targeted transferability. Prior research has empirically shown that optimizing adversarial perturbations at neighboring points with the highest loss value improves transferability. While effective, such a method requires multiple iterations to reach the local maxima and disregards the local minima of the input loss landscape. In this paper, we theoretically show that enhancing adversarial transferability is attainable by flattening the input loss landscape. This is accomplished through the perturbation optimization at both local maxima and minima. Moreover, we propose the Cost-efficient LandscapE Flattening (CLEF) attack to consider local maxima and minima around current inputs in a cost-efficient way to flatten the loss landscape and improve adversarial transferability. Specifically, we reuse the gradients of the previous attack step to assist current inputs in reaching local maxima, and employ probabilistic modeling to learn the distributional representations of perturbations that assist current inputs in reaching local minima. This probabilistic modeling can be pre-trained on dozens of images from other domains, enabling us to directly sample this type of perturbation from the pre-trained distribution when attacking. Experimental results demonstrate that integrating local maxima and minima into targeted transferable attacks can significantly flatten the loss landscape of the crafted adversarial examples, resulting in improved adversarial transferability.

Mark Helpful

Bookmark

Relay