What question did this study set out to answer?

This research aims to improve the transferability of targeted adversarial examples by combining fine-grained feature mixup perturbation with reference-based gradient refinement.

June 17, 2026 · Open Access

Boosting targeted adversarial transferability via fine-grained feature mixup perturbation and reference-based gradient refinement

Key Points

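Proposed Fine-grained Feature Mixup Perturbation (FFM), which partitions feature maps into spatially disjoint blocks and mixes each block with clean features drawn from spatially shuffled positions of the same image.
Proposed Reference-based Gradient Refinement (RGR), which selectively amplifies deviations between instantaneous and reference gradients to steer optimization toward flatter, more transferable loss regions.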
Conducted extensive experiments on ImageNet and CIFAR-10 datasets with CNN and Vision Transformer architectures.
FMGR significantly outperformed existing methods in generating transferable adversarial examples.
Improved attack transferability against both CNN and Vision Transformer target architectures.
Maintained computational efficiency while enhancing performance.
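The block-wise mixing idea in the key points above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function name `fine_grained_feature_mixup`, the `block` and `max_ratio` parameters, and the choice to draw one random mixing ratio per block are all assumptions made for illustration. The page only states that feature maps are split into spatially disjoint blocks and mixed with clean features from spatially shuffled positions of the same image.

```python
import numpy as np

def fine_grained_feature_mixup(adv_feat, clean_feat, block=2, max_ratio=0.5, rng=None):
    """Illustrative block-wise feature mixup (hypothetical helper, not the paper's code).

    adv_feat, clean_feat: arrays of shape (C, H, W) taken from the same layer.
    block: side length of each square spatial block (assumed parameter).
    max_ratio: upper bound of the per-block mixing ratio (assumed parameter).
    """
    rng = np.random.default_rng() if rng is None else rng
    c, h, w = adv_feat.shape
    if clean_feat.shape != adv_feat.shape:
        raise ValueError("adv_feat and clean_feat must have the same shape")
    if h % block or w % block:
        raise ValueError("H and W must be divisible by the block size")
    gh, gw = h // block, w // block  # number of blocks along each spatial axis

    # Cut the clean feature map into disjoint blocks: (gh * gw, C, block, block).
    clean_blocks = (clean_feat.reshape(c, gh, block, gw, block)
                    .transpose(1, 3, 0, 2, 4)
                    .reshape(gh * gw, c, block, block))

    # Shuffle block positions so a block is no longer tied to its own location.
    shuffled = clean_blocks[rng.permutation(gh * gw)]

    # Reassemble the shuffled blocks into a full (C, H, W) map.
    shuffled_map = (shuffled.reshape(gh, gw, c, block, block)
                    .transpose(2, 0, 3, 1, 4)
                    .reshape(c, h, w))

    # Assumption: one independent mixing ratio per block instead of a global one.
    ratios = rng.uniform(0.0, max_ratio, size=(gh, gw))
    ratio_map = np.repeat(np.repeat(ratios, block, axis=0), block, axis=1)  # (H, W)

    # Convex combination, broadcast over the channel axis.
    return (1.0 - ratio_map) * adv_feat + ratio_map * shuffled_map
```

In an actual attack this mixing would be applied to intermediate activations of the surrogate model at every optimization step, for example through a forward hook; that wiring is omitted here.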

Abstract

Deep neural networks are vulnerable to transferable adversarial examples in black-box scenarios, and targeted attacks that mislead models into predicting specific classes pose particularly severe threats. Feature mixup attacks enhance adversarial transferability by injecting clean features to perturb intermediate representations, yet existing methods suffer from two fundamental limitations. First, coarse-grained global mixing strategies apply a shared mixing ratio uniformly across all spatial positions and fix the clean reference for each position to its corresponding location in the clean image, which limits the diversity of mixed feature representations during optimization and, in turn, the transferability of the generated adversarial examples. Second, standard momentum-based optimization over-aligns with the surrogate model’s gradient geometry, suppressing the gradient variations needed to escape model-specific local minima. We propose Fine-grained Feature Mixup Perturbation (FFM) and Reference-based Gradient Refinement (RGR), together referred to as FMGR, to address both limitations. FFM partitions feature maps into spatially disjoint blocks and mixes each block with clean features drawn from spatially shuffled positions of the same image, breaking the fixed spatial correspondence of clean references and producing more generalizable feature perturbations. RGR selectively amplifies deviations between instantaneous and reference gradients, suppressing surrogate-specific dominant directions and steering optimization toward flatter, more transferable loss regions. Extensive experiments on ImageNet and CIFAR-10 demonstrate that FMGR significantly outperforms state-of-the-art methods against both CNN and Vision Transformer architectures while maintaining computational efficiency.
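The abstract does not give the RGR update rule, so the following is only a rough sketch of what "selectively amplifying deviations between instantaneous and reference gradients" might look like. The function name `refine_gradient`, the `gamma` and `quantile` parameters, and the quantile-based selection rule are assumptions for illustration, as is whatever is passed in as the reference gradient (for example an accumulated momentum term).

```python
import numpy as np

def refine_gradient(grad, ref_grad, gamma=1.0, quantile=0.75):
    """Illustrative gradient refinement (hypothetical, not the paper's formula).

    grad: instantaneous gradient at the current step.
    ref_grad: reference gradient, e.g. an accumulated momentum term (assumption).
    gamma: amplification strength for the deviation (assumed parameter).
    quantile: only coordinates whose absolute deviation reaches this quantile
              are amplified (assumed selection rule).
    """
    grad = np.asarray(grad, dtype=float)
    ref_grad = np.asarray(ref_grad, dtype=float)
    deviation = grad - ref_grad

    # Select the coordinates where the current gradient departs most from the reference.
    threshold = np.quantile(np.abs(deviation), quantile)
    mask = np.abs(deviation) >= threshold

    # Push the selected coordinates further along their deviation; leave the rest unchanged.
    return grad + gamma * mask * deviation

# Only the last coordinate deviates from the reference, so only it is amplified.
refined = refine_gradient([1.0, 0.0, 0.0, 3.0], [1.0, 0.0, 0.0, 1.0])
```

The refined gradient would then replace the raw gradient in the usual iterative update. How the paper chooses the reference and the selection criterion may differ from this sketch.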

Cite This Study

Gao et al. (Mon,) studied this question.

synapsesocial.com/papers/6a323e6ad50b63ecad207aa8 https://doi.org/10.1038/s41598-026-57336-1
