Key points are not available for this paper at this time.
We propose a new adversarial attack to Deep Neural Networks for image classification. Different from most existing attacks that directly perturb input pixels, our attack focuses on perturbing abstract features, more specifically, features that denote styles, including interpretable styles such as vivid colors and sharp outlines, and uninterpretable ones. It induces model misclassification by injecting imperceptible style changes through an optimization procedure. We show that our attack can generate adversarial samples that are more natural-looking than the state-of-the-art unbounded attacks. The experiment also supports that existing pixel-space adversarial attack detection and defense techniques can hardly ensure robustness in the style-related feature space.
Building similarity graph...
Analyzing shared references across papers
Loading...
Qiuling Xu
Purdue University West Lafayette
Guanhong Tao
University of Utah
Siyuan Cheng
Sun Yat-sen University
Purdue University West Lafayette
Building similarity graph...
Analyzing shared references across papers
Loading...
Xu et al. (Tue,) studied this question.
synapsesocial.com/papers/6a2165da4f27a676ef8b65e2 — DOI: https://doi.org/10.1609/aaai.v35i12.17259
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: