May 12, 2024Open Access

Improving Model Robustness against Adversarial Examples with Redundant Fully Connected Layer

Key Points

Key points are not available for this paper at this time.

Abstract

Recent studies show that deep neural networks are extremely vulnerable, especially for adversarial examples of image classification models. However, the current defense technologies exhibit a series of limitations in terms of the adaptability of different attacks, the trade-off between clean-instance accuracy and robust one, as well as efficiency for train time overhead. To tackle these problems, we present a novel component, named redundant fully connected layer, which can be combined with existing model backbones in a pluggable manner. Specifically, we design a tailor-made loss function for it that leverages cosine similarity to maximize the difference and diversity of multiple fully connected parts. We conduct extensive experiments against 12 representative attacks (white-box and black-box), based on the popular dataset. The empirical evaluations show that our scheme realizes significant outcomes against various attacks with negligible additional training overhead, while hardly bringing collateral damage for clean-instance accuracy.

Read Full Paperexternally

AIに質問

Bookmark

View Full Paper

Cite This Study

Zhao et al. (Sun,) studied this question.

synapsesocial.com/papers/68e6a879b6db64358762b0a8 https://doi.org/https://doi.org/10.1145/3589335.3651524

AIに質問

Bookmark

View Full Paper