Deep Neural Networks have taken Natural Language Processing by storm. While led to incredible improvements across many tasks, it also initiated a new field, questioning the robustness of these neural networks by them. In this paper, we investigate four word substitution-based on BERT. We combine a human evaluation of individual word substitutions a probabilistic analysis to show that between 96% and 99% of the analyzed do not preserve semantics, indicating that their success is mainly on feeding poor data to the model. To further confirm that, we introduce efficient data augmentation procedure and show that many adversarial can be prevented by including data similar to the attacks during. An additional post-processing step reduces the success rates of-of-the-art attacks below 5%. Finally, by looking at more reasonable on constraints for word substitutions, we conclude that BERT is a more robust than research on attacks suggests.
Hauser et al. (Wed,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: