What question did this study set out to answer?

This research aims to develop a framework for assessing the robustness of phishing detection systems against AI-generated adversarial emails.

May 20, 2026Open Access

Adversarial Testing Framework for AI-Generated Phishing Detection : Methodology and Empirical Evaluation

Key Points

This research aims to develop a framework for assessing the robustness of phishing detection systems against AI-generated adversarial emails.
Evaluated two classifiers, Logistic Regression and Support Vector Machine, on 1,600 emails
Generated adversarial samples using synonym substitution and structural camouflage techniques
Performed statistical validation with McNemar's test and bootstrap confidence intervals
Both classifiers maintained a 0.0% false positive rate
Detection sensitivity significantly decreased under adversarial conditions
Identified a critical asymmetric failure mode, enabling silent evasion in deployed systems

Abstract

This paper presents an adversarial testing framework for evaluating the robustness of machine learning-based phishing detection systems against AI-generated adversarial email content. Two TF-IDF-based classifiers Logistic Regression and Support Vector Machine are evaluated on a balanced dataset of 1,600 emails under both standard and adversarially transformed conditions. Adversarial samples are generated through synonym substitution and structural camouflage techniques. Results demonstrate statistically significant performance degradation under adversarial conditions, with a critical asymmetric failure mode identified: both classifiers maintain a 0.0% false positive rate while detection sensitivity is substantially reduced, creating a silent evasion channel in deployed systems. Statistical validation is performed using McNemar's test and bootstrap confidence intervals. This work establishes adversarial robustness evaluation as a necessary component of phishing detection assessment, particularly as AI-generated content increasingly characterises real-world attack vectors.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Arnika Madushani Rangalla (Mon,) studied this question.

synapsesocial.com/papers/6a0d4fa9f03e14405aa9b0db https://doi.org/https://doi.org/10.5281/zenodo.20268273

Bookmark

View Full Paper