What does this research mean for the field?

A unified adversarial patch method effectively reduces the accuracy of visible-infrared detection models by over 89% in simulated real-world scenarios. Novelty: ClaimNovelty.NOVEL_FINDING. Consensus alignment: ConsensusAlignment.CHALLENGES_CONSENSUS.

What question did this study set out to answer?

The study aims to unveil the vulnerabilities of visible-infrared detection models to adversarial attacks.

February 16, 2026

Physical Multimodal Adversarial Attacks against Object Detection Models in Industrial Monitoring Scenario

Key Points

The study aims to unveil the vulnerabilities of visible-infrared detection models to adversarial attacks.
Proposes a unified adversarial patch method using a genetic algorithm.
Enables selective or balanced attacks on visible and infrared detectors.
Optimizes shape features of patches and adjusts attack strength adaptively.
Employs Expectation Transformation (EOT) for robustness against viewpoint changes.
Achieves an attack success rate (ASR) exceeding 89%.
Effectively reduces detection model accuracy in simulated real-world scenarios.
Demonstrates flexibility and robustness of cross-modal attacks through optimization.

Abstract

Deep Neural Networks (DNNs) have made significant advancements in computer vision, widely applied to various tasks. However, these models remain vulnerable to adversarial attacks. This study aims to reveal the adversarial threats faced by visible-infrared detection models in real-world scenarios and proposes a unified adversarial patch method, i.e., a single patch design effective across both visible and infrared modalities, based on a genetic algorithm. This method enables selective or balanced attacks on visible and infrared detectors, providing an in-depth analysis of the security of models in practical applications. Experimental results show that the method effectively reduces the detection model’s accuracy and demonstrates attack effects in simulated real-world scenarios. By optimizing the shape features of adversarial patches using a genetic algorithm, and adaptively adjusting the attack strength across modalities via weight coefficients, the proposed method enhances the flexibility and robustness of cross-modal adversarial attacks. Additionally, the method employs the Expectation Transformation (EOT) strategy, showing strong robustness under different viewpoints. Extensive experiments validate the method’s effectiveness, with an attack success rate (ASR) exceeding 89%. This study provides a theoretical basis for improving the robustness and security of models and offers valuable insights for safety-critical applications such as intelligent surveillance.

Mark Helpful

Bookmark

Relay