What question did this study set out to answer?

The central aim is to enhance zero-shot anomaly detection by effectively capturing anomaly semantics across varying domains.

May 14, 2026Open Access

Anomaly-aware prompt learning and multi-scale feature adaptation for zero-shot anomaly detection

Key Points

The central aim is to enhance zero-shot anomaly detection by effectively capturing anomaly semantics across varying domains.
Proposed a framework that integrates anomaly-aware prompt learning and feature adaptation.
Utilized an anomaly-aware textual prompt module to align visual and textual modalities dynamically.
Introduced multi-scale feature adapters to improve perception of defects at different scales.
Achieved state-of-the-art zero-shot anomaly detection performance on 15 industrial and medical datasets.
Showed significant performance improvement under few-shot settings, indicating robust application potential.

Abstract

Zero-Shot Anomaly Detection (ZSAD) aims to generalize to unseen domains without requiring any target-domain samples, and is commonly applied to address the cold-start problem in industrial settings. However, existing ZSAD methods usually fail to capture anomaly semantics in specific contexts, limiting their cross-domain generalization. Furthermore, vision-language models (VLMs) typically show limited sensitivity to subtle anomalous patterns, making it difficult to explicitly guide VLM-based ZSAD model to focus on anomalies. To address these issues, this paper proposes a framework integrating anomaly-aware prompt learning and feature adaptation, which consists of two key components: an anomaly-aware textual prompt module and multi-scale feature adapters. Specifically, we dynamically incorporate the intrinsic local and global anomaly semantics from test images into textual prompts to achieve deep alignment between visual and textual modalities. To focus on defects at different levels, we introduce adapters to aggregate multi-scale visual features, thus enhancing fine-grained anomaly perception. Furthermore, the framework is extended to the few-shot setting to effectively leverage limited target-domain samples. Extensive experiments on 15 industrial and medical datasets demonstrate that the proposed method achieves state-of-the-art (SOTA) ZSAD performance. Notably, the method can further significantly improve the performance under few-shot settings, indicating its strong application potential.

Bookmark

View Full Paper

Cite This Study

Li et al. (Tue,) studied this question.

synapsesocial.com/papers/6a056668a550a87e60a1e69b https://doi.org/https://doi.org/10.1007/s44443-026-00799-z

Bookmark

View Full Paper