March 3, 2026Open Access

Few-shot learning in industrial applications

Key Points

Support set augmentation shows significant effectiveness, enhancing the performance across various configurations,
DINOv2 and ConvNeXt-v2-T are identified as the leading backbone models, achieving the highest accuracy in defect classification,
Analysis involves evaluating 16 model combinations under different conditions including deterministic support set augmentation,
Findings imply that a robust backbone paired with a simple augmentation method can create efficient classification systems.

Abstract

This paper reports on the empirical performance of few-shot learning (FSL) for visual defect classification using confidential industrial datasets. We evaluate 16 combinations of four backbone models (Perception Encoder, DINOv2, DINOv3, ConvNeXt-v2) and four FSL classifiers (Prototypical Networks, Neighborhood Component Analysis, Relation Networks, Linear Adapter). The evaluation covers three conditions: a baseline comparison, deterministic support set augmentation, and a learnable attention preprocessor. Results demonstrate that support set augmentation is a highly effective strategy, improving performance in nearly all configurations. Furthermore, the DINOv2 and ConvNeXt-V2-T backbones emerged as top performers, achieving the most competitive and highest-accuracy results, respectively. These findings suggest that for industrial FSL applications, combining a strong, pre-trained backbone with a simple augmentation strategy is a practical approach for building data-efficient classification systems.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Molek et al. (Thu,) studied this question.

synapsesocial.com/papers/69a76621badf0bb9e87dbce3 https://doi.org/https://doi.org/10.15452/978-80-7599-515-5.2026.14

Bookmark

View Full Paper