What type of study is this?

This is a Quantitative Study study.

September 23, 2025Open Access

Saliency-Guided Local Semantic Mixing for Long-Tailed Image Classification

Key Points

Saliency-guided local semantic mixing significantly boosts classification accuracy for tail classes.
Enhancements improved performance by 5.0%, 7.3%, and 6.1% across three benchmark datasets.
Implemented using a novel approach that combines semantic-aware techniques and dynamic labeling.
Framework integrates well with existing models, indicating broad applicability across various classifications.

Abstract

In real-world visual recognition tasks, long-tailed distributions pose a widespread challenge, with extreme class imbalance severely limiting the representational learning capability of deep models. In practice, due to this imbalance, deep models often exhibit poor generalization performance on tail classes. To address this issue, data augmentation through the synthesis of new tail-class samples has become an effective method. One popular approach is CutMix, which explicitly mixes images from tail and other classes, constructing labels based on the ratio of the regions cropped from both images. However, region-based labels completely ignore the inherent semantic information of the augmented samples. To overcome this problem, we propose a saliency-guided local semantic mixing (LSM) method, which uses differentiable block decoupling and semantic-aware local mixing techniques. This method integrates head-class backgrounds while preserving the key discriminative features of tail classes and dynamically assigns labels to effectively augment tail-class samples. This results in efficient balancing of long-tailed data distributions and significant improvements in classification performance. The experimental validation shows that this method demonstrates significant advantages across three long-tailed benchmark datasets, improving classification accuracy by 5.0%, 7.3%, and 6.1%, respectively. Notably, the LSM framework is highly compatible, seamlessly integrating with existing classification models and providing significant performance gains, validating its broad applicability.

Saliency-Guided Local Semantic Mixing for Long-Tailed Image Classification

Key Points

Abstract

Cite This Study