What question did this study set out to answer?

This research aims to enhance image classification by integrating active learning and pseudo-labeling within visual language models.

April 1, 2026Open Access

Improving Image Classification through Active Learning and Pseudo-Labeling in Vision-Language Models

Key Points

This research aims to enhance image classification by integrating active learning and pseudo-labeling within visual language models.
Evaluated five active learning strategies: Random Sampling, Uncertainty Sampling, Margin Sampling, Entropy Sampling, Query-by-Committee.
Assessed three pseudo-labeling approaches: Direct, Confidence Threshold, Feature Similarity.
Performed iterative evaluations to combine active learning and pseudo-labeling techniques.
Showed promising results in image classification accuracy with fewer data iterations.
Achieved full class coverage efficiently within limited data scenarios.
Demonstrated higher class representativeness and reduced propagation errors.

Abstract

Visual Language Models (VLMs) combine natural language processing and computer vision to interpret multimodal data, such as images and text, showing great potential in image classification applications. This paper investigates the integration of Active Learning (AL) and pseudo-labeling techniques with VLMs to improve image classification in various domains. To achieve this, five AL strategies (Random Sampling, Uncertainty Sampling, Margin Sampling, Entropy Sampling, and Query-by-Committee) and three pseudo-labeling approaches (Direct, Confidence Threshold, and Feature Similarity) were evaluated iteratively. The results demonstrate that the combination of active learning and pseudo-labeling can achieve promising results, in addition to full class coverage in a few iterations. We conclude that the integration of AL with feature similarity-based pseudo-labeling offers a robust and efficient solution for image classification in limited-data scenarios, promoting high accuracy, class representativeness, and the reduction of propagation errors, with potential for applications in critical domains like healthcare and industry.

Bookmark

View Full Paper

Bookmark

View Full Paper

Improving Image Classification through Active Learning and Pseudo-Labeling in Vision-Language Models

Key Points

Abstract

Cite This Study