Although active learning can reduce the effort required to annotate object detection data, many current methods rely on a single selection criterion or combine criteria without accounting for annotation costs or their interactions. This paper presents a multi-criterion, cost-aware active learning framework for detecting small objects in agricultural images. The framework jointly considers prediction uncertainty, object size, scene density, and annotation cost. We evaluate both scalarized and Pareto-based selection strategies across five cost models and conduct an ablation study to examine the role and interactions of each criterion. Experimental results demonstrate that explicit annotation cost modeling improves active learning efficiency by reducing the amount of annotation required to achieve a given level of detection performance. Across multiple cost formulations and selection strategies, cost-aware acquisition reaches comparable accuracy and reduces the estimated annotation effort required to reach comparable detection performance by up to 50% compared to random sampling, where annotation effort is approximated using prediction-derived cost proxies.
Bonković et al. (Fri,) studied this question.