What question did this study set out to answer?

The aim is to explore how active learning can minimize the requirement for labeled data in text classification tasks using BERT-based models.

February 2, 2026Open Access

Reducing labeled data requirements in text classification with active learning and BERT-based transformers

Key Points

The aim is to explore how active learning can minimize the requirement for labeled data in text classification tasks using BERT-based models.
Integrated active learning strategies with transformer-based models from the BERT family.
Conducted experiments using 10 datasets and 7 different BERT classifiers.
Measured model performance through various evaluation metrics.
Achieved at least a 50% reduction in dataset size in 70% of cases.
Maintained model effectiveness without sacrificing performance.
Demonstrated the critical role of dataset size in maintaining high performance levels.

Abstract

Abstract It is a fact that natural language processing (NLP) has become an integral part of daily life, with research outcomes being integrated into various everyday implementations. A significant portion of this success can reasonably be attributed to the architecture of transformers. In this context, text classification problems constitute a large part of ongoing research. Simultaneously, there is a growing demand for high-quality labeled textual data. The latter is becoming increasingly urgent with the rising complexity and size of models. Based on this, the present work investigates the integration of active learning strategies into text classification problems using transformer-based models from the BERT family. Through an extensive experimental framework involving 10 datasets and 7 different BERT-based classifiers, we demonstrate that the incorporation of active learning in the context of text classification can significantly reduce the need for labeled data during the fine-tuning procedures. Specifically, our experimental results illustrate that without sacrificing model effectiveness–as measured by various evaluation metrics–we can achieve at least a 50% reduction in the dataset size in 70% of cases. Additionally, we show that the size of the dataset plays a crucial role in maintaining high performance levels.

Reducing labeled data requirements in text classification with active learning and BERT-based transformers

Key Points

Abstract

Cite This Study