What question did this study set out to answer?

This research aims to address model collapse during iterative self-training of language models by introducing adaptive data selection techniques.

February 21, 2026 · Open Access

Countering Model Collapse in Iterative Self-Training via Dynamic Center-Edge Sampling

Key Points

The authors propose DCES (dynamic center-edge sampling), a dynamic data selection framework for iterative self-training.
They run iterative self-training experiments across multiple language model architectures.
Evaluation uses perplexity (PPL), loss, expected calibration error (ECE), and entropy.
DCES significantly outperforms the baselines on perplexity and loss across the tested models and test sets.
The framework also mitigates the degradation of expected calibration error and entropy, indicating that output diversity and calibration are better preserved.
Adaptive curation of training data is shown to enhance AI self-evolution capabilities.

Abstract

Iterative self-training of language models presents a promising avenue for realizing self-improving Artificial Intelligence systems; however, this process is often hindered by the fundamental challenge of “Model Collapse.” Existing research indicates that models undergo catastrophic performance degradation and diversity collapse when recursively trained on their own increasingly homogenized synthetic data. Although some data selection-based approaches attempt to mitigate this issue by enhancing diversity, they predominantly rely on static strategies, lacking a feedback mechanism capable of adapting in real time to the dynamic evolution of the model state and data distribution. To address this limitation, we propose a dynamic data selection framework called “DCES” (dynamic center-edge sampling). We conducted extensive experiments on iterative self-training tasks across multiple model architectures. The results demonstrate that our system significantly outperforms baselines in terms of Perplexity (PPL) and loss across various models and test sets. Simultaneously, the framework effectively mitigates the degradation of Expected Calibration Error (ECE) and entropy metrics, successfully preventing mode collapse. Our findings highlight that an adaptive system capable of intelligent data curation based on training feedback is pivotal for maintaining the dynamic balance of data distributions and achieving sustainable AI self-evolution. This work provides a systematic methodology for realizing this goal.
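For readers unfamiliar with the two headline metrics, the sketch below shows the standard textbook definitions of perplexity and expected calibration error. It is not the paper's evaluation code: the function names `perplexity` and `expected_calibration_error`, the use of natural-log token probabilities, and the choice of 10 equal-width confidence bins are all assumptions made for illustration.

```python
import math


def perplexity(token_log_probs):
    """Perplexity = exp(mean negative log-likelihood), natural log assumed."""
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    mean_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(mean_nll)


def expected_calibration_error(confidences, correct, n_bins=10):
    """Equal-width binned ECE (bin count is an illustrative assumption).

    confidences: predicted probability of the chosen output, each in [0, 1]
    correct:     booleans, True where the chosen output was right
    """
    n = len(confidences)
    if n == 0 or n != len(correct):
        raise ValueError("inputs must be non-empty and of equal length")

    # Group predictions by confidence; a confidence of exactly 1.0 goes
    # into the last bin.
    bins = [[] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        idx = min(int(conf * n_bins), n_bins - 1)
        bins[idx].append((conf, ok))

    # Sum the |accuracy - mean confidence| gap of each non-empty bin,
    # weighted by the fraction of predictions that fall in that bin.
    ece = 0.0
    for members in bins:
        if not members:
            continue
        avg_conf = sum(conf for conf, _ in members) / len(members)
        accuracy = sum(1 for _, ok in members if ok) / len(members)
        ece += (len(members) / n) * abs(accuracy - avg_conf)
    return ece
```

Lower is better for both quantities. A model that assigns probability 0.25 to every reference token has perplexity 4, and a model that reports 90% confidence while being right 75% of the time has an ECE of 0.15 under this binning.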
