Generative AI systems trained on synthetic data exhibit progressive degradation known as model collapse. This paper provides a theoretical explanation of this phenomenon using Shannon’s Data Processing Inequality (DPI), modeling iterative synthetic-data training as a Markov chain of lossy transformations. We show that mutual information with respect to the original data distribution must decrease monotonically, yielding qualitative predictions for exponential decay tendencies and indicating that information loss arises from general finite-precision and capacity constraints rather than from any specific architectural mechanism. Building on this analysis, we introduce the AI conceptual theorem, a generalized stability limit for computable systems. The theorem states that any purely computational system that generates outputs iteratively under finite precision, bounded capacity, and without external low-entropy input must experience cumulative information degradation after a finite number of steps. DPI-based collapse emerges as a special case of this broader principle. The framework is intended as a conceptual information-theoretic perspective rather than a fully formalized theory, with several assumptions intentionally simplified to highlight the underlying entropic mechanism. The results should therefore be interpreted as principled limits that motivate further empirical and mathematical investigation rather than as definitive closed-form predictions. Together, DPI and the AI Theorem provide a unified information-theoretic framework for understanding degradation in synthetic training, long-horizon inference, and other iterative computational processes. The resulting predictions are quantitatively falsifiable and offer guidance for designing more stable and information-preserving AI systems.
Building similarity graph...
Analyzing shared references across papers
Loading...
Pavel Straňák
Symmetry
Czech Radio
Building similarity graph...
Analyzing shared references across papers
Loading...
Pavel Straňák (Wed,) studied this question.
www.synapsesocial.com/papers/69f837423ed186a739981587 — DOI: https://doi.org/10.3390/sym18050764
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: