What question did this study set out to answer?

This research aims to develop a framework for understanding and detecting impending system collapse in advanced artificial agents under selection pressure.

January 24, 2026 · Open Access

System Collapse Under Selection Pressure: Viability Horizons for Advanced Artificial Agents

Key points

Introduced the concept of viability horizons for evaluating coherence in artificial agents.
Formalized collapse as a failure to sustain a coherent system history under irreversible updates.
Developed operational metrics for coherence drift and irreversibility-induced brittleness.
Outlined experimental protocols to detect collapse regimes in contemporary systems.
Collapse can occur without prior performance degradation; it is preceded instead by loss of internal coherence across time, memory, and control channels.
Alignment, reward maximization, and capability scaling are neither necessary nor sufficient for long-term viability.
New metrics were established for assessing structural stability in artificial intelligence systems.

Abstract

Current evaluation practices for advanced artificial agents emphasize performance metrics such as reward, accuracy, or task completion. These metrics often fail to detect structural instabilities that arise under long-horizon operation, especially when optimization signals are sparse, misleading, or adversarial. In this work, we introduce viability horizons: a quantitative framework for detecting impending system collapse in agents subject to irreversible selection pressure. Building on a history-level perspective, we show that collapse is not necessarily preceded by performance degradation but by a loss of internal coherence across time, memory, and control channels. We formalize collapse as a failure to sustain a coherent system history under irreversible updates and demonstrate that alignment, reward maximization, and capability scaling are neither necessary nor sufficient conditions for long-term viability. We propose operational metrics for coherence drift, delayed failure, and irreversibility-induced brittleness, and outline concrete experimental protocols for detecting collapse regimes in contemporary agentic systems. These results reframe AI risk as a structural stability problem rather than a behavioral or normative one.

Keywords: artificial intelligence stability, long-horizon coherence, irreversible updates, system collapse, alignment failure modes, cognitive persistence, information loss, entropy accumulation, dynamical systems, AI safety theory
