January 26, 2026 · Open Access

Output-Only Diagnostics for Multi-Turn Inference Instability in Large Language Models

Key Points

The study aims to develop a framework that detects multi-turn inference instability in large language models without access to model internals or ground truth.
It introduces an output-only diagnostic framework for language models.
Instability is formalized through observable structural events in conversation transcripts.
The framework is evaluated as a prediction task over conversation prefixes.
The proposed diagnostics are compared against turn-local heuristic baselines.
The diagnostics anticipate coherence collapse several turns in advance.
They outperform the turn-local heuristic baselines.
The framework identifies conditions under which previously stable reasoning trajectories become unreliable.

Abstract

Large language models often remain coherent in short interactions while exhibiting instability over longer conversational horizons. Most existing evaluation approaches are turn-local and retrospective, and therefore fail to anticipate such failures before they manifest. This work introduces an output-only diagnostic framework for detecting and predicting multi-turn inference instability without access to model internals, training data, or semantic ground truth. Instability is formalized via observable structural events in interaction transcripts and evaluated as a prediction task over conversation prefixes. Across multiple models and long-horizon tasks, the proposed diagnostics anticipate coherence collapse several turns in advance and outperform turn-local heuristic baselines. The framework is intentionally diagnostic: it does not aim to interpret, correct, or control model behavior, but to provide a reproducible and model-agnostic mechanism for identifying conditions under which previously stable reasoning trajectories become unreliable. This record represents a canonical archived version intended for citation and long-term reference.

Keywords: output-only diagnostics, long-horizon reasoning, inference instability, language model evaluation, multi-turn interaction, reasoning stability, black-box analysis, structural diagnostics
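
The abstract frames instability detection as a prediction task over conversation prefixes, using only observable structure in the transcript. The Python sketch below illustrates that general idea under stated assumptions. The abstract does not define the structural events, so the "near-repetition" event, the `jaccard` overlap measure, the thresholds, and all function names here are hypothetical stand-ins, not the authors' method.

```python
from typing import List, Optional


def jaccard(a: str, b: str) -> float:
    """Token-set overlap between two turns (0.0 = disjoint, 1.0 = same set)."""
    sa, sb = set(a.lower().split()), set(b.lower().split())
    if not sa and not sb:
        return 1.0
    return len(sa & sb) / len(sa | sb)


def structural_events(turns: List[str], repeat_threshold: float = 0.8) -> List[bool]:
    """Flag each turn that nearly repeats its predecessor.

    Assumption: near-repetition is used as one illustrative event type.
    Each flag depends only on a turn and the turn before it, so flags
    computed on a prefix match those computed on the full transcript.
    """
    events = [False]  # the first turn has no predecessor
    for prev, cur in zip(turns, turns[1:]):
        events.append(jaccard(prev, cur) >= repeat_threshold)
    return events[: len(turns)]


def first_alarm(turns: List[str], window: int = 3, min_events: int = 2) -> Optional[int]:
    """Return the length of the shortest prefix that raises an alarm.

    An alarm is raised when the last `window` turns of the prefix contain
    at least `min_events` flagged events. Returns None if no prefix does.
    """
    events = structural_events(turns)
    for k in range(1, len(turns) + 1):
        recent = events[max(0, k - window):k]
        if sum(recent) >= min_events:
            return k
    return None


# Hypothetical model outputs, one string per turn.
turns = [
    "Let us define the plan in three steps.",
    "Step one collects the data from the log.",
    "Step two filters the data by date.",
    "Step two filters the data by date.",
    "Step two filters the data by date.",
    "Step two filters the data by date again.",
]
print(first_alarm(turns))  # → 5
```

Because the alarm at prefix length `k` reads only the first `k` outputs, it can be compared against the turn at which a failure is later labeled to measure how many turns of advance warning a diagnostic gives. No model internals or reference answers are consulted.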
