What question did this study set out to answer?

This study asks whether a geometric vector derived from a language model's internal states can predict the model's behavioral class before it generates any output.

March 12, 2026 · Open Access

The Epistemic State Space Classifier: A 3D Geometric Vector Predicts LLM Behavioral Class Before Generation

Key Points

A three-dimensional geometric vector is built from projection scores onto uncertainty, refusal, and hallucination subspaces of the transformer residual stream.
Accuracy is measured with leave-one-out testing on 10 models from 6 organizations.
The vector predicts behavioral class (certain, uncertain, hallucination-prone, refusal) before any output token is generated.
Leave-one-out accuracy reaches 88–98% on most models; Qwen2.5-7B (65.0%) and Phi-3.5-mini (73.3%) fall below that range.
Llama-3.1-8B and Falcon-7B share the highest accuracy at 98.3%.
The results indicate that transformers maintain a structured, measurable epistemic state space, with proposed applications in real-time monitoring.
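
To make the idea of a "projection score" concrete, here is a minimal sketch in plain Python. It assumes each subspace is given as a list of orthonormal basis vectors and that the score is the norm of the hidden state's projection onto that subspace. The paper summary does not specify the exact scoring formula, layer, or basis construction, so the function names, the toy 4-dimensional hidden state, and the bases below are all illustrative assumptions.

```python
import math

def projection_score(hidden, basis):
    """Norm of the projection of `hidden` onto span(basis).

    Assumption: `basis` is a list of orthonormal vectors, so the
    projection norm is the root of the summed squared dot products.
    """
    coeffs = [sum(h * b for h, b in zip(hidden, vec)) for vec in basis]
    return math.sqrt(sum(c * c for c in coeffs))

def epistemic_vector(hidden, subspaces):
    """Stack the three projection scores into one 3D vector."""
    order = ("uncertainty", "refusal", "hallucination")
    return tuple(projection_score(hidden, subspaces[name]) for name in order)

# Hypothetical toy bases; real ones would be estimated from model activations.
subspaces = {
    "uncertainty": [[1.0, 0.0, 0.0, 0.0]],
    "refusal": [[0.0, 1.0, 0.0, 0.0]],
    "hallucination": [[0.0, 0.0, 1.0, 0.0], [0.0, 0.0, 0.0, 1.0]],
}
hidden = [0.5, -2.0, 3.0, 4.0]  # stand-in for a residual-stream activation
vec = epistemic_vector(hidden, subspaces)
print(vec)
```

In this toy case the three components are 0.5, 2.0, and 5.0. With a real model, `hidden` would be a residual-stream activation captured before generation starts.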

Abstract

We demonstrate that a three-dimensional geometric vector, constructed from projection scores onto uncertainty, refusal, and hallucination subspaces in the transformer residual stream, predicts behavioral class (certain, uncertain, hallucination-prone, refusal) with 88–98% leave-one-out accuracy before any output token is generated.

Results across 10 models from 6 organizations:
- Llama-3.1-8B (Meta): 98.3% ✓
- Mistral-7B-v0.2/v0.3 (Mistral AI): 96.7% ✓
- Falcon-7B (TII UAE): 98.3% ✓
- Llama-3.2-3B (Meta): 95.0% ✓
- Qwen2.5-3B (Alibaba): 96.7% ✓
- Gemma-2-2B (Google): 88.3% ✓
- Gemma-2-9B (Google): 90.0% ✓
- Qwen2.5-7B (Alibaba): 65.0% ⚠
- Phi-3.5-mini (Microsoft): 73.3% ⚠

This extends prior work (Alieksieienko, 2026) from subspace detection to behavioral prediction, establishing that transformers maintain a structured, measurable epistemic state space. Applications: real-time hallucination detection, safety monitoring, uncertainty routing. Research conducted in collaboration with an AI assistant (Anthropic Claude).
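
The leave-one-out protocol mentioned in the abstract can be sketched as follows. The classifier used in the paper is not stated here, so this example swaps in a simple nearest-centroid rule as an assumption. The sample vectors and the helper names `nearest_centroid_predict` and `leave_one_out_accuracy` are hypothetical toy values chosen for illustration, not data from the study.

```python
from collections import defaultdict

def nearest_centroid_predict(train, query):
    """Predict the label whose class centroid is closest to `query`.

    Assumption: a nearest-centroid rule stands in for whatever
    classifier the paper actually uses.
    """
    sums = defaultdict(lambda: [0.0, 0.0, 0.0])
    counts = defaultdict(int)
    for vec, label in train:
        counts[label] += 1
        for i, value in enumerate(vec):
            sums[label][i] += value
    best_label, best_dist = None, float("inf")
    for label, total in sums.items():
        centroid = [t / counts[label] for t in total]
        dist = sum((q - c) ** 2 for q, c in zip(query, centroid))
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label

def leave_one_out_accuracy(samples):
    """Hold out each sample once, fit on the rest, and score the prediction."""
    correct = 0
    for i, (vec, label) in enumerate(samples):
        train = samples[:i] + samples[i + 1:]
        if nearest_centroid_predict(train, vec) == label:
            correct += 1
    return correct / len(samples)

# Hypothetical 3D vectors (uncertainty, refusal, hallucination scores).
samples = [
    ((0.1, 0.1, 0.1), "certain"), ((0.2, 0.0, 0.1), "certain"),
    ((2.0, 0.1, 0.2), "uncertain"), ((2.2, 0.0, 0.1), "uncertain"),
    ((0.1, 2.1, 0.0), "refusal"), ((0.0, 1.9, 0.2), "refusal"),
    ((0.2, 0.1, 2.0), "hallucination-prone"), ((0.1, 0.0, 2.3), "hallucination-prone"),
]
accuracy = leave_one_out_accuracy(samples)
print(f"leave-one-out accuracy: {accuracy:.3f}")
```

Because each held-out point is never part of its own training set, the resulting accuracy estimates how well the 3D vector generalizes to unseen prompts for a given model. The toy clusters above are deliberately well separated, so every held-out point is classified correctly.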

Cite This Study

Inna Alieksieienko studied this question.

synapsesocial.com/papers/69b2588496eeacc4fcec8359 https://doi.org/10.5281/zenodo.18929048