March 4, 2026 | Open Access

Epistemic Self-Assessment Without Model Confidence: An AI That Knows What It Doesn't Know

Key Points

The study investigates how an AI can accurately assess its own knowledge without relying on its answer confidence.
Developed a knowledge graph agent that computes a multi-dimensional confidence score for each response from the structural properties of the relevant subgraphs.
Evaluated performance at varying confidence levels across 500 cycles of continuous learning.
Implemented a refusal mechanism for low-confidence queries, preventing misleading answers.
Achieved 82.9% accuracy at high confidence and 3.6% at the lowest confidence level, confirming that the confidence signal predicts actual performance.
Improved answer reliability by 15.5 percentage points (from 48.1% to 63.6%) by refusing low-confidence questions.
Refused 31.3% of questions, 87.7% of which would have been answered incorrectly.
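The refusal figures above combine four quantities: accuracy when every question is answered, accuracy on the questions that remain after refusal, the share of questions refused, and the share of refused questions that would have been wrong. The sketch below shows one way these could be computed from per-query records. The `Outcome` record, the `selective_metrics` function, the 0.5 threshold, and the toy data are all illustrative assumptions, not the paper's evaluation code or data.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Outcome:
    confidence: float  # hypothetical confidence score in [0, 1]
    correct: bool      # would the answer have been right if it were given?


def selective_metrics(outcomes, threshold):
    """Summarize the effect of refusing every query scored below `threshold`."""
    if not outcomes:
        raise ValueError("need at least one outcome")
    answered = [o for o in outcomes if o.confidence >= threshold]
    refused = [o for o in outcomes if o.confidence < threshold]

    def share(group, predicate):
        # Fraction of `group` satisfying `predicate`; None for an empty group.
        if not group:
            return None
        return sum(1 for o in group if predicate(o)) / len(group)

    return {
        "baseline_accuracy": share(outcomes, lambda o: o.correct),
        "answered_accuracy": share(answered, lambda o: o.correct),
        "refusal_rate": len(refused) / len(outcomes),
        "refused_error_share": share(refused, lambda o: not o.correct),
    }


# Toy data (NOT the paper's): ten queries, five of them below the threshold.
toy = [
    Outcome(0.9, True), Outcome(0.8, True), Outcome(0.7, True),
    Outcome(0.7, True), Outcome(0.6, False),
    Outcome(0.3, True), Outcome(0.2, False), Outcome(0.2, False),
    Outcome(0.1, False), Outcome(0.1, False),
]
metrics = selective_metrics(toy, threshold=0.5)
print(metrics)
```

On this toy data, accuracy rises from 0.5 (answering everything) to 0.8 (answering only confident queries) at the cost of refusing half the questions, and 0.8 of the refused questions would have been wrong. The one correct answer lost to refusal illustrates the trade-off that any threshold of this kind makes.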

Abstract

Large language models generate confident-sounding responses regardless of whether they possess relevant knowledge — a failure mode known as hallucination. Current approaches to confidence estimation (output logits, calibration scaling, verbalized uncertainty) all derive confidence from the same generative process that produces answers, creating a fundamental confound: the confidence estimate cannot be independent of the answer. We present an experiment demonstrating that epistemic self-assessment — the ability to accurately judge one's own knowledge — can be achieved through structural analysis of a knowledge graph, providing a confidence signal that is independent of the answer generation process. A developmental knowledge graph agent computes a multi-dimensional confidence score for every response based on the structural properties of relevant subgraphs. When confidence falls below a calibrated threshold, the agent refuses to answer with "I don't know enough about this yet" — a hard boundary that cannot be overridden by prompting. Over 500 cycles of continuous learning, the system demonstrated well-calibrated self-assessment: at high confidence, the agent achieved 82.9% accuracy; at the lowest confidence, accuracy was 3.6% — confirming that the agent's confidence signal reliably predicts actual performance. The refusal mechanism improved answer reliability by 15.5 percentage points (from 48.1% to 63.6%) by refusing 31.3% of questions, of which 87.7% would have been errors. Beyond per-query confidence, the agent maintains a persistent self-model tracking domain-level competence, growth trajectories, and 10 types of metacognitive insights stored as nodes in the neural graph — creating recursive self-knowledge that influences subsequent cognition. All computation is performed on the graph structure with zero language model involvement.
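The abstract does not list the structural properties that make up the confidence score, so the following is only a minimal sketch of the general idea: score a query from the graph alone, then gate answer generation on that score. The three features (`coverage`, `density`, `support`), their equal weighting, the `saturation` cap, the 0.5 threshold, and the toy graph are all hypothetical choices made for illustration. Only the refusal wording comes from the abstract.

```python
REFUSAL = "I don't know enough about this yet"  # wording taken from the abstract


def structural_confidence(graph, evidence, concepts, saturation=5):
    """Score a query from graph structure alone (hypothetical features).

    graph:    dict mapping node -> set of neighbouring nodes (undirected)
    evidence: dict mapping node -> number of supporting observations
    concepts: concepts mentioned in the query
    """
    unique = list(dict.fromkeys(concepts))  # drop duplicates, keep order
    known = [c for c in unique if c in graph]
    if not known:
        return 0.0
    # Coverage: share of query concepts that exist as nodes.
    coverage = len(known) / len(unique)
    # Density: share of possible links among the known concepts that exist.
    pairs = len(known) * (len(known) - 1) // 2
    links = sum(
        1
        for i, a in enumerate(known)
        for b in known[i + 1:]
        if b in graph[a]
    )
    density = links / pairs if pairs else 0.0
    # Support: evidence per known concept, capped at `saturation`.
    capped = sum(min(evidence.get(c, 0), saturation) for c in known)
    support = capped / (saturation * len(known))
    return (coverage + density + support) / 3


def respond(graph, evidence, concepts, generate_answer, threshold=0.5):
    """Refuse before any answer is generated, so the score cannot depend on it."""
    confidence = structural_confidence(graph, evidence, concepts)
    if confidence < threshold:
        return REFUSAL, confidence
    return generate_answer(concepts), confidence


# Toy undirected graph and evidence counts (illustrative only).
graph = {
    "paris": {"france", "capital"},
    "france": {"paris", "europe"},
    "capital": {"paris"},
    "europe": {"france"},
}
evidence = {"paris": 5, "france": 5, "capital": 3, "europe": 1}


def answer(concepts):
    return "answer about " + ", ".join(concepts)


print(respond(graph, evidence, ["paris", "france"], answer))
print(respond(graph, evidence, ["quantum", "entanglement"], answer))
```

Because `respond` computes the score and applies the threshold before `generate_answer` is ever called, the confidence estimate in this sketch is independent of the answer text, which is the property the abstract emphasizes. The gate is also a plain comparison in code, so nothing in the query text can override it.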

Authors

Sai Tilak Pally

Acumen (United States)
