What type of study is this?

This is a Experimental Study study.

October 23, 2025Open Access

TECP: Token-Entropy Conformal Prediction for LLMs

Key Points

TECP enhances uncertainty quantification in large language models for output prediction.
The method integrates log-probability-based metrics with a novel conformal prediction approach.
This approach ensures finite-sample coverage through effective calibration of prediction sets.
Success across CoQA and TriviaQA benchmarks suggests promising reliability for language model outputs.

Abstract

Uncertainty quantification (UQ) for open-ended language generation remains a critical yet underexplored challenge, particularly in settings where token-level log-probabilities are available during decoding. We present Token-Entropy Conformal Prediction (TECP), which treats a log-probability-based token-entropy statistic as a nonconformity score and integrates it with split conformal prediction to construct prediction sets with finite-sample coverage guarantees. We work in a white-box regime in which per-token log-probabilities are accessible during decoding. TECP estimates episodic uncertainty from the token-entropy structure of sampled generations and calibrates thresholds via conformal quantiles to ensure provable error control. Empirical evaluations across six large language models and two QA benchmarks (CoQA and TriviaQA) show that TECP consistently achieves reliable coverage and compact prediction sets, outperforming prior self-UQ methods. These results provide a principled and efficient solution for trustworthy generation in white-box, log-probability-accessible LLM settings.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Xu et al. (Tue,) studied this question.

synapsesocial.com/papers/68f9f86eb2c35e10cc4e3c18 https://doi.org/https://doi.org/10.3390/math13203351

Bookmark

View Full Paper