What type of study is this?

This is a quantitative study.

October 16, 2025 · Open Access

How Can Large Language Models Be More Reliable?

Key Points

Large language models remain inconsistently reliable, even on seemingly simple queries, but targeted strategies can make them more predictable.
The approach uses confidence-based abstention, combining uncertainty estimation with measures of question difficulty.
Models often exhibit difficulty discordance: they solve harder tasks while failing on simpler ones, which undermines trust.
Explicitly declining to answer when uncertain is intended to improve user trust and model transparency.

Abstract

Large Language Models (LLMs) are increasingly integrated into everyday applications, yet their reliability remains inconsistent, even for seemingly simple queries. By “scaling up” and “shaping up”, these models have improved average accuracy and robustness to prompt variations, but they continue to display “difficulty discordance”: they solve harder tasks while making errors on easier ones. Moreover, they show a marked reluctance to refuse answers even when uncertain. Such behaviour deprives users of clear cues about when outputs can be trusted. This work explores strategies to enhance LLM reliability through confidence-based abstention, combining uncertainty estimation techniques with measures of question difficulty to define a model’s “safe operating area”. By ensuring that queries are either answered correctly or explicitly declined, the approach aims to enhance predictability, transparency, and user trust, while providing a framework for managing model limitations.
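
The abstract does not specify how confidence or difficulty are computed, so the following is only a minimal sketch of the general idea of confidence-based abstention, not the paper's method. It assumes confidence is estimated as the agreement rate among several sampled answers, that difficulty is a score in [0, 1] supplied by some external estimator, and that the confidence threshold rises linearly with difficulty. The names `agreement_confidence`, `answer_or_abstain`, `base_threshold`, and `difficulty_weight` are hypothetical.

```python
from collections import Counter
from typing import Optional, Sequence, Tuple


def agreement_confidence(samples: Sequence[str]) -> Tuple[str, float]:
    """Return the majority answer and the fraction of samples agreeing with it.

    Assumption: sample agreement stands in for the model's confidence.
    """
    if not samples:
        raise ValueError("need at least one sampled answer")
    answer, count = Counter(samples).most_common(1)[0]
    return answer, count / len(samples)


def answer_or_abstain(
    samples: Sequence[str],
    difficulty: float,
    base_threshold: float = 0.6,      # hypothetical default
    difficulty_weight: float = 0.3,   # hypothetical default
) -> Optional[str]:
    """Answer only if confidence clears a difficulty-adjusted threshold.

    `difficulty` is assumed to lie in [0, 1]. Returning None means the
    query falls outside the "safe operating area" and is declined.
    """
    if not 0.0 <= difficulty <= 1.0:
        raise ValueError("difficulty must be in [0, 1]")
    answer, confidence = agreement_confidence(samples)
    # Harder questions require more agreement before an answer is given.
    threshold = min(1.0, base_threshold + difficulty_weight * difficulty)
    return answer if confidence >= threshold else None


if __name__ == "__main__":
    sampled = ["4", "4", "4", "5"]  # made-up sampled answers for illustration
    print(answer_or_abstain(sampled, difficulty=0.1))
    print(answer_or_abstain(sampled, difficulty=0.9))
```

With these illustrative defaults, the same set of sampled answers is accepted for an easy question and declined for a hard one, which is the behaviour the abstract describes: each query is either answered or explicitly refused.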
