What type of study is this?

September 5, 2025Open Access

ACR: Adaptive Confidence Re-Scoring for Reliable Answer Selection Among Multiple Candidates

Key Points

ACR improves answer accuracy while significantly reducing inference cost in large language models.
It reduces the number of inference calls by up to 95% compared to existing verification methods.
The method enhances inference efficiency, yielding accuracy gains per inference call of 2× to 17×.
ACR addresses vulnerabilities in LLMs by adaptively evaluating candidate answers for more reliable selection.

Abstract

With the improved reasoning capabilities of large language models (LLMs), their applications have rapidly expanded across a wide range of tasks. In recent question answering tasks, performance gains have been achieved through Self-Consistency, where LLMs generate multiple reasoning paths and determine the final answer via majority voting. However, this approach can fail when the correct answer is generated but does not appear frequently enough to be selected, highlighting its vulnerability to inconsistent generations. To address this, we propose Adaptive Confidence Re-scoring (ACR)—a method that adaptively evaluates and re-scores candidate answers to select the most trustworthy one when LLMs fail to generate consistent reasoning. Experiments on arithmetic and logical reasoning benchmarks show that ACR maintains or improves answer accuracy while significantly reducing inference cost. Compared to existing verification methods such as FOBAR, ACR reduces the number of inference calls by up to 95%, while improving inference efficiency—measured as accuracy gain per inference call—by a factor of 2× to 17×, depending on the dataset and model.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Eunhye Jeong

Yong Suk Choi

Journals

Applied Sciences

Actions

Institutions

Hanyang University

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

ACR: Adaptive Confidence Re-Scoring for Reliable Answer Selection Among Multiple Candidates

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study