Key points are not available for this paper at this time.
Generation of plausible yet incorrect factual information, termed hallucination, is an unsolved issue in large language models.We study the ability of language models to deliberate on the responses they give in order to correct their mistakes.We develop the Chain-of-Verification (COVE) method whereby the model first (i) drafts an initial response; then (ii) plans verification questions to fact-check its draft; (iii) answers those questions independently so the answers are not biased by other responses; and (iv) generates its final verified response.In experiments, we show COVE decreases hallucinations across a variety of tasks, from list-based questions from Wikidata, closed book Multi-SpanQA and longform text generation.
Building similarity graph...
Analyzing shared references across papers
Loading...
Shehzaad Dhuliawala
Mojtaba Komeili
Jing Xu
Building similarity graph...
Analyzing shared references across papers
Loading...
Dhuliawala et al. (Mon,) studied this question.
www.synapsesocial.com/papers/69d730cf0420a49c9848f364 — DOI: https://doi.org/10.18653/v1/2024.findings-acl.212