What question did this study set out to answer?

April 17, 2026

AGENT-CQ: Automatic Generation and Evaluation of Clarifying Questions for Conversational Search with Large Language Models

Key Points

This research aims to enhance the generation and evaluation of clarifying questions in conversational search systems.
Developed AGENT-CQ framework for generating clarifying questions using LLMs.
Introduced CrowdLLM for evaluating user responses with diverse evaluator personas.
Conducted experiments in both open-domain conversational search and regulatory question-answering settings.
Temperature-variation prompting produced higher quality clarifying questions than baseline prompting.
LLM-generated questions improved downstream retrieval performance compared to human-authored questions.

Abstract

Clarifying questions enable conversational search (CS) systems to resolve underspecified queries by eliciting missing information from users. However, how prompting strategies shape the quality of clarifying questions and how such questions should be evaluated at scale remains understudied. We present AGENT-CQ ( A utomatic GEN eration and evalua T ion of C larifying Q uestions), a framework for systematically generating and evaluating clarifying questions and simulated user responses using large language models (LLMs). To support scalable and multi-perspective evaluation, we introduce CrowdLLM , an LLM-based evaluation paradigm that simulates diverse annotator judgments through distinct evaluator personas. Our experiments span both open-domain conversational search and a regulatory question-answering setting, allowing us to examine the extent to which clarification strategies generalize across domains with different interaction constraints. Across settings, temperature-variation prompting leads to higher quality clarifying questions than baseline prompting and human-authored questions on several dimensions of the task. In addition, LLM-generated clarifying questions lead to improved downstream retrieval performance than human-authored questions in open-domain search. Together, AGENT-CQ and CrowdLLM provide a practical framework for studying and improving clarification strategies in conversational IR systems.

AIに質問

Bookmark

AIに質問

Bookmark

AGENT-CQ: Automatic Generation and Evaluation of Clarifying Questions for Conversational Search with Large Language Models

Key Points

Abstract

Cite This Study