March 8, 2024Open Access

Enhancing Human Annotation: Leveraging Large Language Models and Efficient Batch Processing

Key Points

Key points are not available for this paper at this time.

Abstract

Large language models (LLMs) are capable of assessing document and query characteristics, including relevance, and are now being used for a variety of different classification labeling tasks as well. This study explores how to use LLMs to classify an information need, often represented as a user query. In particular, our goal is to classify the cognitive complexity of the search task for a given "backstory". Using 180 TREC topics and backstories, we show that GPT-based LLMs agree with human experts as much as other human experts. We also show that batching and ordering can significantly impact the accuracy of GPT-3.5, but rarely alter the quality of GPT-4 predictions. This study provides insights into the efficacy of large language models for annotation tasks normally completed by humans, and offers recommendations for other similar applications.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Zendel et al. (Fri,) studied this question.

www.synapsesocial.com/papers/68e74e1db6db6435876c6fc2 — DOI: https://doi.org/10.1145/3627508.3638322

Authors

Oleg Zendel

J. Shane Culpepper

Falk Scholer

Actions

Institutions

The University of Queensland

RMIT University

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Enhancing Human Annotation: Leveraging Large Language Models and Efficient Batch Processing

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Also consider