March 3, 2026Open Access

Challenges in AI: Indexing LGBTQ+ fiction

Key Points

ChatGPT often fails to accurately identify LGBTQ+ themes in fiction, producing broader and irrelevant terms.
Precision and recall scores in the evaluation of AI-generated index terms were notably low, indicating limitations.
This analysis utilized a sample from the Queerlit database, comparing AI-generated terms with expert-assigned terms.
Careful evaluation and potential collaboration with experts in information science are essential for improvement.

Abstract

Efforts to automate cataloging in libraries have progressed significantly, with AI tools like ChatGPT emerging as potential aids. However, automating the subject indexing of LGBTQ+ fiction poses unique challenges. Traditional fiction indexing often overlooks specific themes and characters sought by users, particularly those related to LGBTQ+ identity and issues. Generative Pre-trained Transformers (GPTs) like ChatGPT offer promise in producing detailed subject terms but face biases and inaccuracies. This study explores ChatGPT’s efficacy in generating subject index terms for LGBTQ+ fiction by comparing AI-generated terms with those assigned by professional information specialists in the Queerlit database. The Queerlit database, which uses the QLIT thesaurus for LGBTQ+ terms and general Swedish controlled vocabularies, provides a gold standard for this comparison. Using a sample of 20 full-text works and 20 metadata records from the Queerlit database, ChatGPT was tasked with generating subject index terms. The evaluation revealed that ChatGPT struggled to identify any LGBTQ+ themes, often producing broader and irrelevant terms, even when the index terms were given as input in the metadata. The precision and recall scores were low, highlighting AI’s limitations in this context. The study underscores the need for careful evaluation of AI tools in library and information science and professional practice, particularly for indexing fiction and minority representation. Future research should involve collaboration with both information and subject experts to examine the potential of automatically generated terms that were not previously assigned, as well as to explore the possibility of refining automated indexing methods, and to address inherent biases in AI models.

Bookmark

View Full Paper

Cite This Study

Koraljka Golub (Thu,) studied this question.

synapsesocial.com/papers/69a768a4badf0bb9e87e56d0 https://doi.org/https://doi.org/10.5617/dhnbpub.13029

Bookmark

View Full Paper