What question did this study set out to answer?

To evaluate how well large language models can recall basic facts about journal articles.

January 23, 2026

Do large language models know basic facts about journal articles?

Key Points

To evaluate how well large language models can recall basic facts about journal articles.
Asked ChatGPT 4o-mini 4 questions about each of 64,055 journal articles (excluding reviews) from 2021.
Also assessed uncited and highly cited articles with ChatGPT 4.1 and 5 open weight LLMs.
Evaluated the LLMs' correctness in identifying first author affiliation, publishing journal, and publication year.
Answers were mostly incorrect, even for the most cited articles of the year.
ChatGPT 4o-mini was 42% correct for articles in Physical Review B.
Accuracy in identifying the publishing journal and publication year was low for the majority of articles.

Abstract

Purpose: There is an increase in the use of large language models (LLMs) in information science, including evaluating academic journal articles. Despite this, it is unclear whether they “know” about articles in the sense of being able to answer simple questions about individual papers without web searches.

Design/methodology/approach: In this study, 4 questions were asked of ChatGPT 4o-mini about 64,055 academic journal articles (excluding reviews) from 2021, identified by their titles and abstracts, with uncited and highly cited articles also assessed by ChatGPT 4.1 and 5 open weight LLMs.

Findings: The results were mostly incorrect, even for the most cited articles from that year. In particular, ChatGPT 4o-mini and the open weight LLMs had almost no knowledge of an article’s first author affiliation, rarely knew the publishing journal and usually guessed the publication year wrong, although ChatGPT 4o-mini was 42% correct for Physical Review B. Even ChatGPT 4.1 could only identify a small majority of the journals for the top cited papers of the year.

Practical implications: Smaller LLMs’ lack of basic knowledge about articles suggests that when they are asked to evaluate them without web searches, they will rarely cheat by eliciting citation information or journal reputation but will instead answer based on the article text because they may not associate online criticisms with individual articles.

Originality/value: This is the first investigation of the ability of LLMs to recall basic facts about journal articles.
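The correctness figures above are shares of answers that match the known metadata, reported per journal in at least one case. This page does not describe how the study matched answers, so the following is only a minimal sketch under an assumed rule: an answer counts as correct when it equals the true journal name after lowercasing and whitespace cleanup. The function names, record fields, and toy records are all hypothetical and are not taken from the study.

```python
from collections import defaultdict


def normalize(text):
    """Lowercase and collapse whitespace so trivial formatting differences are ignored."""
    return " ".join(str(text).lower().split())


def accuracy_by_journal(records):
    """Return {journal: share of records whose answer matches the true journal name}."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for rec in records:
        total[rec["journal"]] += 1
        # Assumed matching rule: exact match after normalization.
        if normalize(rec["answer"]) == normalize(rec["journal"]):
            correct[rec["journal"]] += 1
    return {journal: correct[journal] / total[journal] for journal in total}


# Made-up toy records for illustration only; not data from the study.
toy = [
    {"journal": "Journal A", "answer": "journal a"},
    {"journal": "Journal A", "answer": "Journal B"},
    {"journal": "Journal B", "answer": "Unknown"},
]
print(accuracy_by_journal(toy))
```

Exact matching is the strictest choice. A real evaluation would probably also need to handle journal abbreviations and variant titles, which this sketch ignores.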

Cite This Study

M. Thelwall studied this question.

synapsesocial.com/papers/69730ed4c8125b09b0d1ea5e https://doi.org/10.1108/jd-11-2025-0330
