Key points are not available for this paper at this time.
Large language models (LLMs) are rapidly changing academic research, raising questions of who is adopting these tools and under what conditions. This article analyzes full texts of 7.3 million journal articles published from 2020–2025 by four major publishers (Elsevier, Frontiers, MDPI, and PLoS) to track the prevalence of LLM-associated language and identify social and institutional correlates of adoption. A corpus of 228 focal words exhibiting sharp post-2022 frequency increases consistent with LLM output was developed; articles were scored on their rate of focal word usage. By 2025, an estimated 57% of published articles exhibited evidence of LLM influence, up from 12% in 2023. Among articles exhibiting LLM-influenced text, there is substantial heterogeneity, ranging from subtle linguistic influence to articles mostly or entirely LLM-generated. Difference-in-differences models reveal that LLM-associated language varies markedly across regions, institutional ranks, publishers, disciplines, and journal tiers. Economic development and proximity to English as a primary language are key predictors of regional variation. Lower-ranked institutions exhibit higher rates than elite universities, young for-profit publishers show elevated rates vis-à-vis competitors, and academic fields differ widely in adoption. LLM adoption in academic writing is pervasive but socially stratified. As models grow more powerful and their use becomes further entrenched in academic research, understanding social dynamics of adoption will be essential for governing the evolving relationship between AI and academic knowledge production.
Kyle Siler (Fri,) studied this question.