What question did this study set out to answer?

The aim is to examine citation inaccuracies in AI-generated medical writing and their implications for scientific integrity.

March 21, 2026Open Access

Citation Inaccuracies and the Need for Multi-Level Oversight in AI-Assisted Medical Writing

Read Full Paperexternally

Key Points

The aim is to examine citation inaccuracies in AI-generated medical writing and their implications for scientific integrity.
Reviewed literature detailing citation errors in AI-generated content.
Analyzed the nature and persistence of citation inaccuracies in medical contexts.
Considered potential safeguards for improving citation accuracy.
Consistent reports of citation inaccuracies, including fabricated references and incorrect bibliographic details.
Documentation of missing authors, journal titles, publication years, or digital object identifiers.
Highlighting the need for verification due to limitations in the reliability of language models.

Abstract

Generative artificial intelligence (AI)-based large language models (LLMs) are increasingly being used in medical writing to improve efficiency and broaden access to knowledge. However, concerns have emerged regarding the accuracy of the citations they generate. This review discusses the issue of citation inaccuracies in AI-assisted medical writing and its implications for scientific reliability and accountability in academic medicine. Published literature describing citation errors in AI-generated content, particularly in medical and academic contexts, was examined to understand the nature and persistence of this problem and to consider potential safeguards. Reports consistently describe citation inaccuracies, including fabricated references, incorrect bibliographic details, and incomplete source information such as missing authors, journal titles, publication years, or digital object identifiers. Although these tools continue to evolve, such errors remain reported and highlight limitations in their reliability. While LLMs offer clear benefits in supporting medical writing, their outputs require careful verification. As developers continue to address these challenges, responsible use will depend on continued human oversight, improved transparency, greater user awareness, and institutional and policy-level guidance to ensure accurate and trustworthy use of generative AI in medical writing.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

V. Rajaratnam

Usama Farghaly Omar

Kristen Kee

Journals

Standards

Actions

Institutions

National University of Singapore

Nanyang Technological University

Khoo Teck Puat Hospital

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Citation Inaccuracies and the Need for Multi-Level Oversight in AI-Assisted Medical Writing

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider