Generative artificial intelligence (AI)-based large language models (LLMs) are increasingly being used in medical writing to improve efficiency and broaden access to knowledge. However, concerns have emerged regarding the accuracy of the citations they generate. This review discusses the issue of citation inaccuracies in AI-assisted medical writing and its implications for scientific reliability and accountability in academic medicine. Published literature describing citation errors in AI-generated content, particularly in medical and academic contexts, was examined to understand the nature and persistence of this problem and to consider potential safeguards. Reports consistently describe citation inaccuracies, including fabricated references, incorrect bibliographic details, and incomplete source information such as missing authors, journal titles, publication years, or digital object identifiers. Although these tools continue to evolve, such errors remain reported and highlight limitations in their reliability. While LLMs offer clear benefits in supporting medical writing, their outputs require careful verification. As developers continue to address these challenges, responsible use will depend on continued human oversight, improved transparency, greater user awareness, and institutional and policy-level guidance to ensure accurate and trustworthy use of generative AI in medical writing.
Building similarity graph...
Analyzing shared references across papers
Loading...
V. Rajaratnam
Usama Farghaly Omar
Kristen Kee
Standards
National University of Singapore
Nanyang Technological University
Khoo Teck Puat Hospital
Building similarity graph...
Analyzing shared references across papers
Loading...
Rajaratnam et al. (Fri,) studied this question.
www.synapsesocial.com/papers/69be38906e48c4981c679074 — DOI: https://doi.org/10.3390/standards6010010
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: