Key points are not available for this paper at this time.
Unlabelled: Advances in artificial intelligence (AI) promise to reshape the landscape of scientific inquiry. Amidst all these, OpenAI's latest tool, Deep Research, stands out for its potential to revolutionize how researchers engage with the literature. However, this leap forward presents a paradox; while AI-generated reviews offer speed and accessibility with minimal effort, they raise fundamental concerns about citation integrity, critical appraisal, and the erosion of deep scientific thinking. These concerns are particularly problematic in the context of biomedical research, where evidence quality may influence clinical practice and decision-making. In this piece, we present an empirical evaluation of Deep Research and explore both its remarkable capabilities and inherent limitations. Through structured experimentation, we assess its effectiveness in synthesizing literature, highlight key shortcomings, and reflect on the broader implications of these tools for research training, and the integrity of evidence-based practice. With AI tools increasingly blurring the lines between knowledge generation and critical inquiry, we argue that while AI democratizes access to knowledge, wisdom remains distinctly human.
Ong et al. (Tue,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: