What question did this study set out to answer?

This review examines the readiness of artificial intelligence agents for use in medicine and biomedical research, focusing on their functionalities and evidence requirements.

June 19, 2026Open Access

Are artificial intelligence agents ready for medicine and biomedical research? A narrative review

Key Points

This review examines the readiness of artificial intelligence agents for use in medicine and biomedical research, focusing on their functionalities and evidence requirements.
Narrative review of current AI systems and their applications in clinical and biomedical settings.
Assessment of AI capabilities such as clinical calculations, risk prediction, and oncological support.
Evaluation of existing literature and clinical tools to identify strengths and shortcomings of AI in medical contexts.
AI agents show promise in applications like clinical calculations and risk assessments, but evidence is inconsistent across studies.
Most reliable applications are those that use validated tools and human oversight, while broader claims often rely on simulated environments.
AI is expected to enhance workflows in biomedical research but necessitates ongoing human judgment and accountability.

Abstract

Artificial intelligence (AI) agents extend large language models from single-turn text generation to systems that pursue goals through planning, retrieval, tool use, code execution, memory, feedback, and role coordination. In medicine and biomedical research, this shift is creating early systems for clinical calculations, risk prediction, oncology decision support, omics analysis, hypothesis development, laboratory automation, and research writing. However, the evidence remains uneven. Clinical examples are the most defensible when agents use validated calculators, curated clinical tools, or guideline-grounded modules under human oversight. Biomedical discovery systems exhibit broader workflow capabilities; however, many claims still rely on preprints, narrow benchmarks, simulated settings, or domain-specific demonstrations. For clinicians and biomedical researchers, the immediate challenge is not to decide whether agents will replace experts but to understand what tasks can be delegated, what evidence is needed, and what human judgment must be preserved. This narrative review explains what makes an AI system agentic, summarizes its representative clinical and discovery applications, and outlines safeguards for evaluation, reproducibility, and oversight. Biomedical readers should expect AI agents to enter medicine and research first as constrained, auditable workflow infrastructures. These infrastructures may reorganize biomedical work; however, accountability should remain with the clinicians and investigators.

Read Full Paperexternally

KI fragen

Bookmark

View Full Paper