What question did this study set out to answer?

This study aims to systematically map and characterize the literature surrounding DeepSeek's applications in medicine.

June 18, 2026 · Open Access

Applications of DeepSeek in Medicine: Bibliometric Analysis and Scoping Review

Key Points

This study aims to systematically map and characterize the literature surrounding DeepSeek's applications in medicine.
The authors conducted a systematic search of PubMed, Web of Science, and Scopus following the PRISMA-ScR guidelines.
They performed a bibliometric analysis of 371 papers to quantify publication trends and research themes.
They synthesized the applications and limitations reported in 353 original articles through a scoping review.
Publication output increased progressively, with China (n=163), Turkey (n=52), and the USA (n=48) as leading contributors.
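DeepSeek has been evaluated in five main domains (patient education, clinical decision support, medical education, workflow optimization, and medical research), with variable performance relative to proprietary models.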
66.6% (235/353) of original articles were classified as low-quality evidence, highlighting a need for prospective validation.
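
The reported proportions can be checked directly from the counts given on this page. The snippet below is a small arithmetic sketch; the helper name `percent` is an illustrative assumption, and rounding to one decimal place is assumed because that is how the percentages are printed.

```python
def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)

# Counts as reported in the summary (353 original articles in total).
low_quality = percent(235, 353)   # articles classified as low-quality evidence
high_quality = percent(24, 353)   # articles classified as high-quality evidence

print(f"Low quality:  {low_quality}%")
print(f"High quality: {high_quality}%")
```

Both values agree with the figures stated in the abstract (66.6% and 6.8%).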

Abstract

Background: The integration of large language models (LLMs) into medicine has reshaped health care delivery, education, and research. Although proprietary models face challenges such as data privacy, regulation, and adaptability, DeepSeek, an open-source LLM, has emerged as a customizable and cost-effective alternative with significant potential for clinical and operational applications. However, the rapid expansion of research in this area necessitates a systematic mapping of its landscape, applications, and challenges.

Objective: This study combines bibliometric analysis with a scoping review to systematically map and characterize the literature on DeepSeek's medical applications. The aims were to (1) analyze publication trends, leading contributors, and research themes and (2) identify primary application domains, strengths, limitations, and future directions.

Methods: Following the framework by Arksey and O'Malley and the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses Extension for Scoping Reviews) guidelines, a systematic search was conducted using PubMed, Web of Science, and Scopus from January 20, 2025, to November 30, 2025. Bibliometric analysis was then used to quantify publication trends, productivity, and research themes across 371 papers. The scoping review thematically synthesized the applications, strengths, and limitations of 353 original articles.

Results: The publication output showed a progressive increase, with China (n=163), Turkey (n=52), and the United States (n=48) as leading contributors. Keyword co-occurrence analysis formed 7 clusters; the 3 most frequent keywords were "large language model," "artificial intelligence," and "patient education." DeepSeek has shown promising yet preliminary performance across multiple domains, including patient education, clinical decision support, medical education, workflow optimization, and medical research. The evidence base remains predominantly low in quality, with 66.6% (235/353) of original articles classified as low-quality evidence, consisting largely of unvalidated benchmarking, simulated cases, and single-center retrospective analyses. Only 6.8% (24/353) of studies met the criteria to be considered high quality, and prospective randomized trials assessing patient-relevant outcomes were notably absent.

Conclusions: Publications on DeepSeek's medical applications increased progressively from January 2025 through November 2025, with China, Turkey, and the United States as the leading contributors. The scoping review found that DeepSeek has been evaluated across 5 domains (patient education, clinical decision support, medical education, workflow optimization, and research), with variable but often competitive performance relative to proprietary models. Strengths included readability, diagnostic accuracy in select specialties, cost-efficiency, and local deployability. Limitations included inconsistent cross-specialty performance, hallucinations, ethical concerns, data privacy issues, and regulatory gaps. The evidence base is predominantly low-quality and simulation-based, with few prospective trials or randomized controlled trials. These findings indicate that DeepSeek's clinical readiness varies, and future research should address prospective validation, multimodal capabilities, bias mitigation, human oversight, and equitable access.
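
The abstract reports a keyword co-occurrence analysis but does not say which tool or counting scheme was used. As a rough illustration only, the Python sketch below shows one common way such counts are built: every unordered pair of keywords appearing on the same paper is tallied once. The function name `keyword_cooccurrence` and the toy records are hypothetical and are not data from the study; clustering the resulting pairs would be a separate step that is not shown.

```python
from collections import Counter
from itertools import combinations


def keyword_cooccurrence(records):
    """Count keyword frequencies and unordered keyword pairs per paper.

    `records` is an iterable of keyword lists, one list per paper.
    """
    keyword_counts = Counter()
    pair_counts = Counter()
    for keywords in records:
        # Normalize case and whitespace, and drop duplicates within a paper.
        unique = sorted({k.strip().lower() for k in keywords if k.strip()})
        keyword_counts.update(unique)
        # Sorted input means each pair is always stored in the same order.
        pair_counts.update(combinations(unique, 2))
    return keyword_counts, pair_counts


# Hypothetical toy records for illustration, NOT data from the study.
records = [
    ["Large Language Model", "Artificial Intelligence", "Patient Education"],
    ["large language model", "artificial intelligence"],
    ["Large language model", "patient education", "readability"],
]

keyword_counts, pair_counts = keyword_cooccurrence(records)
print(keyword_counts.most_common(3))
print(pair_counts.most_common(3))
```

The pair counts can serve as edge weights in a keyword network, which is the usual input to the clustering step in bibliometric software.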

Cite This Study

Zhang et al. studied this question.

synapsesocial.com/papers/6a338de8630953a74978ea4c https://doi.org/10.2196/93354
