ABSTRACT Recently, large language models have gained momentum in several areas, including medical text summarization and clinical decision support in healthcare. In this systematic review, we focus on how LLMs are used for summarization and explainability, and on how they improve the precision of clinical decisions. Initially, by screening eight distinct databases, we identified 1601 studies using the PRISMA protocol. Following strict qualifying criteria, 61 studies were selected for the final review. Our results show that decoder‐only architectures dominate the field (60.7% of studies), with zero‐shot prompting emerging as the predominant approach (55.7%), while 24.6% of studies employed fine‐tuning. General clinical decision support (26.2%) and diagnostic assistance (19.7%) constitute the two most widely used clinical applications across multiple clinical specialties. Identified in 26.2% of studies, the intersection of summarization and explainability becomes a key focus area, with fine‐tuned models obtaining 38.6% higher average ROUGE‐1 scores compared to zero‐shot approaches, and studies including robust explainability features report 27.3% higher clinician acceptance rates. Despite great potential, major obstacles still exist, including model hallucinations (63.9% of studies), minimal workflow integration (only 14.8% of studies), and inadequate attention to regulatory paths (8.2%). Although LLMs offer significant potential for transforming clinical decision support via improved information handling and clear reasoning, realizing these goals requires overcoming essential challenges in clinical validation, system integration, and ethical oversight. This review aims to provide an extensive framework for understanding the current capabilities, limitations, and prospects of LLM‐based explainable summarization in clinical settings, guiding future research and practical deployment. This article is categorized under: Technologies > Artificial Intelligence Fundamental Concepts of Data and Knowledge > Explainable AI Application Areas > Health Care
Building similarity graph...
Analyzing shared references across papers
Loading...
Aleka Melese Ayalew
University of Oulu
Md Rabiul Hasan
University of Oulu
Tapio Seppänen
University of Oulu
Wiley Interdisciplinary Reviews Data Mining and Knowledge Discovery
University of Oulu
Building similarity graph...
Analyzing shared references across papers
Loading...
Ayalew et al. (Tue,) studied this question.
synapsesocial.com/papers/69f2a4b78c0f03fd67763c88 — DOI: https://doi.org/10.1002/widm.70089
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: