What type of study is this?

This is a systematic review.

What question did this study set out to answer?

This review examines the use of large language models for explainable summarization in clinical decision support.

April 30, 2026 · Open Access

Large Language Models for Explainable Medical Text Summarization: A Systematic Literature Review

Key Points

The authors conducted a systematic literature review following the PRISMA protocol, screening eight databases.
The search identified 1601 studies, which strict qualifying criteria narrowed to 61.
Decoder-only architectures are used in 60.7% of the included studies.
Fine-tuned models obtain 38.6% higher average ROUGE-1 scores than zero-shot approaches.
General clinical decision support and diagnostic assistance are the most common applications, at 26.2% and 19.7% of studies respectively.
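
The ROUGE-1 metric cited in the key points measures unigram overlap between a generated summary and a reference summary. The sketch below is an illustration only, not the review's evaluation code: the function name `rouge1` and the lowercase whitespace tokenization are assumptions made here, and published implementations typically add stemming and more careful tokenization.

```python
from collections import Counter


def rouge1(candidate: str, reference: str) -> dict:
    """Return ROUGE-1 precision, recall, and F1 for two texts.

    Assumption: tokens are lowercase, whitespace-separated words.
    """
    cand_counts = Counter(candidate.lower().split())
    ref_counts = Counter(reference.lower().split())

    # Multiset intersection clips each word's matches to the smaller count.
    overlap = sum((cand_counts & ref_counts).values())

    cand_total = sum(cand_counts.values())
    ref_total = sum(ref_counts.values())

    precision = overlap / cand_total if cand_total else 0.0
    recall = overlap / ref_total if ref_total else 0.0
    if overlap == 0:
        f1 = 0.0
    else:
        f1 = 2 * precision * recall / (precision + recall)

    return {"precision": precision, "recall": recall, "f1": f1}
```

For example, `rouge1("patient has chest pain", "the patient reports chest pain")` shares three unigrams, giving precision 0.75, recall 0.60, and F1 of about 0.67.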

Abstract

Recently, large language models have gained momentum in several areas, including medical text summarization and clinical decision support in healthcare. In this systematic review, we focus on how LLMs are used for summarization and explainability, and on how they improve the precision of clinical decisions. Initially, by screening eight distinct databases, we identified 1601 studies using the PRISMA protocol. Following strict qualifying criteria, 61 studies were selected for the final review. Our results show that decoder‐only architectures dominate the field (60.7% of studies), with zero‐shot prompting emerging as the predominant approach (55.7%), while 24.6% of studies employed fine‐tuning. General clinical decision support (26.2%) and diagnostic assistance (19.7%) constitute the two most widely used clinical applications across multiple clinical specialties. Identified in 26.2% of studies, the intersection of summarization and explainability becomes a key focus area, with fine‐tuned models obtaining 38.6% higher average ROUGE‐1 scores compared to zero‐shot approaches, and studies including robust explainability features report 27.3% higher clinician acceptance rates. Despite great potential, major obstacles still exist, including model hallucinations (63.9% of studies), minimal workflow integration (only 14.8% of studies), and inadequate attention to regulatory paths (8.2%). Although LLMs offer significant potential for transforming clinical decision support via improved information handling and clear reasoning, realizing these goals requires overcoming essential challenges in clinical validation, system integration, and ethical oversight. This review aims to provide an extensive framework for understanding the current capabilities, limitations, and prospects of LLM‐based explainable summarization in clinical settings, guiding future research and practical deployment.
This article is categorized under: Technologies > Artificial Intelligence; Fundamental Concepts of Data and Knowledge > Explainable AI; Application Areas > Health Care
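
The abstract reports shares of studies as percentages. Assuming every percentage is taken over the 61 included studies (an assumption; the abstract does not state each denominator), the implied study counts can be recovered by simple arithmetic. The sketch below is illustrative, and its names (`TOTAL_STUDIES`, `reported_pct`, `implied_count`) are hypothetical.

```python
TOTAL_STUDIES = 61  # studies included in the final review, per the abstract

# Percentages as reported in the abstract; labels are shorthand chosen here.
reported_pct = {
    "decoder-only architectures": 60.7,
    "zero-shot prompting": 55.7,
    "fine-tuning": 24.6,
    "general clinical decision support": 26.2,
    "diagnostic assistance": 19.7,
    "summarization and explainability": 26.2,
    "model hallucinations": 63.9,
    "workflow integration": 14.8,
    "regulatory paths": 8.2,
}


def implied_count(pct: float, total: int = TOTAL_STUDIES) -> int:
    """Nearest whole number of studies implied by a percentage of `total`."""
    return round(pct / 100 * total)


# Inferred counts, valid only under the shared-denominator assumption.
implied = {label: implied_count(pct) for label, pct in reported_pct.items()}

for label, count in implied.items():
    # Recompute the percentage from the count as a consistency check.
    check = round(100 * count / TOTAL_STUDIES, 1)
    print(f"{label}: {count}/{TOTAL_STUDIES} studies ({check}%)")
```

Under that assumption, each inferred count rounds back to the percentage reported in the abstract, which suggests the figures are mutually consistent with a denominator of 61. The counts themselves are inferences, not numbers stated by the authors.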

Authors

Aleka Melese Ayalew

University of Oulu

Md Rabiul Hasan

University of Oulu

Tapio Seppänen

University of Oulu

Journals

Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery

Institutions

University of Oulu
