What question did this study set out to answer?

The goal is to develop a system that generates accurate radiology reports from chest X-rays using a RAG framework.

March 23, 2026 · Open Access

From General Doctor to Specialist: Enhancing Radiology Report Generation with Retrieval-Augmented Generation

Key Points

Evaluated similarity metrics for retrieval
Applied negative sampling in the RAG pipeline
Utilized TorchXRayVision model for finding predictions
Built vector indices over MIMIC-CXR training reports
Conducted 16 controlled experiments comparing RAG to non-retrieval baseline
RAG outperformed non-retrieval baseline across clinical and linguistic metrics
Improved performance measured by CheXbert and RadGraph scores
Explicit negative sampling degraded performance of the LLM
Demonstrated that similar reports guide better report generation

Abstract

We propose a multimodal Retrieval-Augmented Generation (RAG) framework for generating clinically accurate radiology reports from chest X-rays. Our study systematically evaluates similarity metrics for retrieval and the impact of negative sampling within the RAG pipeline. The approach extracts predicted findings and scores from a TorchXRayVision model, builds vector indices over MIMIC-CXR training reports, retrieves relevant neighbors using multiple strategies, and generates reports via a Large Language Model. In 16 controlled experiments, RAG consistently outperformed a non-retrieval baseline across both clinical (CheXbert and RadGraph) and linguistic (BERTScore, BLEU, METEOR, ROUGE-L) metrics. Moreover, adding explicit negative sampling at the prompt level consistently degrades performance, indicating that dissimilar reports confuse the LLM rather than provide useful guidance. Conceptually, RAG grounds a general-purpose LLM with precise, case-specific exemplars, steering it toward the specialized phrasing and clinical judgment of an expert radiologist.
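The retrieve-then-generate flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the label set, score vectors, report texts, prompt wording, and every function name below are hypothetical, and the toy list stands in for a real vector index over MIMIC-CXR. The summary also does not say which similarity metrics were compared, so cosine similarity and negated Euclidean distance are used here only as two common choices.

```python
import math

# Hypothetical finding labels; a real classifier would expose many more.
LABELS = ["Cardiomegaly", "Effusion", "Pneumonia"]

# Toy "index": (finding-score vector, report text). All values are invented.
TRAIN_INDEX = [
    ([0.9, 0.1, 0.1], "Enlarged cardiac silhouette. No pleural effusion."),
    ([0.1, 0.8, 0.2], "Small left pleural effusion. Heart size is normal."),
    ([0.1, 0.1, 0.1], "No acute cardiopulmonary abnormality."),
]


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def negative_euclidean(a, b):
    """Negated Euclidean distance, so that larger still means more similar."""
    return -math.dist(a, b)


def retrieve(query, index, k=2, similarity=cosine_similarity):
    """Return the k reports whose vectors score highest against the query."""
    ranked = sorted(index, key=lambda item: similarity(query, item[0]), reverse=True)
    return [report for _, report in ranked[:k]]


def build_prompt(query, neighbors, negatives=()):
    """Assemble an LLM prompt from predicted findings and retrieved reports."""
    findings = ", ".join(f"{label}: {score:.2f}" for label, score in zip(LABELS, query))
    lines = [f"Predicted findings: {findings}", "Similar reports:"]
    lines += [f"- {report}" for report in neighbors]
    if negatives:  # optional prompt-level negative examples
        lines.append("Dissimilar reports (do not imitate):")
        lines += [f"- {report}" for report in negatives]
    lines.append("Write the radiology report for this study.")
    return "\n".join(lines)


query = [0.85, 0.15, 0.10]
print(build_prompt(query, retrieve(query, TRAIN_INDEX, k=2)))
```

Passing a different `similarity` function swaps the retrieval metric without touching the rest of the pipeline, which is one simple way to run the kind of controlled comparison the abstract describes. The optional `negatives` argument shows what prompt-level negative sampling could look like; the study reports that adding such examples degraded performance.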

Cite This Study

Zamaninejad et al. studied this question.

synapsesocial.com/papers/69c0df0bfddb9876e79c1508 https://doi.org/10.1016/j.procs.2026.01.095
