What question did this study set out to answer?

The research aims to recover hidden causality from radiology reports using multimodal small language models.

April 1, 2026Open Access

RAD-PHI3 at the NTCIR-18 HIDDEN-RAD: Hidden Causality Inclusion in Radiology Reports with Multimodal Small Language Models

Key Points

The research aims to recover hidden causality from radiology reports using multimodal small language models.
Participated in the Hidden-RAD Challenge for causality inclusion
Fine-tuned Rad-Phi-3.5 Vision-CXR model
Evaluated the effectiveness of various small language models in causal explanation generation
Included baselines from general domain and reasoning-specialized models
Found that fine-tuned domain-specific models improved causality recovery
Demonstrated effectiveness of multimodal inputs over language-only inputs
Comparison of model performance revealed distinct strengths among various approaches

Abstract

This paper presents the participation of the Microsoft Research RADPHI3 team in the Hidden-RAD Challenge: Hidden Causality Inclusion in Radiology Reports. The task aims to recover hidden causality from radiology reports, optionally accompanied by their corresponding frontal chest X-rays (CXRs). We fine-tune small language models, specifically Rad-Phi-3.5 Vision-CXR, to recover causality analysis in both language-only and multi-modal settings, given radiology reports and radiology images as inputs. We also include baselines of various models in the general domain, including models specifically tuned for reasoning tasks such as GPT-4o, LLaMA 3.3, Phi4, DeepSeek, OpenAI o1, OpenAI o1-mini, and OpenAI o3-mini3. Through these experiments, we evaluated the effectiveness of general-domain, reasoning-specialized, and fine-tuned domain-specific small language models in generating causal explanations given radiology reports and images optionally as inputs.

Read Full Paperexternally

Mark Helpful

Bookmark

Relay

View Full Paper