What question did this study set out to answer?

This study aims to improve Medical Visual Question Answering (MVQA) systems through data augmentation techniques and explainable AI.

January 6, 2026 · Open Access

Augmenting medical visual question answering with mixup, label smoothing, and layer-wise relevance propagation eXplainable Artificial Intelligence

Key Points

Implemented Mixup and Label Smoothing for dataset augmentation.
Evaluated performance using quantitative metrics, and used Layer-wise Relevance Propagation (LRP) for explainability.
Compared models trained on augmented datasets versus baseline datasets.
Models trained on augmented datasets showed improved accuracy and BLEU score compared with models trained on baseline datasets.
Layer-wise Relevance Propagation visualizations highlighted the key image and text regions that contribute to answer predictions, which improves model interpretability.
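
As a rough illustration of the two augmentation techniques named above, the sketch below shows label smoothing and mixup on toy vectors. It is a minimal sketch, not the authors' implementation: the function names, the values `epsilon=0.1` and `alpha=0.4`, and the toy feature vectors are all assumptions chosen for illustration.

```python
import random


def smooth_labels(one_hot, epsilon=0.1):
    """Label smoothing: move a fraction epsilon of the probability
    mass from the one-hot target to a uniform distribution."""
    k = len(one_hot)
    return [(1.0 - epsilon) * y + epsilon / k for y in one_hot]


def mixup(x_a, y_a, x_b, y_b, lam):
    """Mixup: convex combination of two examples and of their label
    vectors, using the same mixing coefficient lam for both."""
    x = [lam * a + (1.0 - lam) * b for a, b in zip(x_a, x_b)]
    y = [lam * a + (1.0 - lam) * b for a, b in zip(y_a, y_b)]
    return x, y


def sample_lambda(alpha=0.4, rng=random):
    """Draw the mixing coefficient from Beta(alpha, alpha).
    alpha=0.4 is an assumed value, not taken from the paper."""
    return rng.betavariate(alpha, alpha)


# Toy feature vectors standing in for image features (hypothetical values).
x_a, y_a = [1.0, 0.0], [1.0, 0.0, 0.0]
x_b, y_b = [0.0, 1.0], [0.0, 0.0, 1.0]

lam = sample_lambda()
x_mix, y_mix = mixup(x_a, smooth_labels(y_a), x_b, smooth_labels(y_b), lam)
```

Because both operations are convex combinations of probability vectors, the mixed target `y_mix` still sums to 1, so it can be used directly as a soft target in a cross-entropy loss.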

Abstract

The growing volume of medical data presents significant opportunities for advancing Medical Visual Question Answering (MVQA) systems. However, an imbalance in the number and distribution of image and Question–Answer (QA) pairs poses challenges for developing robust models. This study proposes improving existing MVQA datasets using data augmentation techniques, specifically Mixup and Label Smoothing, to address this issue. The performance of MVQA models trained on these enhanced datasets is evaluated using quantitative metrics, as well as Layer-wise Relevance Propagation for eXplainable artificial intelligence (LRP XAI). Results indicate that models trained on the augmented datasets outperform those trained on the baseline datasets, showing significant gains in both accuracy and Bilingual Evaluation Understudy (BLEU) score. Furthermore, LRP XAI visualizations highlight key image and text regions that contribute to accurate answer predictions, thereby improving model interpretability and trust. This work underscores the importance of dataset augmentation and explainability in advancing MVQA research and is available at https://doi.org/10.5281/zenodo.15910714.
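
To make the idea behind Layer-wise Relevance Propagation more concrete, the sketch below applies the commonly used epsilon rule to a single linear layer. It is a minimal sketch under stated assumptions, not the paper's LRP pipeline: the function name `lrp_epsilon_linear`, the stabilizer value `eps`, and the toy weights are all hypothetical, and the abstract does not say which LRP rule the authors used.

```python
def lrp_epsilon_linear(a, W, b, R_out, eps=1e-6):
    """Redistribute output relevance R_out onto the inputs of a linear
    layer z_k = sum_j a[j] * W[j][k] + b[k], using the LRP epsilon rule."""
    n_in, n_out = len(a), len(R_out)
    # Forward pass: pre-activations of the layer.
    z = [sum(a[j] * W[j][k] for j in range(n_in)) + b[k] for k in range(n_out)]
    # Relevance per unit of pre-activation; eps keeps the division stable
    # and carries the sign of z[k] so the denominator never shrinks toward zero.
    s = [R_out[k] / (z[k] + (eps if z[k] >= 0 else -eps)) for k in range(n_out)]
    # Each input receives relevance in proportion to its contribution a[j] * W[j][k].
    return [a[j] * sum(W[j][k] * s[k] for k in range(n_out)) for j in range(n_in)]


# Toy layer with two inputs and two outputs (hypothetical values).
a = [1.0, 2.0]
W = [[1.0, 0.0],
     [0.5, 1.0]]
b = [0.0, 0.0]
R_out = [2.0, 2.0]

R_in = lrp_epsilon_linear(a, W, b, R_out)
```

With zero bias and a small `eps`, the input relevances sum to approximately the same total as the output relevances. Applying the rule layer by layer, from the prediction back to the inputs, gives per-pixel or per-token scores of the kind that are typically rendered as heatmaps.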
