What type of study is this?

This is a Quantitative Study study.

October 18, 2025

MediVQA: Medical Visual Question Answering using VisualBERT

Key Points

The proposed system achieves robust performance in generating accurate answers to medical queries, enhancing diagnostics.
By combining BERT for textual and ResNet-101 for visual features, the system effectively addresses medical VQA challenges.
Utilizing datasets like VQA-Med-2019 and ImageCLEFmed_MEDVQA, the approach is tailored for medical-specific tasks.
This innovation may improve clinical workflows and contribute significantly to better patient care outcomes.

Abstract

Medical Visual Question Answering (VQA) aims to enhance automated medical image interpretation by enabling AI systems to generate accurate responses to clinical queries. This research focuses on advancing Medical VQA, with a particular emphasis on interpreting medical images to support diagnostic and decision-making processes. Using VQA-Med-2019, ImageCLEFmedMEDVQA and SLAKE 1. 0 datasets, we developed a specialized system tailored to the unique demands of medical VQA tasks. Our approach combines textual feature extraction via the BERT model with visual feature extraction using ResNet-101. These multimodal features are integrated and processed through the MLP-integrated VisualBERT model, optimized for classification tasks. The proposed system demonstrates robust performance in generating accurate answers to complex medical queries, showcasing its potential to enhance medical diagnostics, improve clinical workflows, and ultimately contribute to better patient care outcomes.

Bookmark

MediVQA: Medical Visual Question Answering using VisualBERT

Key Points

Abstract

Cite This Study

Also Consider

Also Consider