What type of study is this?

This is a Quantitative Study study.

October 16, 2025Open Access

Multimodal Fact Checking with Unified Visual, Textual, and Contextual Representations

Key Points

The proposed MultiCheck framework achieves a weighted F1 score of 0.84, indicating its high effectiveness in multimodal fact-checking.
Using dedicated encoders for text and images enables the system to effectively capture cross-modal relationships.
The contrastive learning objective promotes alignment between claim-evidence pairs in a shared latent space, enhancing verification accuracy.
Results from the Factify 2 dataset show that this approach can lead to scalable and interpretable fact-checking in complex real-world scenarios.

Abstract

The growing rate of multimodal misinformation, where claims are supported by both text and images, poses significant challenges to fact-checking systems that rely primarily on textual evidence. In this work, we have proposed a unified framework for fine-grained multimodal fact verification called "MultiCheck", designed to reason over structured textual and visual signals. Our architecture combines dedicated encoders for text and images with a fusion module that captures cross-modal relationships using element-wise interactions. A classification head then predicts the veracity of a claim, supported by a contrastive learning objective that encourages semantic alignment between claim-evidence pairs in a shared latent space. We evaluate our approach on the Factify 2 dataset, achieving a weighted F1 score of 0.84, substantially outperforming the baseline. These results highlight the effectiveness of explicit multimodal reasoning and demonstrate the potential of our approach for scalable and interpretable fact-checking in complex, real-world scenarios.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Aditya Kishore

Gaurav Kumar

Jasabanta Patro

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Multimodal Fact Checking with Unified Visual, Textual, and Contextual Representations

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider