What question did this study set out to answer?

This research aims to improve detection of multimodal forgeries in images created by generative models.

January 6, 2026 | Open Access

Multi-Domain Perception Transformer for Generalized Forgery Image Detection

Key Points

Proposed a multi-domain feature fusion Transformer network
Integrated spatial, frequency, and wavelet transform features
Introduced a cross-domain feature fusion module (CDAF) for detection
Model demonstrated superior detection performance on forged images
Exhibited enhanced robustness against various generative models

Abstract

With the rapid advancement of AI-generated content (AIGC) technology, synthetic images increasingly approach real photographs in resolution and semantic consistency. Traditional detection methods face challenges such as insufficient cross-modal generalization and difficulty identifying hidden generative traces. Existing solutions primarily design feature extractors for a single generative model and struggle with the complexity of multimodal forgeries. We therefore propose a multi-domain feature fusion Transformer network that integrates spatial, frequency, and wavelet transform features, and we introduce a cross-domain feature fusion module (CDAF) to detect subtle forgery traces in deepfake images. The model demonstrates superior detection performance on forged images generated by generative adversarial networks (GANs) and diffusion models while exhibiting enhanced robustness.
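To make the idea of combining spatial, frequency, and wavelet features concrete, here is a minimal NumPy sketch. It is not the authors' network or their CDAF module: the function names, the grid pooling, the single-level Haar transform, and the unlearned dot-product attention are all assumptions chosen only to illustrate how three domain views of one image might be computed and mixed.

```python
import numpy as np

def spatial_view(img):
    """Spatial domain: pixel intensities standardized to zero mean, unit variance."""
    return (img - img.mean()) / (img.std() + 1e-8)

def frequency_view(img):
    """Frequency domain: log-magnitude of the centered 2D FFT."""
    spectrum = np.fft.fftshift(np.fft.fft2(img))
    return np.log1p(np.abs(spectrum))

def haar_wavelet_view(img):
    """Wavelet domain: one-level orthonormal 2D Haar transform.

    Returns four half-resolution sub-bands: approximation, column-difference,
    row-difference, and diagonal detail. Assumes even height and width.
    """
    a, b = img[0::2, 0::2], img[0::2, 1::2]
    c, d = img[1::2, 0::2], img[1::2, 1::2]
    approx = (a + b + c + d) / 2.0
    col_diff = (a - b + c - d) / 2.0
    row_diff = (a + b - c - d) / 2.0
    diag = (a - b - c + d) / 2.0
    return np.stack([approx, col_diff, row_diff, diag])

def grid_pool(x, k):
    """Average-pool a 2D array onto a k x k grid and flatten to k*k values."""
    h, w = (x.shape[0] // k) * k, (x.shape[1] // k) * k
    blocks = x[:h, :w].reshape(k, h // k, k, w // k)
    return blocks.mean(axis=(1, 3)).ravel()

def fuse_domains(img):
    """Hypothetical fusion: one 16-value token per domain, mixed by attention."""
    tokens = np.stack([
        grid_pool(spatial_view(img), 4),
        grid_pool(frequency_view(img), 4),
        np.concatenate([grid_pool(band, 2) for band in haar_wavelet_view(img)]),
    ])
    # Standardize each token so no single domain dominates the dot products.
    tokens = (tokens - tokens.mean(axis=1, keepdims=True)) / (
        tokens.std(axis=1, keepdims=True) + 1e-8
    )
    # Scaled dot-product attention with identity projections (no learned weights).
    scores = tokens @ tokens.T / np.sqrt(tokens.shape[1])
    scores -= scores.max(axis=1, keepdims=True)
    weights = np.exp(scores)
    weights /= weights.sum(axis=1, keepdims=True)
    fused = (weights @ tokens).mean(axis=0)
    return fused, weights
```

Calling `fuse_domains` on a 2D grayscale array with even side lengths returns a 16-value fused descriptor and the 3 x 3 attention matrix between the domain tokens. In a trained detector, the pooling and identity projections would be replaced by learned layers, and the fused features would feed a classifier.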

Cite This Study

Man et al. (2026) studied this question.

synapsesocial.com/papers/695d855e3483e917927a4ba5 https://doi.org/10.3390/app16010533
