What question did this study set out to answer?

This research aims to create a deepfake detection framework that accurately identifies manipulated media and explains its predictions to users.

June 2, 2026 | Open Access

Interpretable and Trustworthy Deepfake Detection Framework: Leveraging Transfer-Learned CNNs with Grad-CAM and SHAP for Robust Media Forensics

Key points

Developed a framework using transfer learning from pre-trained CNN models Xception and ResNet50.
Trained on diverse public datasets with a robust preprocessing pipeline including face detection and alignment.
Incorporated Explainable AI techniques, Grad-CAM and SHAP, to visualize regions affecting predictions.
Designed to identify deepfakes with a very high level of confidence.
Produced heatmaps that highlight unnatural-looking facial regions, such as the eyes or mouth.
Aims to build user trust by giving a clear justification for each prediction.

Abstract

The rapid evolution of deepfake technologies has enabled AI-based techniques such as Generative Adversarial Networks (GANs) to create highly realistic yet entirely fabricated video and image content. These developments are exciting, but they have serious implications for many areas, including financial fraud, misinformation, identity theft, and the erosion of public trust. A significant weakness of most existing detection mechanisms is their lack of transparency: many operate as "black boxes" that flag media as fake but give no explanation for the decision, which makes them hard to trust. The purpose of this paper is to develop an advanced deepfake detection framework that identifies manipulated media with a very high level of confidence and provides the user with a clear, understandable justification for every prediction. The framework uses transfer learning from two pre-trained CNN models, Xception and ResNet50. It is trained on diverse, publicly available datasets and follows a structured preprocessing pipeline of face detection, alignment, resizing, and data augmentation to improve real-world robustness. The justification is provided by embedding Explainable AI (XAI) techniques, Grad-CAM and SHAP, which highlight the facial regions responsible for the model's prediction. For instance, a heatmap might show that the eye or mouth region looks unnatural, indicating that this is where the system bases its decision that the content is fake. By combining strong classification with both visual and numerical explanations, the system aims to build users' confidence in its potential applications to real-world digital forensics, content moderation, and media verification.
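
The abstract does not describe how the heatmaps are computed, so the following is only a minimal sketch of the standard Grad-CAM calculation, not the authors' implementation. It assumes the feature maps of the last convolutional layer and the gradients of the "fake" class score with respect to them have already been extracted from a model such as Xception or ResNet50. The function name `grad_cam` and the toy numbers are hypothetical, and plain Python lists are used so the example runs without a deep-learning library.

```python
def grad_cam(activations, gradients):
    """Core Grad-CAM step for one image and one target class.

    activations: K feature maps (each H x W) from the last conv layer.
    gradients:   d(class score)/d(activations), same shape.
    Returns an H x W heatmap scaled to [0, 1].
    """
    height = len(activations[0])
    width = len(activations[0][0])

    # Channel weight = global average of that channel's gradients.
    weights = [
        sum(sum(row) for row in grad_map) / (height * width)
        for grad_map in gradients
    ]

    # Weighted sum of feature maps, then ReLU to keep positive evidence only.
    cam = [
        [
            max(0.0, sum(w * fmap[i][j] for w, fmap in zip(weights, activations)))
            for j in range(width)
        ]
        for i in range(height)
    ]

    # Normalise so the strongest location is 1 (skipped if the map is all zeros).
    peak = max(max(row) for row in cam)
    if peak > 0:
        cam = [[v / peak for v in row] for row in cam]
    return cam


# Toy input (made-up values): 2 channels of 2x2 activations and gradients.
activations = [
    [[1.0, 0.0], [0.0, 2.0]],
    [[0.0, 3.0], [1.0, 0.0]],
]
gradients = [
    [[1.0, 1.0], [1.0, 1.0]],      # channel weight  1.0
    [[-1.0, -1.0], [-1.0, -1.0]],  # channel weight -1.0
]
heatmap = grad_cam(activations, gradients)
```

In a real pipeline the resulting map would be upsampled to the input resolution and overlaid on the aligned face crop, which is how a region such as the eyes or mouth would show up as the basis for a "fake" prediction.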

Cite This Study

Muzaffar et al. (Sun,) studied this question.

synapsesocial.com/papers/6a1e734530b38c64201b6797 https://doi.org/10.5281/zenodo.20470289
