What question did this study set out to answer?

The research aims to improve deepfake detection by developing a framework that balances accuracy and real-time efficiency.

April 12, 2026 | Open Access

Real time detection of deepfakes using the efficient Swin attention network with global and local facial features

Key Points

Designed the Efficient-Swin Attention Network (ESANet) for deepfake detection.
Employed EfficientNet-B0 for extracting local facial features.
Utilized Swin Transformer to capture global facial relationships.
Implemented a feature fusion mechanism to combine local and global features.
Evaluated performance on three benchmark datasets.
Achieved detection accuracies of 96.5%, 95.3%, and 94.8% on FaceForensics++, CelebV1, and CelebV2, respectively.
Maintained low inference latency suitable for real-time applications.
Confirmed robustness through cross-dataset evaluations.
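
The two quantities reported above, detection accuracy and per-frame inference latency, can be measured roughly as sketched below. This is an illustration under stated assumptions, not the authors' evaluation code: `dummy_detector`, the frame representation, and the 30 frames-per-second budget are hypothetical placeholders, and the summary does not say what latency threshold the study treats as real time.

```python
import time


def accuracy(predictions, labels):
    """Fraction of predictions that match the ground-truth labels."""
    if len(predictions) != len(labels) or not labels:
        raise ValueError("predictions and labels must be non-empty and equal in length")
    correct = sum(1 for p, y in zip(predictions, labels) if p == y)
    return correct / len(labels)


def mean_latency_ms(predict, frames, warmup=2):
    """Average wall-clock time per frame, in milliseconds."""
    if not frames:
        raise ValueError("frames must be non-empty")
    # Warm-up calls are excluded so one-off setup costs do not skew the mean.
    for frame in frames[:warmup]:
        predict(frame)
    start = time.perf_counter()
    for frame in frames:
        predict(frame)
    elapsed = time.perf_counter() - start
    return 1000.0 * elapsed / len(frames)


def dummy_detector(frame):
    """Hypothetical stand-in for a trained model: 1 = fake, 0 = real."""
    return 1 if sum(frame) / len(frame) > 0.5 else 0


# Toy "frames": short lists of pixel intensities in [0, 1].
frames = [[0.9, 0.8, 0.7], [0.1, 0.2, 0.3], [0.6, 0.6, 0.6], [0.4, 0.1, 0.2]]
labels = [1, 0, 0, 0]

predictions = [dummy_detector(f) for f in frames]
acc = accuracy(predictions, labels)
latency = mean_latency_ms(dummy_detector, frames)

# Assumed budget: 30 frames per second leaves about 33.3 ms per frame.
REALTIME_BUDGET_MS = 1000.0 / 30.0
meets_budget = latency <= REALTIME_BUDGET_MS
```

A cross-dataset evaluation would follow the same pattern: fit the detector on one benchmark, then call `accuracy` on predictions for frames drawn from a different one.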

Abstract

The rapid advancement of deepfake generation techniques has exposed critical limitations in existing deepfake detection methods, particularly their inability to simultaneously achieve high detection accuracy and real-time efficiency across diverse datasets. To address this gap, this study proposes the Efficient-Swin Attention Network (ESANet), a hybrid deep learning framework for real-time deepfake detection that jointly exploits local and global facial features. ESANet integrates EfficientNet-B0 for lightweight local feature extraction with the Swin Transformer to model hierarchical global facial relationships, and combines the two representations via an efficient feature fusion mechanism. The proposed framework is evaluated on three benchmark datasets, FaceForensics++, CelebV1, and CelebV2. Experimental results demonstrate detection accuracies of 96.5%, 95.3%, and 94.8%, respectively, while maintaining low inference latency suitable for real-time applications. Cross-dataset evaluations further confirm the robustness and generalisation capability of the proposed approach. By enabling accurate and efficient deepfake detection, this work helps strengthen trust and mitigate.
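
The fusion step described in the abstract can be illustrated with a dependency-free sketch. The summary does not specify how ESANet combines the two representations, so concatenation followed by a logistic classification head is an assumption made here for illustration. The feature values and weights are invented, and the short lists stand in for the outputs of an EfficientNet-B0 branch (local features) and a Swin Transformer branch (global features).

```python
import math


def fuse_features(local_feats, global_feats):
    """Concatenate local and global feature vectors (assumed fusion rule)."""
    return list(local_feats) + list(global_feats)


def fake_probability(fused, weights, bias=0.0):
    """Logistic head: sigmoid of a weighted sum of the fused features."""
    if len(fused) != len(weights):
        raise ValueError("weights must match the fused feature length")
    logit = sum(w * x for w, x in zip(weights, fused)) + bias
    # Numerically stable sigmoid for large positive or negative logits.
    if logit >= 0:
        return 1.0 / (1.0 + math.exp(-logit))
    e = math.exp(logit)
    return e / (1.0 + e)


# Hypothetical branch outputs for one face crop.
local_feats = [0.2, -0.1, 0.7]   # stand-in for the CNN (local) branch
global_feats = [0.5, 0.3]        # stand-in for the transformer (global) branch

# Hypothetical learned weights, one per fused feature.
weights = [0.4, -0.2, 0.9, 0.1, -0.3]

fused = fuse_features(local_feats, global_feats)
p_fake = fake_probability(fused, weights)
is_fake = p_fake > 0.5
```

In a real system both branches would produce much longer feature vectors and the head would be trained jointly with them; the sketch only shows how local and global features can share a single classifier.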

Cite This Study

Javed et al. studied this question.

synapsesocial.com/papers/69db37df4fe01fead37c5e93 https://doi.org/10.1007/s44163-026-01188-1
