What question did this study set out to answer?

This research aims to enhance facial liveness detection to better identify real human faces versus spoofing attempts.

May 16, 2026Open Access

Facial Liveness Detection Using a Hybrid CNNViT Architecture for Robust Face AntiSpoofing

Puntos clave

This research aims to enhance facial liveness detection to better identify real human faces versus spoofing attempts.
Developed a hybrid architecture combining CNN and ViT for liveness classification.
Evaluated performance on the CASIA Face Anti-Spoofing Database.
Compared hybrid model performance with standalone ViT, LBP, and SVM.
The hybrid CNN-ViT model achieved the highest classification accuracy.
Joint local-global feature extraction provided more discriminative representations.
Improved detection rates for various spoofing artefacts.

Resumen

Face anti-spoofing, commonly known as facial liveness detection, is a critical security component of modern biometric authentication systems. It determines whether a presented face belongs to a live person or is a fraudulent artefact such as a printed photograph, video replay, or three-dimensional mask. This paper proposes a hybrid deep learning architecture that fuses a Convolutional Neural Network (CNN) with a Vision Transformer (ViT) for binary liveness classification on the CASIA Face Anti-Spoofing Database. The CNN component captures fine-grained local texture cues indicative of spoofing artefacts, while the ViT component models global spatial relationships across image patches through multi-head self-attention. To provide performance context, a standalone ViT and a classical Local Binary Pattern (LBP) and Support Vector Machine (SVM) pipeline are also evaluated. The proposed hybrid model achieves the highest classification accuracy, validating that joint local-global feature extraction yields more discriminative representations for face anti-spoofing.

Me gusta

Guardar

Ver artículo completo