What type of study is this?

This is a Experimental Study study.

October 18, 2025Open Access

Hybrid Vision Transformer and Quantum Convolutional Neural Network for Image Classification

Key Points

The hybrid ViT-QCNN-FT model achieved 99.77% accuracy on CIFAR-10, showcasing significant performance gain.
Quantum noise can improve accuracy by 2.71% in certain conditions, emphasizing its complex role in QML.
The study demonstrates a 29.36% accuracy drop when QCNN is replaced by classical counterparts, indicating clear quantum advantage.
This work paves the way for co-designing classical and quantum architectures for high-dimensional learning challenges.

Abstract

Quantum machine learning (QML) holds promise for computational advantage, yet progress on real-world tasks is hindered by classical preprocessing and noisy devices. We introduce ViT-QCNN-FT, a hybrid framework that integrates a fine-tuned Vision Transformer with a quantum convolutional neural network (QCNN) to compress high-dimensional images into features suited for noisy intermediate-scale quantum (NISQ) devices. By systematically probing entanglement, we show that ansatzes with uniformly distributed entanglement entropy consistently deliver superior non-local feature fusion and state-of-the-art accuracy (99.77% on CIFAR-10). Surprisingly, quantum noise emerges as a double-edged factor: in some cases, it enhances accuracy (+2.71% under amplitude damping). Strikingly, substituting the QCNN with classical counterparts of equal parameter count leads to a dramatic 29.36% drop, providing unambiguous evidence of quantum advantage. Our study establishes a principled pathway for co-designing classical and quantum architectures, pointing toward practical QML capable of tackling complex, high-dimensional learning tasks.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Mingzhu Wang

Harbin University of Science and Technology

Yun Shang

Qingdao University of Science and Technology

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Hybrid Vision Transformer and Quantum Convolutional Neural Network for Image Classification

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study