What question did this study set out to answer?

This research aims to address challenges in spatiotemporal feature extraction and multi-source data adaptation for video behavior recognition.

June 1, 2026Open Access

Video Behavior Recognition System Combining Big Data and Deep Learning

Key Points

This research aims to address challenges in spatiotemporal feature extraction and multi-source data adaptation for video behavior recognition.
Developed a robust intelligent recognition system integrating big data processing and deep learning.
Employed a cooperative learning algorithm with a hierarchical spatiotemporal adaptation network and hybrid knowledge distillation.
Tested the system on the UCF101 and HMDB51 datasets.
Achieved recognition accuracy of 98.5% on UCF101 and 89.2% on HMDB51 datasets.
Improved accuracy by 2.3% and 3.1% compared to the optimal baseline for the respective datasets.

Abstract

In response to the problem of insufficient spatiotemporal feature extraction and difficulty in adapting to multi-source heterogeneous data in complex scenes of existing video behavior recognition models, this paper constructs a highly robust intelligent recognition and analysis system that integrates big data processing and deep learning. The core architecture employs a cooperative learning algorithm, comprising a hierarchical spatiotemporal adaptation network and a hybrid knowledge distillation (KD) mechanism. The network first identifies local and global video features via an adaptation layer, then enhances them through second-order pooling. Hybrid KD uses a teacher-student model to integrate previous human knowledge and distill it into a lightweight model that can efficiently process massive streaming data. The comparison results show that the recognition accuracy of the proposed system on the UCF101 and HMDB51 (Human Motion Database) datasets is 98.5% and 89.2%, respectively, which are 2.3% and 3.1% higher than the optimal baseline, respectively. This demonstrates the effectiveness of the framework in achieving accuracy and resource efficiency in practical video analysis systems.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Xu Zhao (Thu,) studied this question.

synapsesocial.com/papers/6a1d228d02fbce9130638465 https://doi.org/https://doi.org/10.1016/j.procs.2026.03.221

Bookmark

View Full Paper