What type of study is this?

This is a quantitative study.

September 16, 2025

Human Action Recognition from Video Using ML and DL Classifiers

Key Points

The LRCN model achieves higher accuracy (~87.6%) in human action recognition than the MoveNet-based model.
Hybrid framework combines machine learning with deep learning, addressing issues of occlusion and noise.
Both models are trained and evaluated on the benchmark datasets UCF101 and HMDB51.
Findings reveal important trade-offs between accuracy and latency pertinent to real-time applications.
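
The accuracy versus latency trade-off noted above can be measured with a small harness like the one below. This is a minimal sketch, not the study's evaluation code: the `evaluate` function, the `classify` callable, and the `(clip, label)` sample layout are all hypothetical names chosen for illustration.

```python
import time
from typing import Any, Callable, Iterable, Tuple


def evaluate(
    classify: Callable[[Any], str],
    samples: Iterable[Tuple[Any, str]],
) -> Tuple[float, float]:
    """Return (accuracy, mean latency in seconds per clip).

    `classify` and the (clip, label) pairs are illustrative assumptions;
    any model wrapped as a function from clip to label will work.
    """
    correct = 0
    total = 0
    elapsed = 0.0
    for clip, label in samples:
        start = time.perf_counter()
        predicted = classify(clip)
        # Only the time spent inside the classifier is counted.
        elapsed += time.perf_counter() - start
        correct += int(predicted == label)
        total += 1
    if total == 0:
        raise ValueError("no samples to evaluate")
    return correct / total, elapsed / total
```

Running both models through the same harness on the same clips gives directly comparable accuracy and per-clip latency figures, which is the comparison a real-time deployment decision would rest on.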

Abstract

Human Action Recognition (HAR) has evolved from traditional handcrafted feature methods to modern data-driven approaches leveraging machine and deep learning. Early systems struggled to generalize in realistic conditions due to occlusions, motion complexity, and background noise. This project addresses these limitations by proposing a hybrid framework that merges traditional Machine Learning (ML) with advanced Deep Learning (DL) models to detect human actions from video data. Two fundamental architectures are deployed and contrasted: the Long-term Recurrent Convolutional Network (LRCN), which combines CNNs and LSTMs to capture spatial and temporal patterns, and a streamlined pose-based classifier utilizing Google's MoveNet for real-time skeleton tracking. Both models are trained and evaluated on the benchmark datasets UCF101 and HMDB51. Experimental results demonstrate that while LRCN achieves higher accuracy (~87.6%), the MoveNet model offers superior inference speed and robustness to noise, making it suitable for real-time applications. The findings highlight key trade-offs between accuracy and latency, providing insights for deploying HAR systems across diverse domains such as surveillance, healthcare, and human-computer interaction.
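
To make the pose-based route more concrete, here is a minimal sketch of how skeleton keypoints could be turned into features and classified. The abstract does not say which classifier sits on top of MoveNet, so the nearest-centroid rule, the frame averaging, and every function name below are assumptions for illustration only. The sketch relies on MoveNet reporting 17 joints per frame as (y, x, score) in COCO order, and uses plain Python in place of a real model call.

```python
import math
from typing import Dict, List, Sequence, Tuple

Keypoint = Tuple[float, float, float]  # (y, x, score) for one joint
# Joint indices in the COCO ordering used by MoveNet.
L_SHOULDER, R_SHOULDER, L_HIP, R_HIP = 5, 6, 11, 12


def _midpoint(a: Keypoint, b: Keypoint) -> Tuple[float, float]:
    return ((a[0] + b[0]) / 2.0, (a[1] + b[1]) / 2.0)


def normalize_pose(frame: Sequence[Keypoint]) -> List[float]:
    """Center joints on the hip midpoint and scale by torso length.

    This makes the features independent of where the person stands
    and how large they appear in the frame (an assumed preprocessing
    step, not one described in the source).
    """
    hip = _midpoint(frame[L_HIP], frame[R_HIP])
    shoulder = _midpoint(frame[L_SHOULDER], frame[R_SHOULDER])
    torso = math.hypot(shoulder[0] - hip[0], shoulder[1] - hip[1]) or 1.0
    features: List[float] = []
    for y, x, _score in frame:
        features.extend(((y - hip[0]) / torso, (x - hip[1]) / torso))
    return features


def clip_features(frames: Sequence[Sequence[Keypoint]]) -> List[float]:
    """Average the normalized pose over all frames of a clip."""
    per_frame = [normalize_pose(f) for f in frames]
    return [sum(column) / len(per_frame) for column in zip(*per_frame)]


def nearest_centroid(features: Sequence[float],
                     centroids: Dict[str, Sequence[float]]) -> str:
    """Pick the action label whose centroid is closest (hypothetical classifier)."""
    return min(centroids, key=lambda label: math.dist(features, centroids[label]))
```

Averaging over frames discards temporal order, which keeps the sketch short but is a simplification; the LRCN described above keeps that order by feeding per-frame CNN features into an LSTM.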
