What question did this study set out to answer?

The aim is to improve the automatic detection of violent actions in video surveillance systems.

February 14, 2026Open Access

HSTNet:Violent Action Detection

Key Points

The aim is to improve the automatic detection of violent actions in video surveillance systems.
Proposes HSTNet, a hybrid architecture combining spiking neural networks and transformers.
Utilizes a dual-branch design for temporal and spatial feature extraction.
Introduces a feature interaction module for deep cross-modal feature fusion.
HSTNet outperforms existing state-of-the-art methods in action recognition accuracy.
Significantly higher performance metrics observed across multiple datasets including UCF101 and HMDB51.

Abstract

To enhance public safety and safeguard lives and property, the automatic detection of anomalous and violent behaviors in video has become a key task in intelligent surveillance systems. Violent actions are often abrupt, rapid, and irregular, posing considerable challenges to conventional approaches. Existing methods based on hand-crafted features and convolutional neural networks still exhibit limitations in spatiotemporal feature extraction, recognition accuracy, and model robustness. To address these issues, this paper proposes HSTNet, a hybrid neural architecture that integrates Spiking Neural Networks (SNNs) with Transformers. The framework adopts a dual-branch design: the SNN branch models temporal dynamics in video, while the Transformer branch extracts spatial structural information. A feature interaction module is further introduced to enable deep cross-modal fusion. Experiments on multiple datasets including UCF101, HMDB51, Hockey Fight, and Movies Fight demonstrate that HSTNet achieves significantly higher accuracy than state-of-the-art baselines, indicating strong performance and promising application potential.

Bookmark

View Full Paper

Cite This Study

Meng et al. (Thu,) studied this question.

synapsesocial.com/papers/6990113f2ccff479cfe57c77 https://doi.org/https://doi.org/10.3390/app16041825

Bookmark

View Full Paper