What question did this study set out to answer?

The research aims to improve the recognition of unsafe behaviors among miners using advanced video analysis techniques.

February 11, 2026Open Access

An Algorithm for Identifying Unsafe Behaviors of Miners Based on the Improved AlphaPose

Key Points

The research aims to improve the recognition of unsafe behaviors among miners using advanced video analysis techniques.
Developed an unsafe behavior recognition algorithm based on improved AlphaPose (RS-AlphaPose).
Replaced the original target detection network with an improved real-time detection Transformer (RTDETR).
Enhanced posture estimation using sliding window and channel attention mechanisms.
Constructed spatio-temporal dependencies using a spatio-temporal graph convolution network.
Achieved an average posture estimation accuracy of 72.5% on the COCO2017 dataset, 2.2% higher than basic AlphaPose.
Reached 94.5% accuracy in recognizing typical unsafe behaviors on a self-built miner dynamic behavior dataset, 4.5% higher than the basic model.

Abstract

Utilizing video surveillance in mines to identify unsafe behaviors of miners is an important technical means for preventing coal mine accidents and achieving safety control. However, the complex underground environment (such as chaotic backgrounds, personnel occlusion, etc.) severely affects the estimation of human postures and feature extraction, resulting in low accuracy of unsafe behavior identification. To address this issue, this paper proposes a miner unsafe behavior recognition algorithm based on improved AlphaPose (RS-AlphaPose). Firstly, the improved real-time detection Transformer (RTDETR) is adopted to replace the original target detection network. Through the deformable attention mechanism and the addition of small target detection layers, the target detection ability in complex scenes is enhanced. Secondly, the sliding window attention and channel attention mechanisms are integrated in the posture estimation network to strengthen multi-scale semantics and global context correlation, thereby improving the accuracy of skeleton extraction in the presence of occlusion. Finally, the spatio-temporal graph convolution network is introduced to construct the spatio-temporal dependency of the skeleton sequence, capturing the temporal features of dynamic behaviors. On the COCO2017 posture dataset, the average accuracy of posture estimation of this algorithm reaches 72.5%, which is 2.2% higher than the basic AlphaPose model. On the self-built miner dynamic behavior dataset, the average recognition accuracy for typical unsafe behaviors such as climbing and crossing reaches 94.5%, which is 4.5% higher than the basic model. The experiments show that the proposed algorithm can effectively solve the interference problems in complex underground environments, significantly improve the accuracy of dynamic unsafe behavior recognition of miners, and provide a reliable technical solution for coal mine safety production.

Bookmark

View Full Paper

Cite This Study

Liu et al. (Sun,) studied this question.

synapsesocial.com/papers/698c1bdc267fb587c655ddde https://doi.org/https://doi.org/10.3390/s26041107

Bookmark

View Full Paper