April 9, 2025Open Access

MAF-Net: A multimodal data fusion approach for human action recognition

Key Points

Key points are not available for this paper at this time.

Abstract

3D skeleton-based human activity recognition has gained significant attention due to its robustness against variations in background, lighting, and viewpoints. However, challenges remain in effectively capturing spatiotemporal dynamics and integrating complementary information from multiple data modalities, such as RGB video and skeletal data. To address these challenges, we propose a multimodal fusion framework that leverages optical flow-based key frame extraction, data augmentation techniques, and an innovative fusion of skeletal and RGB streams using self-attention and skeletal attention modules. The model employs a late fusion strategy to combine skeletal and RGB features, allowing for more effective capture of spatial and temporal dependencies. Extensive experiments on benchmark datasets, including NTU RGB+D, SYSU, and UTD-MHAD, demonstrate that our method outperforms existing models. This work not only enhances action recognition accuracy but also provides a robust foundation for future multimodal integration and real-time applications in diverse fields such as surveillance and healthcare.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Dongwei Xie

Nantong University

Xiaodan Zhang

Beijing University of Posts and Telecommunications

Xiang Gao

Wenzhou Medical University

Journals

PLoS ONE

Actions

Institutions

Zhongkai University of Agriculture and Engineering

Guangdong Polytechnic Normal University

Guangdong Police College

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

MAF-Net: A multimodal data fusion approach for human action recognition

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study