September 10, 2025

MASTER: A Multi-modal Foundation Model for Human Activity Recognition

Key Points

MASTER achieves higher accuracy in human activity recognition with less labeled data across various situations.
The model effectively utilizes self-supervised pre-training to learn from unlabeled data, making it versatile across different scenes.
Its few-shot alignment mechanism allows adaptation to multiple modalities and different activity categories.
MASTER's performance was validated on 7 multi-modal datasets, supporting 8 distinct sensor types.

Abstract

Multi-modal sensing has become crucial in Human Activity Recognition (HAR) due to its ability to combine data from diverse sensors. However, recognizing various activities in different scenes using multi-modal data from different positions and devices remains challenging because of dynamic combinations of modal inputs, data heterogeneity, and scarcity of labeled data. To tackle these challenges, we propose MASTER, a multi-modal foundation model specifically designed for HAR. MASTER introduces a self-supervised pre-training method based on masked-data modeling, enabling the model to learn from unlabeled data and adapt to dynamic combinations of modal inputs. Moreover, it incorporates a few-shot alignment mechanism to facilitate adaptation to different activities, scenes, positions, and devices. Through pre-training and fine-tuning on 7 multi-modal HAR datasets, MASTER currently supports, but is not limited to, 8 modalities (ACC, Gyro, mmWave, WiFi, Skeleton, Lidar, Infrared, and RGB) and 45 human activities. The results demonstrate that MASTER achieves the highest accuracy with minimal labeled data across various situations, surpassing alternative solutions.
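
The abstract does not give implementation details, so the following is only a minimal sketch of what a masked-data modeling objective over a changing set of modalities might look like; it is not the authors' method. The function names (`mask_sample`, `masked_mse`), the parameters `mask_ratio` and `drop_prob`, and the choice of zeroing out hidden frames are all assumptions made for illustration.

```python
import random

def mask_sample(sample, mask_ratio=0.5, drop_prob=0.3, seed=None):
    """Hypothetical masking step for one training sample.

    sample: dict mapping a modality name (e.g. "acc") to a list of frames,
            where each frame is a list of floats.
    Returns (masked, masks): the kept modalities with hidden frames zeroed,
    and per-modality boolean lists marking which frames were hidden.
    """
    rng = random.Random(seed)
    names = list(sample)
    # Drop whole modalities at random to mimic dynamic input combinations,
    # but always keep at least one modality.
    kept = [n for n in names if rng.random() >= drop_prob]
    if not kept:
        kept = [rng.choice(names)]
    masked, masks = {}, {}
    for name in kept:
        frames = sample[name]
        n_hidden = int(mask_ratio * len(frames))
        hidden = set(rng.sample(range(len(frames)), n_hidden))
        masks[name] = [i in hidden for i in range(len(frames))]
        # Assumption: hidden frames are replaced by zeros.
        masked[name] = [
            [0.0] * len(f) if i in hidden else list(f)
            for i, f in enumerate(frames)
        ]
    return masked, masks

def masked_mse(pred, target, mask):
    """Mean squared error computed over the hidden frames only."""
    errs = [
        (p - t) ** 2
        for pf, tf, m in zip(pred, target, mask) if m
        for p, t in zip(pf, tf)
    ]
    return sum(errs) / len(errs) if errs else 0.0
```

In a pre-training loop under these assumptions, an encoder would receive `masked`, a decoder would predict the original frames, and `masked_mse` would score the prediction only where `mask` is true, so no activity labels are needed.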

Authors

Guanzhou Zhu

Dong Zhao

C-Q. Li

Journals

Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies
