The rapid proliferation of Internet of Things (IoT) systems has introduced novel cyber-physical vulnerabilities. Traditional network-based intrusion detection solutions are poorly suited to identifying malicious activity that originates at on-device sensors. This paper introduces a deep-learning-based multimodal sensing architecture that detects cyber-attacks from the traces of heterogeneous sensors, including accelerometers, gyroscopes, microphones, and temperature sensors. The proposed hybrid CNN-RNN-Transformer architecture fuses features across modalities and models the spatial-temporal interactions between them. The framework was evaluated on a manually annotated multimodal dataset and two publicly available benchmark datasets (CICIDS-2017 and IoT-23). It achieved an AUC of 0.96, an F1-score of 0.94, and an inference latency of 23 ms on edge hardware, supporting real-time deployability. These findings indicate that multimodal deep learning is an effective and scalable approach to cyber-physical threat detection in resource-constrained IoT settings.
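For readers unfamiliar with the two reported detection metrics, the following is a minimal sketch of how an F1-score and an AUC can be computed for a binary attack detector. It is not the authors' evaluation code. The function names, the toy labels and scores, and the 0.5 decision threshold are all hypothetical and chosen only for illustration.

```python
from typing import Sequence


def f1_score(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """F1 = 2*TP / (2*TP + FP + FN) for binary labels (1 = attack, 0 = benign)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the fraction of (attack, benign) pairs in which the attack
    sample receives the higher score; tied pairs count as half."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one sample of each class")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: three attack windows, three benign windows.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.5, 0.2, 0.1]

# Assumed decision threshold of 0.5 to turn scores into hard predictions.
preds = [1 if s >= 0.5 else 0 for s in scores]

print("F1 :", round(f1_score(labels, preds), 4))
print("AUC:", round(roc_auc(labels, scores), 4))
```

The F1-score depends on the chosen threshold, whereas the AUC is threshold-free because it only compares how attack and benign samples are ranked. The pairwise AUC above is quadratic in the number of samples, so a rank-based implementation would be preferable on large datasets.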
Muhammad A. Latif
Iqra University
Abdul Ahad Abro
Iqra University
Syed Muhammad Daniyal
Iqra University
Scientific Reports
Prince Sultan University
Princess Nourah bint Abdulrahman University
Iqra University
Latif et al. (Thu,) studied this question.
synapsesocial.com/papers/69a286eb0a974eb0d3c02441 — DOI: https://doi.org/10.1038/s41598-026-40614-3