What question did this study set out to answer?

The study aims to address the challenges of multimodal real-time interaction (modal heterogeneity, temporal asynchrony, and cognitive adaptation imbalance) and to improve the accuracy of user intention prediction.

May 29, 2026 | Open Access

Computer vision simulation with multimodal data for real-time user interaction in industrial design

Key Points

A CLT-driven multimodal fusion architecture is proposed for real-time interaction.
The architecture is verified on the HoloAssist dataset using metrics that include prediction accuracy and cognitive load.
Ablation experiments assess the importance of each core module.
Interaction intention prediction accuracy reached 95.2% ± 1.3%, 3.5 percentage points higher than the AlignMamba model.
End-to-end delay was 0.18 s ± 0.02 s, with an alignment delay of 0.028 s.
The subjective cognitive load score was 3.2 ± 0.8, significantly better than that of the baseline model.

Abstract

To address the problems of modal heterogeneity, temporal asynchrony, and cognitive adaptation imbalance in multimodal real-time interaction, a CLT-driven multimodal real-time fusion architecture is proposed. Experimental verification on the HoloAssist dataset shows that the interaction intention prediction accuracy of the proposed architecture reaches 95.2% ± 1.3%, which is 3.5 percentage points higher than that of the AlignMamba model. The end-to-end delay is 0.18 s ± 0.02 s, and the alignment delay is as low as 0.028 s. The subjective cognitive load score is 3.2 ± 0.8, significantly better than that of the baseline model. Ablation experiments confirm that each core module contributes to the performance improvement, and the model remains robust under modal loss and noise interference. This research provides support for the practical implementation of real-time multimodal interaction technology.
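The accuracy gain above is stated in percentage points, which is not the same as a relative percentage improvement. The short sketch below works through that arithmetic using only the two figures quoted in the abstract. The variable names are illustrative, and the AlignMamba accuracy it prints is a value derived by subtraction, not a number reported in this summary.

```python
# Figures quoted in the abstract (HoloAssist evaluation).
proposed_acc = 95.2   # proposed architecture, percent
gain_pp = 3.5         # reported margin over AlignMamba, percentage points

# Implied AlignMamba accuracy: derived here, not stated in the abstract.
baseline_acc = round(proposed_acc - gain_pp, 1)

# The same gap expressed as a relative improvement over the implied baseline.
relative_gain = round(100 * gain_pp / baseline_acc, 1)

print(f"Implied AlignMamba accuracy: {baseline_acc}%")
print(f"Relative improvement: {relative_gain}%")
```

Under these assumptions the implied baseline is about 91.7%, so a 3.5 percentage point gain corresponds to a relative improvement of roughly 3.8%.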
