What type of study is this?

This is a Quantitative Study study (also classified as: Experimental Study).

September 23, 2025Open Access

Multimodal Large Language Model-Enabled Machine Intelligent Fault Diagnosis Method with Non-Contact Dynamic Vision Data

Key Points

A novel fault diagnosis method achieved a classification accuracy of 95.40%, indicating effectiveness in real-time monitoring.
Using event cameras for dynamic visual data improved environmental adaptability in fault diagnosis, enhancing equipment reliability.
Two fine-tuning strategies for the multimodal model were tested, utilizing OpenCV and leveraging video processing for better classification.
This approach allows non-contact sensing and intelligent analysis, highlighting the potential for advancements in smart manufacturing.

Abstract

Smart manufacturing demands ever-increasing equipment reliability and continuous availability. Traditional fault diagnosis relies on attached sensors and complex wiring to collect vibration signals. This approach suffers from poor environmental adaptability, difficult maintenance, and cumbersome preprocessing. This study pioneers the use of high-temporal-resolution dynamic visual information captured by an event camera to fine-tune a multimodal large model for the first time. Leveraging non-contact acquisition with an event camera, sparse pulse events are converted into event frames through time surface processing. These frames are then reconstructed into a high-temporal-resolution video using spatiotemporal denoising and region of interest definition. The study introduces the multimodal model Qwen2.5-VL-7B and employs two distinct LoRA fine-tuning strategies for bearing fault classification. Strategy A utilizes OpenCV to extract key video frames for lightweight parameter injection. In contrast, Strategy B calls the model’s built-in video processing pipeline to fully leverage rich temporal information and capture dynamic details of the bearing’s operation. Classification experiments were conducted under three operating conditions and four rotational speeds. Strategy A and Strategy B achieved classification accuracies of 0.9247 and 0.9540, respectively, successfully establishing a novel fault diagnosis paradigm that progresses from non-contact sensing to end-to-end intelligent analysis.

Read Full Paperexternally

Ask AI

Mark Helpful

Bookmark

Relay

View Full Paper