ホーム
探索
nav.journalClub
トレンド
その他
synapse
⌘+K
言語
日本語
日本語
Cross-modal spatio-temporal fusion weakly supervised video anomaly detection based on large-scale vision-language models | Synapse
March 3, 2026
Cross-modal spatio-temporal fusion weakly supervised video anomaly detection based on large-scale vision-language models
LP
Lihu Pan
Shandong University
SP
Shouxin Peng
Taiyuan University of Science and Technology
RZ
Rui Zhang
Jiangsu University
See all
Key Points
Video anomaly detection significantly improves using spatio-temporal fusion techniques, enhancing detection accuracy.
The analysis leverages large-scale vision-language models to achieve effectiveness in detecting anomalies.
Weakly supervised methods are utilized to reduce the need for extensive labeled datasets, making the approach more accessible.
This technique may enable better surveillance systems and automated monitoring in various domains, enhancing safety and security.
Mark Helpful
Like
Save
Bookmark
Relay
Share
Cite This Study
Copy
Pan et al. (Tue,) studied this question.
synapsesocial.com/papers/69a7662bbadf0bb9e87dbf72
https://doi.org/https://doi.org/10.1007/s00530-025-02158-w
Mark Helpful
Like
Save
Bookmark
Relay
Share