What question did this study set out to answer?

The primary goal is to enhance real-time detection of crisis events from noisy social media data using an optimized CNN-LSTM model.

April 22, 2026 | Open Access

Optimized CNN–LSTM Modeling for Crisis Event Detection in Noisy Social Media Streams

Key Points

Developed the SOCIAL framework integrating CNN and LSTM for event classification.
Utilized TF-IDF and Word2Vec embeddings for text representation.
Evaluated performance through six experimental configurations.
Achieved 98.59% accuracy with the TF-IDF-based CNN-LSTM model.
Reported a precision of 98.13% and a recall of 99.06%.
Confirmed strong predictive capabilities with F0.5, F1, and F2 scores of 98.31%, 98.59%, and 98.87%, respectively.
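
The F-scores listed above can be cross-checked against the reported precision and recall using the standard F-beta definition, F_beta = (1 + beta^2) * P * R / (beta^2 * P + R). The sketch below is only a consistency check on the published figures, not the authors' evaluation code; the function name `f_beta` is an illustrative choice.

```python
def f_beta(precision: float, recall: float, beta: float) -> float:
    """Weighted harmonic mean of precision and recall.

    beta < 1 weights precision more heavily, beta > 1 weights recall more heavily.
    """
    if precision == 0.0 and recall == 0.0:
        return 0.0  # avoid division by zero when both inputs are zero
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Precision and recall as reported for the TF-IDF-based CNN-LSTM model.
precision, recall = 0.9813, 0.9906

scores = {beta: f_beta(precision, recall, beta) for beta in (0.5, 1.0, 2.0)}
for beta, score in scores.items():
    print(f"F{beta:g} = {score * 100:.2f}%")
```

Because recall (99.06%) exceeds precision (98.13%), the score rises as beta grows. This matches the ordering F0.5 < F1 < F2 in the reported results.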

Abstract

Event detection is crucial for disaster response, public safety, and trend analysis, enabling real-time identification of critical events. Social media platforms provide a vast content source, offering more timely and diverse event coverage than traditional news reports. However, challenges arise from the informal and noisy nature of the text and from the limited availability of ground truth data for training models. This study introduces SOCIAL (Social Media Event Classification using Integrated Artificial Learning and Natural Language Processing), a mathematically grounded framework for real-time social media event detection. SOCIAL integrates a formal representation of social media text with a customized CNN–LSTM architecture, combining convolutional operations for local feature extraction with sequential modeling to capture temporal dependencies, thereby enhancing classification accuracy. Generative AI is employed to create synthetic event-related samples, addressing data scarcity and ensuring a balanced dataset, while the design incorporates quantitative principles to guide embedding selection and model optimization. The study systematically evaluates six experimental configurations with TF-IDF and Word2Vec embeddings. The TF-IDF-based CNN–LSTM model achieved the best performance, with 98.59% accuracy, 98.13% precision, 99.06% recall, and an MCC of 0.9719. The F0.5, F1, and F2 scores were 98.31%, 98.59%, and 98.87%, respectively, confirming the model’s strong predictive capabilities. TF-IDF integration enhanced event-specific term recognition, reducing misclassifications and improving reliability. These results demonstrate that SOCIAL is not only a fast, accurate, and scalable tool for crisis event detection, but also a formally principled framework for modeling and analyzing social media signals.
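
To illustrate why TF-IDF weighting can favor event-specific terms, here is a minimal pure-Python sketch of one common TF-IDF variant (relative term frequency times the natural log of inverse document frequency). The abstract does not specify the exact formulation, tokenization, or smoothing used in SOCIAL, so all of these are assumptions. The function name `tfidf` and the three example posts are invented for illustration only.

```python
import math
from collections import Counter


def tfidf(docs):
    """Return one {term: weight} dict per document.

    Assumed variant: tf = count / document length, idf = ln(N / df).
    Tokenization is a plain lowercase whitespace split.
    """
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents containing each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        vectors.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return vectors


# Hypothetical posts, not taken from the study's dataset.
posts = [
    "flood warning issued for the river district",
    "traffic on the river bridge is slow",
    "the concert downtown was great",
]
vectors = tfidf(posts)
first = vectors[0]
print(sorted(first, key=first.get, reverse=True))
```

In this toy corpus, "flood" occurs in a single post and receives a higher weight than "river", which occurs in two. The word "the" occurs in every post and receives a weight of zero. This is the mechanism by which rare, event-related vocabulary can stand out from common words.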
