The escalating scale and psychological harm of cyberbullying across digital platforms present a critical social challenge, demanding the urgent development of highly accurate and reliable automated detection systems. Standard fine-tuned transformer models, while powerful, often fall short in capturing the nuanced, context-dependent nature of online harassment. This paper introduces a novel hybrid deep learning model called Robustly Optimized Bidirectional Encoder Representations from the Transformers with the Bidirectional Long Short-Term Memory-based Attention model (RoBERTa-BiLSTM), specifically designed to address this challenge. To maximize its effectiveness, the model was systematically optimized using the Optuna framework and rigorously benchmarked against eight state-of-the-art transformer baseline models on a large cyberbullying dataset. Our proposed model achieves state-of-the-art performance, outperforming BERT-base, RoBERTa-base, RoBERTa-large, DistilBERT, ALBERT-xxlarge, XLNet-large, ELECTRA-base, DeBERTa-v3-small with an accuracy of 94.8%, precision of 96.4%, recall of 95.3%, F1-score of 95.8%, and an AUC of 98.5%. Significantly, it demonstrates a substantial improvement in F1-score over the strongest baseline and reduces critical false negative errors by 43%, all while maintaining moderate computational efficiency. Furthermore, our efficiency analysis indicates that this superior performance is achieved with a moderate computational complexity. The results validate our hypothesis that a specialized hybrid architecture, which synergizes contextual embedding with sequential processing and attention mechanism, offers a more robust and practical solution for real-world social media applications.
Mahdi et al. (Thu,) studied this question.