What type of study is this?

September 5, 2025

Toxic language detection on social media: a critical linguistic approach to online hate speech

Key Points

The analysis found that 25.67% of 15,000 posts contained toxic language, highlighting the prevalence of hate speech on social media.
Critical Discourse Analysis revealed hidden expressions of hate speech through irony and metaphor, challenging the effectiveness of automated tools.
The study employed machine learning techniques, specifically a BERT-based model, to classify posts for toxic language detection.
The findings call for improved contextual approaches in content moderation systems to combat online hate speech effectively.

Abstract

Background: The rise of hate speech on social media, especially during the COVID-19 pandemic, poses serious threats to psychological well-being and social cohesion. While automated detection tools exist, they often lack the ability to grasp context and cultural nuances. This study explores the integration of Critical Discourse Analysis to enhance the accuracy and fairness of toxic language detection on digital platforms. Aim: This study aims to examine toxic language on social media by integrating an automated detection method based on machine learning with Critical Discourse Analysis (CDA), in order to understand how hate speech is produced, disseminated, and normalized within digital spaces. Method: This study employs a qualitative-critical design. Data were collected by crawling public posts on social media platforms (Twitter and Facebook) using specific keywords. The screening of toxic language was performed using a BERT-based machine learning classification model. From the automatic detection results, 200 posts were purposively selected for further analysis using CDA, focusing on text structure, discursive practices, and social practices. Result: The results reveal that 25.67% of the 15,000 posts analyzed were classified as toxic language. The CDA analysis uncovered that much of the toxic language did not appear explicitly but was instead concealed through irony, humor, and metaphor. The most prevalent targets of hate speech were racial issues (45%), followed by religion (28%), gender (15%), and sexual orientation (12%). Social media serves not only as a medium for individual dissemination but also as an arena for the reproduction of discriminatory ideologies. Conclusion: This study makes methodological contributions to the development of fairer and more contextual digital content moderation systems and provides a foundation for policymakers to implement more effective regulations aimed at protecting digital spaces from hate speech.

Bookmark

Cite This Study

Kusuma et al. (Sun,) studied this question.

synapsesocial.com/papers/68bb46a86d6d5674bccfe2ef https://doi.org/https://doi.org/10.64268/jllm.v1i01.4

Bookmark