What question did this study set out to answer?

The aim is to develop effective counter narratives to combat online hate speech in Tamil through a structured methodology.

February 9, 2026Open Access

Nalvakku: fact-based counter narratives for Tamil hate speech

Key Points

The aim is to develop effective counter narratives to combat online hate speech in Tamil through a structured methodology.
Expanded a dataset of hate speech counter narrative pairs from 220 to 5,000 using a human-in-the-loop framework.
Employed a retrieval augmented generation system to enhance the narratives with external knowledge.
Integrated a human post-edited dataset into the RAG system, culminating in a Fact-RAG system.
Counter narratives generated are varied, credible, and contextually relevant.
Assessment shows significant improvements in factual accuracy and persuasiveness of narratives.

Abstract

Abstract Warning: This paper contains insulting statements that may cause discomfort for readers. The rapid proliferation of digital platforms has intensified online hate speech, especially in low-resource languages such as Tamil, where automated moderation techniques remain underdeveloped. This paper presents a three-stage methodology for generating counter narratives in Tamil. First, a seed dataset of 220 hate speech counter narrative (HS-CN) pairs is expanded to 5,000 through a human-in-the-loop Author Reviewer framework with expert validation. Second, a fact-based retrieval augmented generation (RAG) system is employed to incorporate external knowledge to enhance factual accuracy and persuasiveness. Finally, the human post-edited dataset is integrated to the RAG system as a curated knowledge base yielding a Fact-RAG system with stronger factual grounding and cultural appropriateness. Assessment through intrinsic indicators and LLM-based evaluations indicates that our methodology generates counter-narratives that are varied, credible, and contextually relevant. These findings underscore the effectiveness of integrating human supervision, factual validation, and selected examples for counter-narrative development in low-resource settings. GitHub: https://github.com/Bharathi-AI-for-Social-Good/Fact-RAG-BasedCN-Ta

Perguntar à IA

Bookmark

View Full Paper