June 14, 2026 · Open Access

Using LLMs to identify discourse derailment as a potential cue for disinformation in social media posts

Key Points

The aim is to enhance automatic detection methods for disinformation by analyzing discourse derailment.
Implemented an automatic detection system using a Large Language Model (LLM) to identify discourse derailment.
Tested the system on human-annotated data to evaluate its performance against various baselines.
Compared expected replies generated by the LLM with actual replies to identify discourse shifts.
The system outperformed various baselines and approached the level of agreement observed between human annotators.
Identified limitations, including bias that arises because the automatic system relies on different cues than human annotators do.

Abstract

Identifying disinformation in online social media is important for addressing several risks to society. However, methods for automatically detecting possible disinformation still mostly rely on low-level lexical cues, which are increasingly outdated in a world with access to generative language models. We present a new approach to the automatic detection of disinformation based on measuring discourse ‘derailment’ – messages that try to force the topic of discourse away from one topic and onto another. While this may include both malicious and benign derailment, it could serve as an important signal for early-warning systems. In this study, we implement a system for automatic detection of discourse derailment which uses a Large Language Model (LLM) to generate expected replies and compares them to the real replies. We test this system on a set of human-annotated data to show that the system outperforms various baselines and approaches the agreement between human annotators. This suggests that LLMs can be sensitive to discourse-level information. However, we also identify evidence of several limitations, including that the automatic system relies on different cues compared to human annotators, which leads to some amount of bias. Nevertheless, our project represents a considerable step towards understanding how to use LLMs to analyse discourse and a new angle on tackling disinformation.
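
The core idea described in the abstract, generating expected replies and comparing them with the real reply, can be illustrated with a rough sketch. This is not the authors' implementation: the abstract does not say how the comparison is made, so the bag-of-words cosine similarity used here is an assumed stand-in, and `generate_expected_replies`, `fake_llm`, and `derailment_score` are hypothetical names. The LLM call itself is replaced by a stub that returns canned replies.

```python
import math
import re
from collections import Counter
from typing import Callable, Sequence


def tokenize(text: str) -> Counter:
    """Lowercased bag-of-words counts (a crude, assumed text representation)."""
    return Counter(re.findall(r"[a-z0-9']+", text.lower()))


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors; 0.0 if either is empty."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def derailment_score(
    post: str,
    actual_reply: str,
    generate_expected_replies: Callable[[str], Sequence[str]],
) -> float:
    """Return 1 minus the best similarity between the actual reply and any
    expected reply. Higher values mean the reply is further from what was
    expected. The scoring rule is an assumption made for this sketch."""
    expected = generate_expected_replies(post)
    if not expected:
        raise ValueError("generator returned no expected replies")
    actual_vec = tokenize(actual_reply)
    best = max(cosine_similarity(tokenize(e), actual_vec) for e in expected)
    return 1.0 - best


def fake_llm(post: str) -> Sequence[str]:
    """Hypothetical stand-in for an LLM call; returns canned on-topic replies."""
    return [
        "The new bus routes will shorten my commute",
        "I hope the bus timetable is reliable",
    ]


post = "The city announced new bus routes today"
on_topic = derailment_score(post, "Great, the bus routes near me were terrible", fake_llm)
off_topic = derailment_score(post, "Forget that, what about the vaccine cover-up", fake_llm)
print(f"on-topic score:  {on_topic:.3f}")
print(f"off-topic score: {off_topic:.3f}")
```

In this toy setup the off-topic reply scores higher than the on-topic one. A lexical measure like this is exactly the kind of low-level cue the abstract criticises, so a realistic system would presumably swap in a discourse-aware comparison; the sketch only shows the generate-then-compare structure.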