Capturing the Style of Fake News

Key Points

Key points are not available for this paper at this time.

Abstract

In this study we aim to explore automatic methods that can detect online documents of low credibility, especially fake news, based on the style they are written in. We show that general-purpose text classifiers, despite seemingly good performance when evaluated simplistically, in fact overfit to sources of documents in training data. In order to achieve a truly style-based prediction, we gather a corpus of 103,219 documents from 223 online sources labelled by media experts, devise realistic evaluation scenarios and design two new classifiers: a neural network and a model based on stylometric features. The evaluation shows that the proposed classifiers maintain high accuracy in case of documents on previously unseen topics (e.g. new events) and from previously unseen sources (e.g. emerging news websites). An analysis of the stylometric model indicates it indeed focuses on sensational and affective vocabulary, known to be typical for fake news.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Piotr Przybyła

Universitat Pompeu Fabra

Actions

Institutions

Polish Academy of Sciences

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Piotr Przybyła (Fri,) studied this question.

synapsesocial.com/papers/6a0e9bd08720ffe3c1045183 — DOI: https://doi.org/10.1609/aaai.v34i01.5386

Also consider

Synapse has enriched 4 closely related papers on similar clinical questions. Consider them for comparative context:

The general inquirer: A computer system for content analysis and retrieval based on the sentence as a unit of information· 2007 · 220 citations
Social and Heuristic Approaches to Credibility Evaluation Online· 2010 · 1,331 citations
Explanation and trust: what to tell the user in security and AI?· 2010 · 154 citations
MizAR 60 for Mizar 50· 2023 · 76,144 citations

Also consider

Synapse has enriched 4 closely related papers on similar clinical questions. Consider them for comparative context:

The general inquirer: A computer system for content analysis and retrieval based on the sentence as a unit of information· 2007 · 220 citations
Social and Heuristic Approaches to Credibility Evaluation Online· 2010 · 1,331 citations
Explanation and trust: what to tell the user in security and AI?· 2010 · 154 citations
MizAR 60 for Mizar 50· 2023 · 76,144 citations

Capturing the Style of Fake News

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider

Also consider