In recent times, there has been an increase in misinformation, with misleading information being shared as real news to deceive and manipulate public opinion. The dissemination of misinformation, particularly in areas with global implications such as politics and health, can have severe consequences for society as a whole. For example, recent US elections related widespread misinformation has shown to deepen polarization and erode trust in both democratic institutions and our news media. Misleading reports during crises like the Ebola outbreak or COVID-19 misinformation about vaccines and treatments spread unnecessary fear, created barriers for public health response teams, and resulted in many preventable deaths. Social media further amplifies fake news, making it difficult for fact-checking efforts to keep pace. To distinguish misinformation from credible reporting, this paper aims to apply machine learning techniques to detect fake news with greater accuracy. To research this, we analyze datasets containing both fake and real news articles to uncover linguistic patterns and differences between the two. Natural Language Processing (NLP) techniques such as Term Frequency-Inverse Document Frequency (TF-IDF) are used to convert text data into numerical features for training machine learning models. Several classification algorithms, such as Logistic Regression, Random Forest, and XGBoost, are then trained to differentiate fake from real news. To further explore the differences in the data types, an analysis is done to examine sentiment differences. By leveraging data from everyday news, politics, and health sources, we keep the work grounded in the real-world implications of fake news disguised as fact. The goal is to develop an AI-powered automated fact-checking system to distinguish between real and fake sources, thereby contributing to ongoing efforts to protect the public from the harms of misinformation and uphold their trust in news media.
Maya Hussain (Sun,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: