What does this research mean for the field?

Large Language Models (LLMs) are capable of producing content that is as persuasive as untrained human participants, posing significant risks of manipulation and deception. Novelty: ClaimNovelty.SYNTHESIS. Consensus alignment: ConsensusAlignment.NEUTRAL.

What question did this study set out to answer?

The aim is to investigate the risks posed by large language models in terms of deception, manipulation, and persuasion.

February 21, 2026Open Access

Lies, damned lies, and language statistics: a comprehensive review of risks from manipulation, persuasion, and deception with large language models

Key Points

The aim is to investigate the risks posed by large language models in terms of deception, manipulation, and persuasion.
Survey of risks related to LLMs such as fraud and misinformation
Review of empirical data on LLM persuasion and deception
Evaluation of proposed mitigations for deceptive capabilities
LLMs are found to be as persuasive as untrained humans
Identified risks include criminal fraud and political misinformation
Mitigation strategies are evaluated for effectiveness and limitations

Abstract

Abstract Large Language Models (LLMs) have the potential to produce content that is effective at persuading, deceiving, and manipulating people. Here we survey the possible risks of systems with these capabilities, including criminal fraud, political misinformation, addictive AI companions, and misaligned autonomous systems. We then survey the rapidly growing body of empirical work on their propensity to deceive and their capacity to persuade, which suggests that models are already roughly as persuasive as untrained human participants. We review proposed mitigations for these techniques—including training models to be truthful or monitoring their hidden states—and highlight strengths and weaknesses of each potential approach. Finally, we highlight five key open questions for future research: how persuasive could AI systems be? How do AI systems persuade? What broader social impacts could AI persuasion have? Does persuasion advance truth? And how effective are proposed mitigations?

KI fragen

Bookmark

View Full Paper