What question did this study set out to answer?

The research investigates how well users can identify subtle manipulative tactics used by generative AI in conversations.

February 13, 2026 · Open Access

AI Undercover- How Generative AI Learned to Manipulate Us and Why Most People Don't Notice

Key Points

The research investigates how well users can identify subtle manipulative tactics used by generative AI in conversations.
The study used a structured experimental game called AI Undercover: The Manipulation Hunter.
It analyzed 330 game interactions to measure how well players detected manipulative AI tactics.
It evaluated detection of tactics such as trick statements, dark nudges, and confirmshaming.
Users had significant difficulty identifying high-risk manipulative tactics.
Findings reveal a concerning gap in user awareness of AI-generated manipulation.
The paper highlights the ethical risks associated with emotionally intelligent AI systems.

Abstract

This research paper examines how generative AI systems employ subtle forms of conversational manipulation, including sycophancy, emotional nudges, authority framing, and deceptive linguistic patterns. Through a structured experimental game, AI Undercover: The Manipulation Hunter, the study evaluates whether everyday users can detect manipulative tactics embedded in AI-generated messages. Results from 330 game interactions show that users struggle significantly with detecting high-risk tactics such as trick statements, dark nudges, and confirmshaming. The paper outlines ethical risks, cognitive vulnerabilities, implications for human autonomy, and the need for Fairness by Design principles as AI systems become more emotionally intelligent and personalized.
