What question did this study set out to answer?

This research aims to understand how different forms of choice architecture influence decision-making in large language models compared to human behavior.

June 17, 2026Open Access

AI agents are sensitive to nudges

Key Points

This research aims to understand how different forms of choice architecture influence decision-making in large language models compared to human behavior.
Adapted a human decision-making task for testing LLMs under four forms of choice architecture: defaults, suggestions, information highlighting, and optimal nudges.
Compared LLM decision-making with human behavior as a baseline across various prompting strategies and models.
Evaluated the sensitivity of LLMs to nudges and the stability of their behavior using reasoning-optimized techniques.
LLMs are more responsive to nudges than humans, where weak cues significantly alter model choices, affecting both positive and negative outcomes.
Behavioral responses of LLMs diverge from human patterns, with some models over-investing in information acquisition and others ignoring it entirely.
Reasoning-optimized LLMs restore some human-like sensitivity to nudges inconsistently and at a high computational cost.

Abstract

Large language models (LLMs) are increasingly deployed as autonomous agents that make choices and use tools on behalf of users. Yet, we have limited evidence about how their decisions are shaped by their environment. We adapt a human decision-making task to test leading LLMs under four forms of choice architecture: defaults, suggestions, information highlighting, and “optimal” nudges derived from a resource-rational model of human choice. We treat human behavior as a baseline for predictable sensitivity to such interventions. Across models and prompting strategies, LLMs often depart substantially from this baseline. They sometimes pay excessive costs to acquire information, sometimes ignore available information, and, most crucially, are far more responsive to nudges than humans, such that weak cues that slightly shift human behavior have larger effects on model choices, toward both better and worse payoff outcomes. Chain-of-thought prompting and in-context human data do not reliably stabilize behavior. Recent reasoning-optimized LLMs can, in some configurations, restore more human-level sensitivity to nudges, but do so inconsistently and at substantial computational cost. These results point to an important and largely neglected safety concern: LLM agents can be behaviorally brittle under subtle changes in choice architecture, even in the absence of adversarial settings.

Read Full Paperexternally

Ask AI

Helpful

Bookmark

View Full Paper