What question did this study set out to answer?

The study investigates if large language models' behavior in strategic settings aligns with normative decision-making or human behavior.

June 17, 2026

Can Large Language Models Reason Strategically? Evidence From Attacker–Defender Signaling Games

Key Points

The study investigates if large language models' behavior in strategic settings aligns with normative decision-making or human behavior.
Controlled attacker-defender signaling game evaluation
Comparison of GPT-4o against normative Bayesian model and human data
Analysis of belief formation and action selection dynamics
GPT-4o matches normative action in 7 out of 8 scenarios but diverges in decision-making distribution
Significant underutilization of the abort option by GPT-4o (6.7% vs. normative 25.6%)
Core finding reveals cognitive-action decoupling, with GPT-4o's beliefs less certain yet producing more deterministic actions.

Abstract

ABSTRACT Large language models (LLMs) are increasingly considered for deployment in applications requiring strategic judgment under uncertainty. Yet it remains unclear whether their behavior in adversarial environments resembles normative decision‐making, human strategic behavior, or something qualitatively distinct from both. This study addresses that question using a controlled attacker–defender signaling game in which an attacker must interpret potentially deceptive defender announcements and decide whether to attack one of two targets or abstain. We develop a three‐way comparison framework that evaluates GPT‐4o against two benchmarks simultaneously: a normative Bayesian best‐response model and empirical human decisions drawn from a matched experimental data set. Critically, we decompose strategic behavior into two components, belief formation and action selection, to identify whether similarities and divergences across agent types arise at the level of probabilistic inference, behavioral choice, or both. The results provide partial support for normative alignment (H1): GPT‐4o's modal action matches the normative benchmark in seven out of eight scenarios, yet its decision distributions diverge significantly in all conditions (), driven by a systematic underutilization of the abort option (6.7% vs. the normative recommendation of 25.6%). Human similarity (H2) is not supported, with action frequency distributions differing significantly across all eight conditions (). The core finding is a cognitive‐action decoupling: GPT‐4o maintains more diffuse posterior beliefs than humans in six out of eight scenarios yet produces more deterministic actions, and explicitly articulates uncertainty in 14%–28% of reasoning traces while systematically overriding that uncertainty in its final decisions. These findings position current LLMs as a strategically distinct class of agent, neither fully rational equilibrium players nor behavioral mimics of human bounded rationality. The observed commission bias and belief‐action decoupling have direct implications for the deployment of LLMs in high‐stakes adversarial roles, where abstention under uncertainty is often the strategically rational choice.

AIに質問

Bookmark