What question did this study set out to answer?

The study examines the limitations of autonomous AI agents in high-stakes decisions compared to structured decision frameworks.

March 14, 2026Open Access

When Agents Fail: Why Multi-Criteria Decision Frameworks Outperform Autonomous AI Agents for High-Stakes Decisions

Read Full Paperexternally

Key Points

The study examines the limitations of autonomous AI agents in high-stakes decisions compared to structured decision frameworks.
Systematic mapping of 16 case studies
Comparison of AI agents with the AEGIS multi-criteria decision framework
Analysis of documented security vulnerabilities and emergent safety behaviors
Identified 10 security vulnerabilities in autonomous agents
Established 6 safety behavior patterns across multi-agent environments
Demonstrated how MCDA frameworks ensure auditability and proportionality

Abstract

The rapid deployment of autonomous AI agents in production environments has exposed critical vulnerabilities that challenge fundamental assumptions about agent reliability. Shapira et al. (2026) documented 10 security vulnerabilities and 6 emergent safety behaviors across 6 autonomous agents observed over 14 days in unstructured multi-agent environments (arXiv:2602.20021). These failures -- including disproportionate response, manipulation via social engineering, identity hijacking, infinite resource loops, and constitutional corruption -- are not implementation bugs but architectural consequences of delegating decisions to stochastic generative models. We perform a systematic mapping of all 16 documented case studies to the AEGIS multi-criteria decision framework, demonstrating that every vulnerability class maps to a structural guarantee in MCDA: proportionality via normalized TOPSIS distances, manipulation resistance via fixed criteria schemas, deterministic reproducibility, bounded resource consumption, and complete audit trails. We argue that for domains requiring auditability, proportionality, and adversarial robustness -- including cybersecurity, finance, healthcare, and critical infrastructure -- structured MCDA frameworks like AEGIS are architecturally superior to autonomous agents for the decision layer, while AI remains valuable for feature extraction and enrichment.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Anderson Acosta de Paiva

Priscylla Lygia Boente do Nascimento

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

When Agents Fail: Why Multi-Criteria Decision Frameworks Outperform Autonomous AI Agents for High-Stakes Decisions

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider

Also consider