This deposit contains a working research draft describing a multi-agent deliberation architecture for long-horizon sequential decision making in large language model–based agents. Using Zork I as a challenging interactive fiction testbed, the work argues that single-pass inference places excessive cognitive burden on a single model call, leading to looping behavior and poor arbitration between competing objectives. The paper proposes an explicit separation between proposal generation and decision selection through specialized mission agents, a dedicated explorer agent, and a distinct arbitration step. The contribution is architectural and methodological rather than performance-driven; results are preliminary and intended to motivate further investigation into long-horizon agent control, exploration–exploitation tradeoffs, and reasoning transparency.
Building similarity graph...
Analyzing shared references across papers
Loading...
Michael D. Lane
Texas A&M University
Mitchell Institute
Building similarity graph...
Analyzing shared references across papers
Loading...
Michael D. Lane (Mon,) studied this question.
www.synapsesocial.com/papers/6967190087ba607552bb8f3e — DOI: https://doi.org/10.5281/zenodo.18224702
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: