The Game Reasoning Arena library provides a framework for evaluating the decision making abilities of large language models (LLMs) through strategic board games implemented in Google OpenSpiel library. The framework enables systematic comparisons between LLM based agents and other agents (random, heuristic, reinforcement learning agents, etc.) in various game scenarios by wrapping multiple board and matrix games and supporting different agent types. It integrates API access to models via liteLLM, local model deployment via vLLM, and offers distributed execution through Ray. This paper summarises the library structure, key characteristics, and motivation of the repository, highlighting how it contributes to the empirical evaluation of the reasoning of LLM and game theoretic behaviour.
Building similarity graph...
Analyzing shared references across papers
Loading...
Lucia Cipolina-Kun
Marianna Nezhurina
Jenia Jitsev
Building similarity graph...
Analyzing shared references across papers
Loading...
Cipolina-Kun et al. (Tue,) studied this question.
www.synapsesocial.com/papers/68d6e16f8b2b6861e4c4004b — DOI: https://doi.org/10.48550/arxiv.2508.03368
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: