Foundation models, in particular large language models (LLMs), are finding increasing popularity when used in describing goals for robotic control, decision making, and execution. Recently, proposals for hybrid paradigms leveraging strengths of reinforcement learning (RL) agents in tandem with LLMs for robotic control have been demonstrated. The interface between the RL agents and the language model however offers a unique opportunity to explore how prompt framing may affect such hybrid systems. This work presents a controlled experimental platform to measure and better understand how manipulation of the interface between RL agents and an LLM impacts behaviour of a hybrid advisor-arbiter architecture. We compared three agents under matched evaluation protocols and initializations in a simulated navigation environment: (i) RL-only tabular Q-learning; (ii) LLM-only (stateless) action selection; and (iii) a hybrid LLM + RL agent. Under a constrained interaction budget (10 episodes per world), the hybrid LLM + RL agent achieves higher mean success and higher mean cumulative reward than both RL-only and LLM-only baselines. Advisor-channel ablations (random recommendations and null recommendations) reduce performance, indicating that structured advice contributes beyond adding extra text. We further demonstrate prompt framing as a controlled factor by evaluating navigation-role personas, narrative personas, and relational variants of a caregiver prompt under matched conditions, yielding heterogeneous effects across framings. The contribution of this work is to provide a structured testbed and evaluation approach for investigating the impact of prompt framing on multi-step decision making and control tasks.
Building similarity graph...
Analyzing shared references across papers
Loading...
Anup Tuladhar
Eli Kinney‐Lang
SHILAP Revista de lepidopterología
Frontiers in Robotics and AI
University of Calgary
Azrieli College of Engineering Jerusalem
Suntek Computer Systems (China)
Building similarity graph...
Analyzing shared references across papers
Loading...
Tuladhar et al. (Wed,) studied this question.
www.synapsesocial.com/papers/69edaa9b4a46254e215b3274 — DOI: https://doi.org/10.3389/frobt.2026.1771992