Key points are not available for this paper at this time.
Tools have become a mainstay of LLMs, allowing them to retrieve knowledge not in their weights, to perform tasks on the web, and even to control robots. However, most ontologies and surveys of tool-use have assumed the core challenge for LLMs is choosing the tool. Instead, we introduce a framework for tools more broadly which guides us to explore a model's ability to detect "silent" tool errors, and reflect on how to plan. This more directly aligns with the increasingly popular use of models as tools. We provide an initial approach to failure recovery with promising results both on a controlled calculator setting and embodied agent planning.
Building similarity graph...
Analyzing shared references across papers
Loading...
Sun et al. (Thu,) studied this question.
synapsesocial.com/papers/68e6312bb6db6435875c3940 — DOI: https://doi.org/10.48550/arxiv.2406.19228
Jimin Sun
Carnegie Mellon University
So Yeon Min
Yingshan Chang
Building similarity graph...
Analyzing shared references across papers
Loading...
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: