June 27, 2024 · Open Access

Tools Fail: Detecting Silent Errors in Faulty Tools

Abstract

Tools have become a mainstay of LLMs, allowing them to retrieve knowledge not in their weights, to perform tasks on the web, and even to control robots. However, most ontologies and surveys of tool-use have assumed the core challenge for LLMs is choosing the tool. Instead, we introduce a framework for tools more broadly which guides us to explore a model's ability to detect "silent" tool errors, and reflect on how to plan. This more directly aligns with the increasingly popular use of models as tools. We provide an initial approach to failure recovery with promising results both on a controlled calculator setting and embodied agent planning.
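To make the notion of a "silent" tool error concrete, the following is a minimal sketch and not the authors' method. It assumes a hypothetical broken multiplication tool (`faulty_calculator`) that returns a wrong number without raising any error, and a hypothetical checker (`looks_plausible`) that applies two cheap sanity tests a model or a human could perform without a trusted calculator. The fault and both checks are invented for illustration and assume positive integer operands.

```python
def faulty_calculator(a: int, b: int) -> int:
    """Hypothetical broken tool: multiplies, then silently drops the last digit."""
    return (a * b) // 10


def looks_plausible(a: int, b: int, reported: int) -> bool:
    """Sanity-check a reported product of two positive integers.

    Illustrative assumption: these heuristics only catch some faults,
    and they are not taken from the paper.
    """
    if a <= 0 or b <= 0:
        raise ValueError("this sketch only handles positive integers")
    if reported <= 0:
        return False
    # Check 1: the last digit of a product depends only on the
    # last digits of the two operands.
    last_digit_ok = reported % 10 == ((a % 10) * (b % 10)) % 10
    # Check 2: an m-digit number times an n-digit number has
    # either m + n - 1 or m + n digits.
    expected_digits = len(str(a)) + len(str(b))
    magnitude_ok = len(str(reported)) in (expected_digits - 1, expected_digits)
    return last_digit_ok and magnitude_ok


if __name__ == "__main__":
    a, b = 123, 456
    reported = faulty_calculator(a, b)
    verdict = "plausible" if looks_plausible(a, b, reported) else "suspicious"
    print(f"tool says {a} * {b} = {reported}: {verdict}")
```

In this sketch the tool never signals failure, so the caller has to decide on its own whether to trust the output. The checks are deliberately incomplete: a fault that preserves both the last digit and the digit count would pass unnoticed.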

Cite this study

Sun et al. (2024) studied this question.

synapsesocial.com/papers/68e6312bb6db6435875c3940 — DOI: https://doi.org/10.48550/arxiv.2406.19228

Also consider

Synapse has enriched 5 closely related papers on similar questions. Consider them for comparative context:

LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error · 2024 · 1 citation
Tool learning with language models: a comprehensive survey of methods, pipelines, and benchmarks · 2025
Can Tool-augmented Large Language Models be Aware of Incomplete Conditions? · 2024 · 1 citation
ToolSword: Unveiling Safety Issues of Large Language Models in Tool Learning Across Three Stages · 2024 · 2 citations
Towards Practical Tool Usage for Continually Learning LLMs · 2024 · 2 citations

Authors

Jimin Sun

Carnegie Mellon University

So Yeon Min

Yingshan Chang
