What type of study is this?

This is an experimental study.

September 30, 2025 | Open Access

DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL

Key Points

DeepDive-32B achieves a competitive open-source result on BrowseComp, outperforming WebSailor, DeepSeek-R1-Browse, and Search-o1.
Multi-turn reinforcement learning (RL) training improves deep search ability and contributes significantly to gains across multiple benchmarks.
A data-synthesis strategy automatically generates complex, hard-to-find questions from open knowledge graphs, supplying the difficult training data that open LLMs otherwise lack.
DeepDive enables test-time scaling of tool calls and parallel sampling.

Abstract

Augmenting large language models (LLMs) with browsing tools substantially improves their potential as deep search agents to solve complex, real-world tasks. Yet, open LLMs still perform poorly in such settings due to limited long-horizon reasoning capacity with browsing tools and the lack of sufficiently difficult supervised data. To address these challenges, we present DeepDive to advance deep search agents. First, we propose a strategy to automatically synthesize complex, difficult, and hard-to-find questions from open knowledge graphs. Second, we apply end-to-end multi-turn reinforcement learning (RL) to enhance LLMs' long-horizon reasoning with deep search. Experiments show that DeepDive-32B achieves a new open-source competitive result on BrowseComp, outperforming WebSailor, DeepSeek-R1-Browse, and Search-o1. We demonstrate that multi-turn RL training improves deep search ability and significantly contributes to the performance improvements across multiple benchmarks. We observe that DeepDive enables test-time scaling of tool calls and parallel sampling. All datasets, models, and code are publicly available at https://github.com/THUDM/DeepDive.
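To make the question-synthesis idea concrete, here is a minimal sketch of how a multi-hop question could be derived from a walk over a knowledge graph. It is an illustration under stated assumptions, not the DeepDive pipeline: the toy graph `TOY_KG`, the helpers `random_walk` and `path_to_question`, and the question template are all hypothetical, and the abstract does not specify how walks are sampled or how questions are phrased.

```python
import random

# Hypothetical toy knowledge graph: entity -> list of (relation, neighbor).
# A real open knowledge graph would be far larger and noisier.
TOY_KG = {
    "Marie Curie": [("was born in", "Warsaw")],
    "Warsaw": [("is the capital of", "Poland")],
    "Poland": [("is a member of", "European Union")],
}


def random_walk(kg, start, hops, rng):
    """Walk up to `hops` edges from `start`, never revisiting an entity."""
    path = []
    node = start
    visited = {start}
    for _ in range(hops):
        candidates = [(rel, nxt) for rel, nxt in kg.get(node, []) if nxt not in visited]
        if not candidates:
            break  # dead end: return the shorter path
        rel, nxt = rng.choice(candidates)
        path.append((node, rel, nxt))
        visited.add(nxt)
        node = nxt
    return path


def path_to_question(path):
    """Turn a walk into a (question, answer) pair that hides intermediate entities."""
    if not path:
        raise ValueError("path must contain at least one edge")
    start = path[0][0]
    relations = [rel for _, rel, _ in path]
    answer = path[-1][2]
    chain = ", then ".join(f"'{rel}'" for rel in relations)
    question = f"Starting from {start}, follow the relations {chain}. Which entity do you reach?"
    return question, answer


if __name__ == "__main__":
    walk = random_walk(TOY_KG, "Marie Curie", hops=2, rng=random.Random(0))
    question, answer = path_to_question(walk)
    print(question)
    print(answer)
```

Longer walks produce questions that require more hops to resolve, which is one plausible way to control difficulty in a sketch like this.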

Authors

Rui Lu

Zhenyu Hou

Zihan Wang
