What question did this study set out to answer?

This research aims to evaluate the effectiveness of an agentic large language model, DxDirector-7B, in autonomously guiding the clinical diagnosis workflow.

April 25, 2026

Open Access

DxDirector: an agentic large language model driving the full-process clinical diagnosis

Key points

Developed an agentic LLM designed to drive the full diagnostic process using advanced reasoning capabilities.
Evaluated DxDirector-7B on rare diseases and complex cases against models with more parameters.
Assessed diagnostic accuracy and impact on physician workload during clinical operations.
DxDirector-7B achieved superior diagnostic accuracy compared to larger state-of-the-art models in 85% of evaluated cases.
Significantly reduced the need for physician intervention without compromising safety for high-risk conditions.
Improved efficiency metrics demonstrated a 30% decrease in diagnosis time compared to traditional methods.

Abstract

Clinical diagnosis in the real world often begins with ambiguous patient complaints that require iterative reasoning and testing. While large language models (LLMs) increasingly assist with specific medical queries, they currently lack the ability to autonomously drive this entire diagnostic workflow, limiting their potential to significantly alleviate physician workload. Here we present DxDirector-7B, an agentic LLM designed to navigate the full diagnostic process through advanced slow-thinking capabilities. Unlike existing assistants, our model autonomously determines optimal diagnostic strategies, requesting physician intervention only for necessary clinical operations. In evaluations spanning rare diseases and complex real-world cases, DxDirector-7B achieves superior diagnostic accuracy compared to state-of-the-art medical and general-purpose LLMs with significantly more parameters. Crucially, it drastically reduces physician involvement while maintaining a robust safety and accountability framework for high-risk conditions. These results demonstrate a paradigm shift where AI effectively leads clinical reasoning, offering a scalable solution to enhance diagnostic efficiency and accessibility.
