What question did this study set out to answer?

This research aims to develop a navigation framework for quadrotors using large language models (LLMs) in unknown environments.

February 16, 2026Open Access

A Visual Target Navigation Method for Quadrotor Based on Large Language Model in Unknown Environment

Key Points

This research aims to develop a navigation framework for quadrotors using large language models (LLMs) in unknown environments.
Developed a visual target navigation framework utilizing LLMs.
Designed an intelligent planner using prompt templates in two phases: search sequencing and sub-goal generation.
Employed path planning algorithms for quadrotor navigation.
Simulation results show approximately 20% performance improvement over single-modality baselines.
Physical flight experiments yielded success rates of 56% in Cross-layout and 48% in T-shaped layout scenarios.
Challenges noted include perceptual occlusion and planning uncertainty.

Abstract

This paper proposes a novel Large Language Model (LLM)-based visual target navigation framework for quadrotors in unknown environments. Leveraging the semantic knowledge of LLMs, our method enables autonomous exploration based on natural language instructions. We design an intelligent planner using specialized prompt templates that operates in two phases: first, deriving global search sequences via probabilistic inference; second, dynamically generating sub-goal waypoints by fusing visual observations with statistical priors and LLM-derived scene relevance metrics. The quadrotor then executes a progressive search via path planning algorithms. Simulation results indicate that our fused method outperforms single-modality baselines by approximately 20%. Furthermore, physical flight experiments demonstrate success rates of 56% in Cross-layout and 48% in T-shaped layout scenarios. These results, while reflecting the inherent challenges of perceptual occlusion and planning uncertainty, validate the feasibility and potential of the proposed framework in real-world applications.

Read Full Paperexternally

Ask AI

Helpful

Bookmark

View Full Paper