May 22, 2026 | Open Access

GNN enhanced reinforcement learning for robot navigation in complex topological networks

Key Points

The study aims to improve robot navigation in complex environments by integrating Graph Neural Networks (GNNs) with Reinforcement Learning (RL).
The GNN-RL framework is built on the Soft Actor-Critic (SAC) algorithm.
Environmental features are encoded as graph nodes and edges, giving the agent a structured representation of its surroundings.
The GNN encoder and the RL agent are coupled through a dynamic collaborative mechanism, with a multi-objective reward guiding training.
Compared with DQN and PPO, the GNN-RL method improves perception accuracy and decision-making efficiency.
In simulated complex topological environments, the method performs favorably and outperforms traditional algorithms such as A*.
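
The encoding step in the key points can be illustrated with a minimal sketch. It is not the paper's model: the entity names, the two-dimensional features, and the distance-based softmax used for edge weighting are all assumptions chosen for illustration, and no parameters are learned. It shows one round of weighted neighbor aggregation followed by mean pooling into a graph-level vector.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def message_pass(node_feats, edges):
    """One round of weighted neighbor aggregation (hypothetical helper).

    node_feats: {node_id: [float, ...]}
    edges: {node_id: [(neighbor_id, distance), ...]}
    """
    updated = {}
    for node, feats in node_feats.items():
        nbrs = edges.get(node, [])
        if not nbrs:
            updated[node] = list(feats)
            continue
        # Assumed edge weighting: closer neighbors receive larger weights.
        weights = softmax([-dist for _, dist in nbrs])
        agg = [0.0] * len(feats)
        for (nbr, _), w in zip(nbrs, weights):
            for i, value in enumerate(node_feats[nbr]):
                agg[i] += w * value
        # Average the node's own features with the aggregated message.
        updated[node] = [0.5 * (a + b) for a, b in zip(feats, agg)]
    return updated

def graph_embedding(node_feats):
    """Mean-pool node embeddings into one graph-level vector."""
    n = len(node_feats)
    dim = len(next(iter(node_feats.values())))
    return [sum(f[i] for f in node_feats.values()) / n for i in range(dim)]

# Toy scene: three entities with made-up features and distances.
nodes = {"robot": [1.0, 0.0], "obstacle": [0.0, 1.0], "goal": [0.5, 0.5]}
edges = {
    "robot": [("obstacle", 1.0), ("goal", 3.0)],
    "obstacle": [("robot", 1.0)],
    "goal": [("robot", 3.0)],
}
node_emb = message_pass(nodes, edges)
graph_emb = graph_embedding(node_emb)
```

In a trained model the fixed softmax and the 0.5 averaging would be replaced by learned transformations, and several such layers would be stacked.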

Abstract

To address the challenges that intelligent robots face in perceiving high-dimensional environmental states and making adaptive trajectory-planning decisions in complex topological environments, this paper presents an integrated Graph Neural Network–Reinforcement Learning (GNN-RL) framework built on the Soft Actor-Critic (SAC) algorithm for continuous control tasks. First, leveraging the topological modeling capability of GNNs, environmental entities are abstracted into graph nodes, and their spatial constraints and semantic associations are encoded as edge features. Through multi-layer graph convolution and adaptive edge weighting, high-dimensional structured environmental information is compressed into low-dimensional node-level and graph-level embeddings that retain rich topological semantics. These embeddings give the subsequent reinforcement learning module a structured view of the environment, alleviating the curse of dimensionality and enabling efficient action selection. Second, a dynamic collaborative mechanism is established between the GNN encoder and the SAC-based RL agent. The topological features extracted by the GNN are fed to the RL agent, which consists of twin Q-networks, a policy network, and a value network. A multi-objective reward function that integrates safety, progress, and motion smoothness guides the agent’s trial-and-error exploration. In this way, static topological representations are transformed into dynamic trajectory policies, while the GNN parameters are optimized end-to-end, jointly with the agent, through the gradient signals of the RL loss function, overcoming the limitations of purely static graph learning. Finally, comparative experiments are conducted in simulated complex topological environments, evaluating the proposed GNN-RL approach against the DQN, PPO, and A* algorithms. The results show that the GNN-RL method achieves a favorable balance between perception accuracy and decision-making efficiency, providing a reliable and adaptive solution for robot navigation and trajectory planning in structured, dynamic environments.
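
The abstract names three reward components (safety, progress, and motion smoothness) but does not give their form. The sketch below shows one plausible way to combine such terms. The function name, the weights, the safety radius, and the linear penalty are all hypothetical and should not be read as the paper's reward.

```python
import math

def navigation_reward(prev_dist, curr_dist, min_obstacle_dist,
                      prev_action, action,
                      w_safety=1.0, w_progress=1.0, w_smooth=0.1,
                      safe_radius=0.5):
    """Weighted sum of three assumed reward terms for one time step."""
    # Progress: positive when the robot moved closer to the goal.
    progress = prev_dist - curr_dist
    # Safety: linear penalty once the nearest obstacle is inside safe_radius.
    safety = -max(0.0, safe_radius - min_obstacle_dist)
    # Smoothness: penalize large changes between consecutive actions.
    smooth = -math.sqrt(sum((a - b) ** 2 for a, b in zip(action, prev_action)))
    return w_safety * safety + w_progress * progress + w_smooth * smooth

# Same motion, evaluated near an obstacle and far from one.
r_near = navigation_reward(2.0, 1.8, 0.3, (0.5, 0.0), (0.5, 0.1))
r_far = navigation_reward(2.0, 1.8, 2.0, (0.5, 0.0), (0.5, 0.1))
```

With these assumed weights, the step taken near the obstacle scores lower than the identical step taken in open space, which is the trade-off a safety term is meant to express.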
