What question did this study set out to answer?

The aim is to enhance mobile robot path planning using deep reinforcement learning to handle dynamic environments more effectively.

April 12, 2026Open Access

Integrating self-attention and LSTM into TD3 for robust mobile robot navigation in dynamic environments

Key Points

The aim is to enhance mobile robot path planning using deep reinforcement learning to handle dynamic environments more effectively.
Developed the Self-Attention LSTM TD3 (SAL-TD3) algorithm with LSTM networks and multi-head self-attention.
Implemented a rank-based prioritized experience replay with n-step returns to boost sample efficiency.
Designed a composite reward function to provide dense feedback for efficient policy learning.
SAL-TD3 achieved a 91% success rate compared to 77% for TD3.
Reduced path length by 16.6%.
Lowered collision rate from 23% to 9%.

Abstract

Mobile robot path planning in dynamic environments is challenging because existing deep reinforcement learning methods lack temporal memory, suffer from inefficient sample utilization under uniform replay, and face credit assignment difficulties with sparse rewards. This paper proposes the Self-Attention LSTM TD3 (SAL-TD3) algorithm, which integrates LSTM networks and multi-head self-attention into the TD3 framework to capture temporal dependencies for proactive obstacle avoidance. A rank-based prioritized experience replay with n-step returns improves sample efficiency, and a composite reward function provides dense feedback for efficient policy learning. Experiments show that SAL-TD3 achieves a 91% success rate (vs. 77% for TD3), reduces path length by 16.6%, and lowers collision rate from 23% to 9%. Generalization tests and real-world robot deployment confirm robust sim-to-real transfer performance.

Perguntar à IA

Bookmark

View Full Paper

Cite This Study

Chen et al. (Fri,) studied this question.

synapsesocial.com/papers/69db37b04fe01fead37c5c36 https://doi.org/https://doi.org/10.1038/s41598-026-45819-0

Perguntar à IA

Bookmark

View Full Paper