What question did this study set out to answer?

The research aims to develop a framework for efficient robot navigation while protecting data privacy.

April 12, 2026Open Access

SPHTRLM: secure and privacy-preserving hyperparameter-tuned reinforcement learning method for robot path finding in dynamic environments

Key Points

The research aims to develop a framework for efficient robot navigation while protecting data privacy.
Developed SPHTRLM framework incorporating Q-learning and federated learning for updates
Implemented refined differential privacy and minimal encrypted parameter exchange
Utilized adaptive reward shaping and automatic hyperparameter optimization
Included mobility conscious aggregation for resource-limited robotic platforms
SPHTRLM achieved a success rate of 95% ± 2% in path planning
Reduced average path distance by 20-25% compared to traditional methods
Improved convergence speed by approximately 35%
Achieved a collision rate of 0.08 with dense obstacles
Maintained real-time decision time of 110-125 ms with minimal computational costs of 8-12%

Abstract

Autonomous robot navigation within a dynamic environment is a complicated issue since environmental factors keep on changing, safety remains a factor, and issues of data privacy concern are also on the increase. The existing reinforcement learning (RL) navigation systems mainly focus on path performance and avoidance of collisions but do not focus on privacy protection, adaptation learning stability, and real deployment. This research aims to overcome these constraints by suggesting a novel framework Secure and Privacy-Preserving Hyperparameter-Tuned RL Model (SPHTRLM) to the efficient generation of path plans in grid ecosystems with dynamic environments. The framework incorporates adjusted Q-learning with federated learning (FL) based distributed updates, refined differentiated privacy, minimal encrypted parameter exchange, adaptive reward shaping and automatic hyperparameter optimization. In a further attempt to enhance practicability, the proposed architecture also embraces mobility conscious aggregation and heterogeneous model support of resource-limited robotic platforms. The suggested SPHTRLM has a success rate of (95% ± 2%), and it is better than the comparable one Q-learning (87% ± 4%) and Deep RL (DRL) baselines (88%) when these methods were evaluated under the same condition. The framework minimizes distances to the average path with a reduction of 20–25% and convergence is speeded up by around 35% compared to normal Q-learning. When the obstacles are very thick then the collision rate becomes and the obstacle reduces to 0.08, and the safety of the navigation process improves. Although there are additional privatization mechanisms, the computational costs are minimal (8–12%), and the average decision time is 110–125 ms, which meets the real-time operational capabilities. Privacy analysis with formally stated membership inference and reconstruction attacks provide status of attack rate less than 5% attack success with both white and black box adversary. These findings underscore that SPHTRLM is a feasible way of achieving the goals of ensuring navigation, learning consistency, safety as well as privacy protection to give credible acceptance to using autonomous robotic systems in dynamic and data-sensitive environment.

Bookmark

View Full Paper

Bookmark

View Full Paper

SPHTRLM: secure and privacy-preserving hyperparameter-tuned reinforcement learning method for robot path finding in dynamic environments

Key Points

Abstract

Cite This Study