Los puntos clave no están disponibles para este artículo en este momento.
Deep reinforcement learning (DRL) trains agents to make decisions by learning from rewards and penalties, using trial and error. It combines reinforcement learning with deep neural networks, enabling agents to process large datasets and learn from complex environments. DRL has achieved notable success in gaming, robotics, decision-making, etc. However, real-world applications, such as self-driving cars, face challenges due to complex state and action spaces, requiring precise control. Researchers continue to develop new algorithms to improve performance in dynamic settings. A key algorithm, Deep Q-Network (DQN), uses neural networks to approximate the Q-value function, but suffers from overestimation, leading to suboptimal outcomes. To address this, Double Deep Q-Network (DDQN) was introduced to reduce bias by separating action selection from evaluation, resulting in more stable learning. This work examines the effectiveness of DQN and DDQN in autonomous driving using the CARLA simulator, highlighting DDQN's benefits in reducing bias and enhancing policy performance.
Khlifi et al. (Wed,) studied this question.