Deep Reinforcement Learning has shown immense potential in autonomous driving decision-making; however, its application in safety–critical scenarios such as highways still faces significant challenges in data efficiency and safety performance. To address this, this paper proposes an LLM-guided Safety-Aware DQN (LSA-DQN) framework. This framework first utilizes a multi-head attention network to enhance the comprehension of complex traffic scenarios. Building on this, we design a method combining physics-based pre-screening and Large Language Model (LLM) arbitration to accurately classify experience data. Furthermore, we devise a hybrid safety regularization method that integrates Conservative Q-Learning with a Margin-based Contrastive Penalization (MCP) to learn explicit safety boundaries. Experimental results demonstrate that, compared to baseline algorithms, LSA-DQN reduces the collision rate to 0.9% while maintaining high traffic efficiency, proving its high robustness and reliability in complex and dynamic highway environments.
Ren et al. (Fri,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: