Key points are not available for this paper at this time.
It is expected that human-driven vehicles and autonomous vehicles (AVs) will coexist in a mixed traffic for a long time. To enable AVs to safely and efficiently maneuver in this mixed traffic, it is critical that the AVs can understand how humans cope with risks and make driving-related decisions. In this work, we incorporate a human decision-making model in reinforcement learning to control AVs for safe and efficient operations. Specifically, we adapt regret theory to describe a human driver's lane-changing behavior and t the personalized models to individual drivers for predicting their lane-changing decisions. The predicted decisions are incorporated in the safety regulations for reinforcement learning in training and in implementation. We then use an extended version of double deep Q-network (DDQN) to train our AV controller within the safety set. By doing so, the number of collisions in training and testing is reduced to zero, while the training accuracy is not impinged.
Chen et al. (Wed,) studied this question.