What question did this study set out to answer?

This research aims to develop a deep reinforcement learning method to improve vessel trajectory prediction in maritime navigation.

June 4, 2026Open Access

A CNN-PPO Deep Reinforcement Learning Method for Maritime Target Motion Prediction

Key Points

This research aims to develop a deep reinforcement learning method to improve vessel trajectory prediction in maritime navigation.
Trajectory prediction formulated as a Markov Decision Process (MDP).
Convolutional neural network (CNN) utilized for policy network parameterization.
CNN-PPO deep reinforcement learning method trained on historical vessel trajectories.
Proposed algorithm accurately predicts target position at future moments.
Demonstrated advantages in multi-moment trajectory prediction compared to existing methods.

Abstract

In recent years, with the continuous development of the global economy, maritime transportation has become an essential mode of transportation for both domestic and international trade, making it an urgent task for countries around the world to improve the intelligence level of maritime traffic management. As a key technology in real-time vessel monitoring, traffic hazard early warning, and traffic flow estimation, vessel trajectory prediction has been widely applied across both civil and commercial sectors. To address the problems with existing trajectory prediction methods, such as difficulty in establishing real-time vessel kinematic models, limited understanding of vessel navigation strategies, and poor adaptability for long-term prediction, this paper proposes a deep reinforcement learning-based vessel trajectory prediction method. Firstly, the trajectory prediction problem is formulated as a Markov Decision Process (MDP), thereby transforming the prediction problem into an optimal policy-solving problem of the MDP. Secondly, given the complexity of the trajectory prediction problem, a convolutional neural network (CNN) is adopted to parameterize the policy network, and a CNN-PPO deep reinforcement learning method is used to learn the target vessel’s navigation strategy from historical trajectories. Comparative experimental results show that the proposed algorithm is not only suitable for predicting the target’s position at a specific future moment but also offers clear advantages in trajectory prediction across multiple future moments.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Zheng et al. (Mon,) studied this question.

synapsesocial.com/papers/6a21171dd499ed480b16ff6f https://doi.org/https://doi.org/10.3390/app16115509

Bookmark

View Full Paper