Expert-guided and action-compensated deep reinforcement learning for robust multi-ship collision avoidance in dynamic and uncertain maritime environments