An improved actor-critic architecture with PPO for the traveling salesman problem | Synapse