A distributed adaptive policy gradient method based on momentum for multi-agent reinforcement learning | Synapse