A Proximal Policy Optimization method in UAV swarm formation control | Synapse