Policy gradient reinforcement learning for fast quadrupedal locomotion | Synapse