Coupled Penalties-Augmented Proximal Policy Optimization for Safe Reinforcement Learning | Synapse