Accelerating Nash Learning from Human Feedback via Mirror Prox | Synapse