UniAPL: A Unified Adversarial Preference Learning Framework for Instruct-Following | Synapse