DPO vs. RLHF: An Empirical Comparison of Alignment Techniques for Large Language Models | Synapse