iREPO: implicit Reward Pairwise Difference based Empirical Preference Optimization