How Well Can Preference Optimization Generalize Under Noisy Feedback? | Synapse