Intuitive Fine-Tuning: Towards Unifying SFT and RLHF into a Single Process | Synapse