Machine unlearning (MU) aims to eliminate information that has been learned from specific training data, namely forgetting data, from a pretrained model. Currently, the mainstream of relabeling-based MU methods involves modifying the forgetting data with incorrect labels and subsequently fine-tuning the model. While learning such incorrect information can indeed remove knowledge, the process is quite unnatural as the unlearning process undesirably reinforces the incorrect information and leads to over-forgetting. Towards more natural machine unlearning, we inject correct information from the remaining data to the forgetting samples when changing their labels. Through pairing these adjusted samples with their labels, the model tends to use the injected correct information and naturally suppress the information meant to be forgotten. Albeit straightforward, such a first step towards natural machine unlearning can significantly outperform current state-of-the-art approaches. In particular, our method substantially reduces the over-forgetting problem and leads to strong robustness across different unlearning tasks, making it a promising candidate for practical machine unlearning.
Building similarity graph...
Analyzing shared references across papers
Loading...
Zhengbao He
Shanghai Jiao Tong University
Tao Li
Shanghai Jiao Tong University
Xinwen Cheng
Changchun University of Science and Technology
IEEE Transactions on Pattern Analysis and Machine Intelligence
Shanghai Jiao Tong University
Building similarity graph...
Analyzing shared references across papers
Loading...
He et al. (Wed,) studied this question.
synapsesocial.com/papers/68a3633d0a429f7973329f18 — DOI: https://doi.org/10.1109/tpami.2025.3597350