What type of study is this?

This is an experimental study.

October 13, 2025 | Open Access

Delving into Instance-Dependent Label Noise in Graph Data: A Comprehensive Study and Benchmark

Key Points

Experiments demonstrate that instance-dependent label noise significantly impacts GNN performance and robustness.
The BeGIN benchmark facilitates evaluation of various noise-handling strategies across different GNN architectures.
Node-specific parameterization is crucial for enhancing GNN robustness in the presence of label noise.
Algorithmic methods and simulations show the complexities of instance-dependent noise, particularly with LLM-based corruption.

Abstract

Graph Neural Networks (GNNs) have achieved state-of-the-art performance in node classification tasks but struggle with label noise in real-world data. Existing studies on graph learning with label noise commonly rely on class-dependent label noise, overlooking the complexities of instance-dependent noise and falling short of capturing real-world corruption patterns. We introduce BeGIN (Benchmarking for Graphs with Instance-dependent Noise), a new benchmark that provides realistic graph datasets with various noise types and comprehensively evaluates noise-handling strategies across GNN architectures, noisy label detection, and noise-robust learning. To simulate instance-dependent corruptions, BeGIN introduces algorithmic methods and LLM-based simulations. Our experiments reveal the challenges of instance-dependent noise, particularly LLM-based corruption, and underscore the importance of node-specific parameterization to enhance GNN robustness. By comprehensively evaluating noise-handling strategies, BeGIN provides insights into their effectiveness, efficiency, and key performance factors. We expect that BeGIN will serve as a valuable resource for advancing research on label noise in graphs and fostering the development of robust GNN training methods. The code is available at https://github.com/kimsu55/BeGIN.
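
The distinction the abstract draws between class-dependent and instance-dependent label noise can be illustrated with a minimal sketch. This is not BeGIN's corruption procedure (see the linked repository for that); the function names, the class prototypes, and the rule "flip the nodes whose features lie closest to a wrong class" are assumptions chosen only to show the difference. In the class-dependent case the flip probability depends solely on the true class, while in the instance-dependent case it depends on each node's own features.

```python
import random


def class_dependent_noise(labels, transition, rng):
    """Corrupt labels using only the true class.

    transition[y][k] is the probability that true class y is observed as k,
    so every node of the same class is treated identically.
    """
    classes = list(range(len(transition)))
    return [rng.choices(classes, weights=transition[y])[0] for y in labels]


def instance_dependent_noise(labels, features, prototypes, noise_rate):
    """Corrupt labels using each node's features (hypothetical rule).

    Nodes whose features sit closest to a wrong class prototype, relative to
    their own class prototype, are flipped first, up to noise_rate of all
    nodes. Ambiguous nodes are therefore mislabeled more often than typical
    ones, which a class-level transition matrix cannot express.
    """
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    scored = []
    for i, (y, x) in enumerate(zip(labels, features)):
        # Nearest class prototype other than the true class.
        wrong = min(
            (c for c in range(len(prototypes)) if c != y),
            key=lambda c: sq_dist(x, prototypes[c]),
        )
        # Small (or negative) margin means the node looks like the wrong class.
        margin = sq_dist(x, prototypes[wrong]) - sq_dist(x, prototypes[y])
        scored.append((margin, i, wrong))

    scored.sort()
    n_flip = int(noise_rate * len(labels))
    noisy = list(labels)
    for _, i, wrong in scored[:n_flip]:
        noisy[i] = wrong
    return noisy
```

For example, calling `instance_dependent_noise` with a noise rate of 0.5 on four nodes flips the two nodes whose features most resemble another class, whereas `class_dependent_noise` with the same overall rate flips nodes at random within each class regardless of their features.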

Authors

Suyeon Kim

SeongKu Kang

Korea University

Dongwoo Kim

The University of Sydney