What question did this study set out to answer?

This research aims to assess the effectiveness of federated graph unlearning methods in achieving multi-level indistinguishability and geometric privacy compliance.

May 1, 2026Open Access

Federated Illusion: Multi-Level Geometric Privacy Audit for Federated Graph Unlearning

Key Points

This research aims to assess the effectiveness of federated graph unlearning methods in achieving multi-level indistinguishability and geometric privacy compliance.
Conducted 31,900 trials across five graph benchmarks and five federated unlearning methods.
Developed a five-model threat taxonomy and extended the Hub-Ripple embedding drift audit.
Evaluated various supplementary ablations including K-value, cross-edge handling, and dp-sgd defense.
All approximate methods failed to satisfy multi-level indistinguishability; Confidence-Embedding Gap was 0.12 versus 0.35 centralized.
Cross-client leakage correlated with shared cross-edge count (r=0.56, p<10−160).
FedRetrain showed residual cross-client leakage with Cross-Mean L2 AUC =0.62±0.04 after achieving global and local indistinguishability.

Abstract

Machine unlearning in federated graph learning must satisfy the multi-level indistinguishability requirement of the deletion of a target node being undetectable at the level of the global model, of the unlearning client’s local model, and of every non-target client’s local model. Approximate unlearning methods that pass confidence-based audits may still leave geometric traces through embedding drift at one or more of these K+1 levels. We formalize this requirement, introduce a five-model threat taxonomy, and extend the Hub–Ripple embedding drift audit to global, local, and cross-client levels. Across 31,900 trials spanning five graph benchmarks, five federated unlearning methods, and four supplementary ablations (K-value, cross-edge handling, control sampling, and DP-SGD defense), we find that all approximate methods fail the following multi-level requirement: the Confidence–Embedding Gap persists at 0.12 (versus 0.35 centralized), cross-client leakage correlates with shared cross-edge count (r=0.56, p<10−160), and a federated participant outperforms a white-box external auditor (AUC 0.83 versus 0.81). Client-level unlearning is more detectable at the global level than node-level unlearning (AUC 0.81 versus 0.77), contradicting the intuition that coarser deletion yields stronger privacy. FedRetrain satisfies global and local indistinguishability but exhibits residual cross-client leakage (Cross-Mean L2 AUC =0.62±0.04) because re-aggregation itself perturbs the global parameter vector. No method evaluated achieves full multi-level indistinguishability. Supplementary studies confirm that this is a structural property of FedAvg; DP-SGD reduces Cross L2 AUC by only 0.013 at the cost of a 79% accuracy drop, and FedSage-like neighbor sharing does not change the leakage profile. Multi-level geometric auditing, spanning all K+1 models, is the necessary evaluation floor that any method claiming verifiable privacy compliance must satisfy.

Read Full Paperexternally

Mark Helpful

Bookmark

Relay

View Full Paper