What question did this study set out to answer?

May 20, 2026

Shielding PII to Prevent Re-identification and Preserve Utility

Key Points

The study addresses how to protect Personally Identifiable Information (PII) in textual data while maintaining data utility.
The problem is modeled as a two-player Stackelberg game between an attacker and a protector.
The authors developed SHIELD, an attack-aware PII protection system that integrates logical reasoning with machine learning.
SHIELD was evaluated empirically on synthetic and real-world datasets.
SHIELD offers a better privacy-utility trade-off than prior PII protection systems.
SHIELD achieves a constant-factor approximation for utility loss while mitigating PII breach risks.

Abstract

This paper addresses the challenge of protecting Personally Identifiable Information (PII) in textual data, identifying and anonymizing PII to ensure privacy and regulatory compliance, while preserving data utility. We model this bi-criteria optimization problem as a two-player Stackelberg game, where an attacker seeks to link anonymized data back to individuals and a protector anonymizes the data to prevent re-identification. We show that the problem is intractable. Thus we develop SHIELD, an attack-aware PII protection system that iteratively engages the protector and attacker to prevent both PII breaches and over-scrubbing. SHIELD integrates logical reasoning with machine learning to identify PII, and supports pluggable attackers for robustness against re-identification. It achieves a constant-factor approximation for utility loss while mitigating risk. Using synthetic and real-world datasets, we empirically show that SHIELD offers better privacy-utility trade-off than prior PII protection systems, while remaining efficient and scalable.
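
The protector-attacker loop described in the abstract can be illustrated with a toy sketch. This is not SHIELD's algorithm: the attribute-matching attacker, the greedy masking rule, the threshold `k`, and every name below are assumptions made for illustration only. The protector masks one attribute at a time and stops as soon as the attacker can no longer narrow the record down to fewer than `k` people, which is one simple way to avoid both re-identification and over-scrubbing.

```python
# Toy sketch only; all names and rules here are hypothetical, not taken from the paper.
MASK = "[REDACTED]"


def attacker_candidates(record, population):
    """Toy attacker: return every person consistent with the unmasked attributes."""
    return [
        person
        for person in population
        if all(value == MASK or person.get(attr) == value
               for attr, value in record.items())
    ]


def protect(record, population, k=2):
    """Greedy protector: mask attributes until the attacker has at least k candidates."""
    protected = dict(record)
    while True:
        # Attacker moves: try to link the current record back to individuals.
        if len(attacker_candidates(protected, population)) >= k:
            return protected
        visible = [attr for attr, value in protected.items() if value != MASK]
        if not visible:
            return protected  # nothing left to mask

        def candidates_after_masking(attr):
            trial = dict(protected)
            trial[attr] = MASK
            return len(attacker_candidates(trial, population))

        # Protector moves: mask the attribute that most confuses the attacker.
        best = max(visible, key=candidates_after_masking)
        protected[best] = MASK


population = [
    {"name": "Ana", "city": "Lisbon", "job": "nurse"},
    {"name": "Bo", "city": "Lisbon", "job": "nurse"},
    {"name": "Cy", "city": "Porto", "job": "chef"},
]
protected = protect(population[0], population, k=2)
print(protected)
```

In this toy run only the attribute that uniquely identifies the record needs to be masked, so the remaining attributes stay available for downstream use. A real system would replace the exact-match attacker with a stronger, pluggable one and would measure utility loss more carefully than by counting masked fields.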

Cite This Study

Liu et al. studied this question.

synapsesocial.com/papers/6a0d5013f03e14405aa9bafe https://doi.org/10.1145/3802109
