One of many challenges to open science is anonymization of personal data so that it may be shared. This paper presents a case study of the anonymization of a dataset containing cardio-respiratory fitness and commuting patterns for Slovenian school children. It evaluates three different anonymization tools, ARX, SDV, and SynDiffix. The fitness study was selected because its small size (N=713) and generally low statistical significance make it particularly challenging for data anonymization. Unlike most prior anonymization tool evaluations, this paper examines whether the scientific conclusions of the original study would have been supported by the anonymized datasets. It also considers the burden imposed on researchers using the tools both for data generation and data analysis.
Building similarity graph...
Analyzing shared references across papers
Loading...
Paul Francis
University of Kaiserslautern
Gregor Jurak
University of Ljubljana
Bojan Leskošek
University of Ljubljana
Scientific Data
University of Ljubljana
Berlin Institute of Health at Charité - Universitätsmedizin Berlin
Max Planck Institute for Software Systems
Building similarity graph...
Analyzing shared references across papers
Loading...
Francis et al. (Thu,) studied this question.
synapsesocial.com/papers/68d46cd731b076d99fa695a7 — DOI: https://doi.org/10.1038/s41597-025-05823-x