Hu, R. and Lin, Y. and Yang, Mu and Yu, Y. and Sassone, V. (2025) Two-stage mining of linkage risk for data release. Mathematics 13 (17), p. 2731. ISSN 2227-7390.
Text: mathematics-3780799-fc done.pdf - Published Version of Record (483kB). Available under License Creative Commons Attribution.
Abstract
Privacy risk mining, a crucial domain in data privacy protection, endeavors to uncover potential information among datasets that could be linked to individuals’ sensitive data. Existing anonymization and privacy assessment techniques either lack quantitative granularity or fail to adapt to dynamic, heterogeneous data environments. In this work, we propose a unified two-phase linkability quantification framework that systematically measures privacy risks at both the inter-dataset and intra-dataset levels. Our approach integrates unsupervised clustering on attribute distributions with record-level matching to compute interpretable, fine-grained risk scores. By aligning risk measurement with regulatory standards such as the GDPR, our framework provides a practical, scalable solution for safeguarding user privacy in evolving data-sharing ecosystems. Extensive experiments on real-world and synthetic datasets show that our method achieves up to 96.7% precision in identifying true linkage risks, outperforming the compared baseline by 13 percentage points under identical experimental settings. Ablation studies further demonstrate that the hierarchical risk fusion strategy improves sensitivity to latent vulnerabilities, providing more actionable insights than previous privacy gain-based metrics.
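As a rough illustration of the two-stage idea summarized in the abstract, the sketch below first looks for attributes that could link two datasets and then scores each released record by how narrowly it matches an auxiliary dataset. It is not the authors' implementation: the abstract mentions unsupervised clustering on attribute distributions, whereas this sketch substitutes a simple Jaccard overlap of value sets, and the function names, the `threshold` parameter, the 1/k risk score, and the toy records are all assumptions made for illustration.

```python
from collections import Counter


def value_overlap(col_a, col_b):
    """Jaccard overlap of the distinct values in two columns."""
    a, b = set(col_a), set(col_b)
    return len(a & b) / len(a | b) if (a | b) else 0.0


def linkable_attributes(released, auxiliary, threshold=0.5):
    """Stage 1 (inter-dataset): same-named attributes whose value sets overlap enough.

    Assumption: a plain overlap score stands in for the clustering step.
    """
    shared = set(released[0]) & set(auxiliary[0])
    return sorted(
        attr
        for attr in shared
        if value_overlap(
            [row[attr] for row in released],
            [row[attr] for row in auxiliary],
        ) >= threshold
    )


def record_risks(released, auxiliary, attrs):
    """Stage 2 (record level): risk 1/k, where k auxiliary records match on attrs."""
    if not attrs:
        return [0.0] * len(released)
    counts = Counter(tuple(row[a] for a in attrs) for row in auxiliary)
    risks = []
    for row in released:
        k = counts[tuple(row[a] for a in attrs)]
        risks.append(1.0 / k if k else 0.0)
    return risks


# Hypothetical toy data: a released table and an auxiliary table with names.
released = [
    {"zip": "E1", "age": 30, "diagnosis": "flu"},
    {"zip": "E2", "age": 41, "diagnosis": "asthma"},
    {"zip": "E3", "age": 52, "diagnosis": "none"},
]
auxiliary = [
    {"zip": "E1", "age": 30, "name": "A"},
    {"zip": "E2", "age": 41, "name": "B"},
    {"zip": "E2", "age": 41, "name": "C"},
    {"zip": "E9", "age": 77, "name": "D"},
]

attrs = linkable_attributes(released, auxiliary)
risks = record_risks(released, auxiliary, attrs)
print(attrs, risks)
```

In this toy setup a record that matches exactly one auxiliary record receives the highest score, a record that matches several receives a proportionally lower one, and a record with no match receives zero. A real assessment would also need to handle attributes that are named differently across datasets and values that match only approximately.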
Metadata
Item Type: | Article |
---|---|
School: | Birkbeck Faculties and Schools > Faculty of Business and Law > Birkbeck Business School |
Depositing User: | Mu Yang |
Date Deposited: | 04 Sep 2025 15:49 |
Last Modified: | 05 Sep 2025 10:29 |
URI: | https://eprints.bbk.ac.uk/id/eprint/56110 |