Full Citation
Title: A Multi-phase k-anonymity Algorithm Based on Clustering Techniques
Citation Type: Book, Section
Publication Year: 2013
ISBN:
ISSN:
DOI: 10.1007/978-3-642-35795-4_46
NSFID:
PMCID:
PMID:
Abstract: We proposed a new k-anonymity algorithm to publish datasets with privacy protection. We improved clustering techniquesto lower data distort and enhance diversity of sensitive attributes values. Our algorithm includes four phases. Tuples are distributed to several groups in phase one. Tuples in a group own same sensitive value. In phase two, groups smaller than the threshold merge and then they are partitioned into several clusters according to quasi-identifier attributes. Each cluster would become an equivalence class. In phase three, remainder tuples are distributed to clusters evenly to satisfy L-diversity. Finally, quasi-identifier attributes values in each cluster are generalized to satisfy k-anonymity. We used OCC dataset to compare our algorithm with classic method based on clustering. Empirical results showed that our algorithm could be used to publish datasets with high security and limited information loss.
Url: http://link.springer.com/10.1007/978-3-642-35795-4_46
User Submitted?: No
Authors: Liu, Fei; Jia, Yan; Han, Weihong
Editors:
Pages: 365-372
Volume Title: Communications in Computer and Information Science
Publisher: Springer, Berlin, Heidelberg
Publisher Location:
Volume:
Edition: 320
Data Collections: IPUMS USA
Topics: Methodology and Data Collection
Countries: United States