IPUMS.org Home Page

BIBLIOGRAPHY

Publications, working papers, and other research using data resources from IPUMS.

Full Citation

Title: A Multi-phase k-anonymity Algorithm Based on Clustering Techniques

Citation Type: Book, Section

Publication Year: 2013

DOI: 10.1007/978-3-642-35795-4_46

Abstract: We proposed a new k-anonymity algorithm to publish datasets with privacy protection. We improved clustering techniquesto lower data distort and enhance diversity of sensitive attributes values. Our algorithm includes four phases. Tuples are distributed to several groups in phase one. Tuples in a group own same sensitive value. In phase two, groups smaller than the threshold merge and then they are partitioned into several clusters according to quasi-identifier attributes. Each cluster would become an equivalence class. In phase three, remainder tuples are distributed to clusters evenly to satisfy L-diversity. Finally, quasi-identifier attributes values in each cluster are generalized to satisfy k-anonymity. We used OCC dataset to compare our algorithm with classic method based on clustering. Empirical results showed that our algorithm could be used to publish datasets with high security and limited information loss.

Url: http://link.springer.com/10.1007/978-3-642-35795-4_46

User Submitted?: No

Authors: Liu, Fei; Jia, Yan; Han, Weihong

Editors:

Pages: 365-372

Volume Title: Communications in Computer and Information Science

Publisher: Springer, Berlin, Heidelberg

Publisher Location:

Volume:

Edition: 320

Data Collections: IPUMS USA

Topics: Methodology and Data Collection

Countries: United States

IPUMS NHGIS NAPP IHIS ATUS Terrapop