Full Citation
Title: Anonymizing Set-Valued Data by Nonreciprocal Recoding
Citation Type: Miscellaneous
Publication Year: 2012
ISBN:
ISSN:
DOI:
NSFID:
PMCID:
PMID:
Abstract: Today there is a strong interest in publishing set-valued data in a privacy-preserving manner. Such data associate individuals to sets of values (e.g., preferences, shopping items, symptoms, query logs). In addition, an individual can be associated with a sen- sitive label (e.g., marital status, religious or political conviction). Anonymizing such data implies ensuring that an adversary should not be able to (1) identify an individual’s record, and (2) infer a sen- sitive label, if such exists. Existing research on this problem either perturbs the data, publishes them in disjoint groups disassociated from their sensitive labels, or generalizes their values by assuming the availability of a generalization hierarchy. In this paper, we pro- pose a novel alternative. Our publication method also puts data in a generalized form, but does not require that published records form disjoint groups and does not assume a hierarchy either; instead, it employs generalized bitmaps and recasts data values in a nonrecip- rocal manner; formally, the bipartite graph from original to anony- mized records does not have to be composed of disjoint complete subgraphs. We configure our schemes to provide popular privacy guarantees while resisting attacks proposed in recent research, and demonstrate experimentally that we gain a clear utility advantage over the previous state of the art.
Url: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.370.7948&rep=rep1&type=pdf
User Submitted?: No
Authors: Xue, Mingqiang; Karras, Panagiotis; Raïssi, Chedy; Vaidya, Jaideep; Tan, Kian-Lee
Publisher: National University of Singapore
Data Collections: IPUMS USA
Topics: Population Data Science
Countries: United States