IPUMS.org Home Page

BIBLIOGRAPHY

Publications, working papers, and other research using data resources from IPUMS.

Full Citation

Title: Integrating Historical Noisy Answers for Improving Data Utility under Differential Privacy

Citation Type: Miscellaneous

Publication Year: 2012

Abstract: Differential privacy is a robust principle for privacy preserving data analysis tasks, and has been successfully applied to a variety of applications. However, the number of queries that can be answered is limited for preventing privacy disclosure. Once the privacy budget is exhausted, all succeeding queries must be rejected. Therefore, each of the historical query answers is valuable and it is important to exploit them together to learn more about the data. We propose to integrate all available linear query answers into a consistent form that embodies our knowledge learned from the noisy answers, obtaining more accurate answers to past queries and even new queries, improving the data utility. Two distinct approaches are developed for this purpose, one via principle component analysis, and another via maximum entropy method. The second approach also generates a synthetic database, which is useful for differentially private data publishing. One important goal of our work is to ensure that the running time of our approaches does not grow with the cardinality of the universe of a data tuple, so that high-dimensional data with very large domain can still be tackled efficiently.

Url: https://www.researchgate.net/profile/Sourav_S_Bhowmick/publication/241623253_Integrating_historical_noisy_answers_for_improving_data_utility_under_differential_privacy/links/53ec9f3c0cf24f241f159394.pdf?origin%3Dpublication_detail

User Submitted?: No

Authors: Chen, Shixi; Zhou, Shuigeng; Bhowmick, Sourav, S

Publisher: Fudan University, China

Data Collections: IPUMS USA

Topics: Population Data Science

Countries: United States

IPUMS NHGIS NAPP IHIS ATUS Terrapop