IPUMS.org Home Page

BIBLIOGRAPHY

Publications, working papers, and other research using data resources from IPUMS.

Full Citation

Title: Comparative Evaluation of Synthetic Data Generation Methods

Citation Type: Miscellaneous

Publication Year: 2019

Abstract: Unrestricted availability of datasets is important for researchers and decision makers to evaluate their strategies to solve certain problems. Equally important is the privacy of the respective data owners. Synthetically generated datasets provide a way to retain the utility of the original data keeping the privacy of the owners invulnerable. We perform a comparative study of synthetic data generation techniques using different data synthesizers like decision tree, ran- dom forest and neural network. We evaluate the effectiveness of these methods towards the amount of utility they preserve and the risk of disclosure.

Url: https://pdfs.semanticscholar.org/84ba/a6078344b5e8c7638c5f351ada4d7ea8fff6.pdf

User Submitted?: No

Authors: Dandekar, Ashish; Zen, Remmy, AM; Bressan, Stéphane

Publisher: National University of Singapore

Data Collections: IPUMS International

Topics: Methodology and Data Collection, Population Data Science

Countries:

IPUMS NHGIS NAPP IHIS ATUS Terrapop