IPUMS.org Home Page

BIBLIOGRAPHY

Publications, working papers, and other research using data resources from IPUMS.

Full Citation

Title: A recursive search algorithm for statistical disclosure assessment

Citation Type: Journal Article

Publication Year: 2008

DOI: 10.1007/s10618-007-0078-6

Abstract: A new algorithm, SUDA2, is presented which finds minimally unique itemsets i.e., minimal itemsets of frequency one. These itemsets, referred to as Minimal Sample Uniques (MSUs), are important for statistical agencies who wish to estimate the risk of disclosure of their datasets. SUDA2 is a recursive algorithm which uses new observations about the properties of MSUs to prune and traverse the search space. Experimental comparisons with previous work demonstrate that SUDA2 is several orders of magnitude faster, enabling datasets of significantly more columns to be addressed. The ability of SUDA2 to identify the boundaries of the search space for MSUs is clearly demonstrated.

Url: http://link.springer.com/10.1007/s10618-007-0078-6

User Submitted?: No

Authors: Manning, Anna M.; Haglin, David J.; Keane, John A.

Periodical (Full): Data Mining and Knowledge Discovery

Issue: 2

Volume: 16

Pages: 165-196

Data Collections: IPUMS USA

Topics: Population Data Science

Countries: United States

IPUMS NHGIS NAPP IHIS ATUS Terrapop