Full Citation
Title: A recursive search algorithm for statistical disclosure assessment
Citation Type: Journal Article
Publication Year: 2008
ISBN:
ISSN:
DOI: 10.1007/s10618-007-0078-6
NSFID:
PMCID:
PMID:
Abstract: A new algorithm, SUDA2, is presented which finds minimally unique itemsets i.e., minimal itemsets of frequency one. These itemsets, referred to as Minimal Sample Uniques (MSUs), are important for statistical agencies who wish to estimate the risk of disclosure of their datasets. SUDA2 is a recursive algorithm which uses new observations about the properties of MSUs to prune and traverse the search space. Experimental comparisons with previous work demonstrate that SUDA2 is several orders of magnitude faster, enabling datasets of significantly more columns to be addressed. The ability of SUDA2 to identify the boundaries of the search space for MSUs is clearly demonstrated.
Url: http://link.springer.com/10.1007/s10618-007-0078-6
User Submitted?: No
Authors: Manning, Anna M.; Haglin, David J.; Keane, John A.
Periodical (Full): Data Mining and Knowledge Discovery
Issue: 2
Volume: 16
Pages: 165-196
Data Collections: IPUMS USA
Topics: Population Data Science
Countries: United States