IPUMS.org Home Page

BIBLIOGRAPHY

Publications, working papers, and other research using data resources from IPUMS.

Full Citation

Title: Detecting Group Differences: Mining Contrast Sets

Citation Type: Miscellaneous

Publication Year: 2001

Abstract: A fundamental task in data analysis is understanding the di erences between several contrasting groups. These groups can represent di erent classes of ob jects, such as male or female students, or the same group over time, e.g. freshman students in 1993 through 1998. We present the problem of mining contrast sets: conjunctions of attributes and values that differ meaningfully in their distribution across groups. We provide a search algorithm for mining contrast sets with pruning rules that drastically reduce the computational complexity. Once the contrast sets are found, we post-process the results to present a subset that are surprising to the user given what we have already shown. We explicitly control the probability of Type I error (false positives) and guarantee a maximum error rate for the entire analysis by using Bonferroni corrections.

Url: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.23.5522&rep=rep1&type=pdf

User Submitted?: No

Authors: Bay, Stephen, D; Pazzani, Michael, J

Publisher: University of California, Irvine

Data Collections: IPUMS USA

Topics: Methodology and Data Collection, Other

Countries:

IPUMS NHGIS NAPP IHIS ATUS Terrapop