IPUMS.org Home Page

BIBLIOGRAPHY

Publications, working papers, and other research using data resources from IPUMS.

Full Citation

Title: Scalable Informative Rule Mining

Citation Type: Dissertation/Thesis

Publication Year: 2016

Abstract: In this thesis we present SIRUM: a system for Scalable Informative RUle Mining from multidimensional data. Informative rules have recently been studied in several contexts, including data summarization, data cube exploration and data quality. The objective is to produce a concise set of rules (patterns) over the values of the dimension attributes that provide the most information about the distribution of a numeric measure attribute. SIRUM optimizes this task for big, wide and distributed datasets. We implemented SIRUM in Spark and observed significant performance improvements on real data due to our optimizations.

Url: https://uwspace.uwaterloo.ca/bitstream/handle/10012/10620/Feng_Guoyao.pdf?sequence=3

User Submitted?: No

Authors: Feng, Guoyao

Institution: University of Waterloo

Department:

Advisor:

Degree:

Publisher Location: Waterloo, Ontario, Canada

Pages:

Data Collections: IPUMS USA

Topics: Methodology and Data Collection

Countries:

IPUMS NHGIS NAPP IHIS ATUS Terrapop