Full Citation
Title: On Approximation Measures for Functional Dependencies
Citation Type: Conference Paper
Publication Year: 2004
ISBN:
ISSN:
DOI:
NSFID:
PMCID:
PMID:
Abstract: We examine the issue of how to measure the degree to which a functional dependency (FD) is approximate. The primary motivation lies in the fact that approximate FDs represent potentially interesting patterns existent in a table. Their discovery is a valuable data mining problem. However, before algorithms can be developed, a measure must be defined quantifying their approximation degree. First we develop an approximation measure by axiomatizing the following intuition: the degree to which X --> Y is approximate in a table T is the degree to which T determines a function from Pi(X)(T) to Pi(Y)(T). We prove that a unique unnormalized measure satisfies these axioms up to a multiplicative constant. Next we compare the measure developed with two other measures from the literature. In all but one case, we show that the measures can be made to differ as much as possible within normalization. We examine these measure on several real datasets and observe that many of the theoretically possible extreme differences do not bear themselves out. We offer some conclusions as to particular situations where certain measures are more appropriate than others. (C) 2003 Elsevier Ltd. All rights reserved.
User Submitted?: No
Authors: Robertson, Edward; Giannella, Chris
Conference Name: 6th East-European Conference on Advances in Dabases and Information Systems
Publisher Location: Bratislava, Slovakia
Data Collections: IPUMS USA
Topics: Methodology and Data Collection, Other
Countries: