Full Citation
Title: A Minimum Spanning Tree-Inspired Clustering-Based Outlier Detection Technique
Citation Type: Journal Article
Publication Year: 2012
ISBN:
ISSN:
DOI:
NSFID:
PMCID:
PMID:
Abstract: Due to its important applications in data mining, many techniques have been developed for outlier detection. In this paper, an efficient three-phase outlier detection technique. First, we modify the famous k-means algorithm for an efficient construction of a spanning tree which is very close to a minimum spanning tree of the data set. Second, the longest edges in the obtained spanning tree are removed to form clusters. Based on the intuition that the data points in small clusters may be most likely all outliers, they are selected and regarded as outlier candidates. Finally, density-based outlying factors, LOF, are calculated for potential outlier candidates and accessed to pinpoint the local outliers. Extensive experiments on real and synthetic data sets show that the proposed approach can efficiently identify global as well as local outliers for large-scale datasets with respect to the state-of-the-art methods.
User Submitted?: No
Authors: Wang, Xia Li; Wilkes, D.Mitch; Wang, Xiaochun
Periodical (Full): Lecture Notes in Computer Science
Issue:
Volume: 7377
Pages: 209-223
Data Collections: IPUMS USA
Topics:
Countries: