IPUMS.org Home Page

BIBLIOGRAPHY

Publications, working papers, and other research using data resources from IPUMS.

Full Citation

Title: A Minimum Spanning Tree-Inspired Clustering-Based Outlier Detection Technique

Citation Type: Journal Article

Publication Year: 2012

Abstract: Due to its important applications in data mining, many techniques have been developed for outlier detection. In this paper, an efficient three-phase outlier detection technique. First, we modify the famous k-means algorithm for an efficient construction of a spanning tree which is very close to a minimum spanning tree of the data set. Second, the longest edges in the obtained spanning tree are removed to form clusters. Based on the intuition that the data points in small clusters may be most likely all outliers, they are selected and regarded as outlier candidates. Finally, density-based outlying factors, LOF, are calculated for potential outlier candidates and accessed to pinpoint the local outliers. Extensive experiments on real and synthetic data sets show that the proposed approach can efficiently identify global as well as local outliers for large-scale datasets with respect to the state-of-the-art methods.

User Submitted?: No

Authors: Wang, Xia Li; Wilkes, D.Mitch; Wang, Xiaochun

Periodical (Full): Lecture Notes in Computer Science

Issue:

Volume: 7377

Pages: 209-223

Data Collections: IPUMS USA

Topics:

Countries:

IPUMS NHGIS NAPP IHIS ATUS Terrapop