Density-based outlier detection for very large data sets

Density-based outlier detection for very large data sets

0.00 Avg rating0 Votes
Article ID: iaor20106451
Volume: 35
Issue: 2
Start Page Number: 141
End Page Number: 150
Publication Date: Apr 2010
Journal: Journal of the Korean O.R. and MS Society
Authors: , ,
Keywords: outliers
Abstract:

A density-based outlier detection such as an LOF (Local Outlier Factor) tries to find an outlying observation by using density of its surrounding space. In spite of several advantages of a density-based outlier detection method, the computational complexity of outlier detection has been one of major barriers in its application. In this paper, we present an LOF algorithm that can reduce computation time of a density based outlier detection algorithm. A kd-tree indexing and approximated k-nearest neighbor search algorithm (ANN) are adopted in the proposed method. A set of experiments was conducted to examine performance of the proposed algorithm. The results show that the proposed method can effectively detect local outliers in reduced computation time.

Reviews

Required fields are marked *. Your email address will not be published.