Browse by author
Lookup NU author(s): Dahui Yu, Dr Jichun Li
This work is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0).
© 2024 Elsevier Inc.In conventional outlier detection, global outliers are easily identified, but the efficacy diminishes when faced with local outliers within clusters of varying densities. Conversely, while the local outlier factor excels in detecting local anomalies, its performance falters as the number of outliers increases. To address these limitations and cater to intricate datasets by ensuring adept detection of both global and local outliers, this paper introduces a novel outlier detection approach known as High-Density Iteration (HDIOD). The methodology begins by leveraging a combination of the Gaussian kernel function and k-nearest neighbors to compute the local kernel density for each sample. Subsequently, the process involves comparing the local kernel density of a given sample with that of its k-neighbors. If the sample's local kernel density is lower than the maximum density among its neighbors, it selects the neighbor with the highest local kernel density within its k-neighbors as the new object for comparison. This iterative process continues, where the set of k-neighbors for all objects constitutes the extended k-neighbors of the original sample. The final step involves utilizing the ratio of the maximum local kernel density within the extended k-nearest neighbors to the local density of the sample as a measure of the sample's outlier degree. Experimental evaluations conducted on 12 synthetic datasets and 19 real-world datasets demonstrate the effectiveness of the HDIOD method. Comparative analyses with 13 commonly used outlier detection methods underscore the high detection accuracy and robustness of HDIOD to parameter variations.
Author(s): Zhou Y, Xia H, Yu D, Cheng J, Li J
Publication type: Article
Publication status: Published
Journal: Information Sciences
Year: 2024
Volume: 662
Online publication date: 06/02/2024
Acceptance date: 31/01/2024
Date deposited: 17/05/2024
ISSN (print): 0020-0255
ISSN (electronic): 1872-6291
Publisher: Elsevier Inc.
URL: https://doi.org/10.1016/j.ins.2024.120286
DOI: 10.1016/j.ins.2024.120286
ePrints DOI: 10.57711/a7f3-gd42
Data Access Statement: Data will be made available on request.
Altmetrics provided by Altmetric