Toggle Main Menu Toggle Search

Open Access padlockePrints

Outlier detection method based on high-density iteration

Lookup NU author(s): Dahui Yu, Dr Jichun Li

Downloads


Licence

This work is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0).


Abstract

© 2024 Elsevier Inc.In conventional outlier detection, global outliers are easily identified, but the efficacy diminishes when faced with local outliers within clusters of varying densities. Conversely, while the local outlier factor excels in detecting local anomalies, its performance falters as the number of outliers increases. To address these limitations and cater to intricate datasets by ensuring adept detection of both global and local outliers, this paper introduces a novel outlier detection approach known as High-Density Iteration (HDIOD). The methodology begins by leveraging a combination of the Gaussian kernel function and k-nearest neighbors to compute the local kernel density for each sample. Subsequently, the process involves comparing the local kernel density of a given sample with that of its k-neighbors. If the sample's local kernel density is lower than the maximum density among its neighbors, it selects the neighbor with the highest local kernel density within its k-neighbors as the new object for comparison. This iterative process continues, where the set of k-neighbors for all objects constitutes the extended k-neighbors of the original sample. The final step involves utilizing the ratio of the maximum local kernel density within the extended k-nearest neighbors to the local density of the sample as a measure of the sample's outlier degree. Experimental evaluations conducted on 12 synthetic datasets and 19 real-world datasets demonstrate the effectiveness of the HDIOD method. Comparative analyses with 13 commonly used outlier detection methods underscore the high detection accuracy and robustness of HDIOD to parameter variations.


Publication metadata

Author(s): Zhou Y, Xia H, Yu D, Cheng J, Li J

Publication type: Article

Publication status: Published

Journal: Information Sciences

Year: 2024

Volume: 662

Online publication date: 06/02/2024

Acceptance date: 31/01/2024

Date deposited: 17/05/2024

ISSN (print): 0020-0255

ISSN (electronic): 1872-6291

Publisher: Elsevier Inc.

URL: https://doi.org/10.1016/j.ins.2024.120286

DOI: 10.1016/j.ins.2024.120286

ePrints DOI: 10.57711/a7f3-gd42

Data Access Statement: Data will be made available on request.


Altmetrics

Altmetrics provided by Altmetric


Funding

Funder referenceFunder name
2018GGJS079
National Natural Science Foundation of China
OSR/0550/SASC/S022
U1504622

Share