논문 상세보기

A Local Outlier Mining Algorithm Based on Region Segmentation KCI 등재

  • 언어ENG
  • URLhttps://db.koreascholar.com/Article/Detail/412501
구독 기관 인증 시 무료 이용이 가능합니다. 4,000원
한국컴퓨터게임학회 논문지 (Journal of The Korean Society for Computer Game)
한국컴퓨터게임학회 (Korean Society for Computer Game)
초록

Regarding high-dimensional heterogeneous data, combined with the existing algorithms' poor mining accuracy and parameter sensitivity, this paper proposes a local outlier mining algorithm based on neighborhood density. Use region segmentation to split high-dimensional data into reasonable sub-regions, reducing the difficulty of processing a large amount of high-dimensional data. The kernel neighborhood density is used to replace the average neighborhood density, so that the density calculation has nothing to do with data heterogeneity. Finally, the neighborhood state and outlier state of the data are further determined on the basis of neighborhood density to improve the accuracy of outlier mining. Through artificial and UCI data set simulation results, it shows that data volume and data dimension are the main factors that affect data outlier mining. The accuracy, coverage, and efficiency of the algorithm proposed in this paper are significantly better than those of the comparison algorithm, and it has better adaptability to different types of data sets.

목차
ABSTRACT
1. Introduction
2. The principle of outlier mining inheterogeneous data
3. Data set region segmentation
4. Outlier mining algorithm
    4.1. Outlier detection based on kernelneighborhood density
    4.2 Outlier score
5. Simulation and result analysis
    5.1. Experimental environment and evaluationindicators
    5.2. Artificial data set
    5.3 UCI data set
6. Conclusion
References
저자
  • Somenath Chakraborty(Dept. of Computing Sciences and Computer Engineering, The University of Southern Mississippi, Hattiesburg, USA) Corresponding author