4.6 Review

Review of Classification Methods on Unbalanced Data Sets

期刊

IEEE ACCESS
卷 9, 期 -, 页码 64606-64628

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2021.3074243

关键词

Classification algorithms; Sampling methods; Support vector machines; Security; Safety; Deep learning; Boosting; Unbalanced data sets; classification; sampling methods; algorithm level; feature level

资金

  1. National Nature Science Foundation of China [62062004]
  2. Ningxia Natural Science Foundation Project [2020AAC03216]
  3. Postgraduate Innovation Project of Northern Minzu University [YCX20082]

向作者/读者索取更多资源

This paper explores the classification of unbalanced data sets, analyzing various methods from data sampling, algorithm, feature, cost-sensitive function, and deep learning perspectives, comparing the advantages and disadvantages of different techniques, and outlining future research directions.
This paper studies the classification of unbalanced data sets. First, this kind of data sets is briefly introduced, and then the classification methods of unbalanced data sets are analyzed in detail from different perspectives such as data sampling method, algorithm level, feature level, cost-sensitive function, and deep learning. In addition, the data sampling methods are divided into different technologies for introduction: unbalanced data set classification method based on synthetic minority over-sampling technology (SMOTE), support vector machine (SVM) technology, and k-nearest neighbor (KNN) technology, etc. Then, the advantages and disadvantages of these methods are compared. Finally, the evaluation criteria of the unbalanced data set classifier are summarized, and the future work directions are prospected and summarized.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据