期刊
IEEE ACCESS
卷 9, 期 -, 页码 64606-64628出版社
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2021.3074243
关键词
Classification algorithms; Sampling methods; Support vector machines; Security; Safety; Deep learning; Boosting; Unbalanced data sets; classification; sampling methods; algorithm level; feature level
资金
- National Nature Science Foundation of China [62062004]
- Ningxia Natural Science Foundation Project [2020AAC03216]
- Postgraduate Innovation Project of Northern Minzu University [YCX20082]
This paper explores the classification of unbalanced data sets, analyzing various methods from data sampling, algorithm, feature, cost-sensitive function, and deep learning perspectives, comparing the advantages and disadvantages of different techniques, and outlining future research directions.
This paper studies the classification of unbalanced data sets. First, this kind of data sets is briefly introduced, and then the classification methods of unbalanced data sets are analyzed in detail from different perspectives such as data sampling method, algorithm level, feature level, cost-sensitive function, and deep learning. In addition, the data sampling methods are divided into different technologies for introduction: unbalanced data set classification method based on synthetic minority over-sampling technology (SMOTE), support vector machine (SVM) technology, and k-nearest neighbor (KNN) technology, etc. Then, the advantages and disadvantages of these methods are compared. Finally, the evaluation criteria of the unbalanced data set classifier are summarized, and the future work directions are prospected and summarized.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据