4.7 Article

Three-way sampling for rapid attribute reduction

期刊

INFORMATION SCIENCES
卷 609, 期 -, 页码 26-45

出版社

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2022.07.063

关键词

Three-way decisions; Attribute reduction; Support vector data description; Data sampling; Neighborhood rough set

资金

  1. National Natural Science Foundation of China [62006200]
  2. Sichuan Province Youth Science and Technology Innovation Team, China [2019JDTD0017]
  3. First-class Undergraduate Course Construction Project of Southwest Petroleum University [X2021YLKC035]
  4. Southwest Petroleum University Postgraduate English Course Construction Project [2020QY04]

向作者/读者索取更多资源

This paper proposes a rapid attribute reduction method based on three-way sampling (3WS-RAR), which improves the effectiveness and efficiency of attribute reduction through three main steps and experiments on large-scale datasets, demonstrating better performance on public benchmark datasets compared to state-of-the-art models.
As data dimensions and volume rapidly increase, attribute reduction using the original data becomes computationally infeasible. Large data frequently contain various redundant attributes and types of noise. This leads to the problems of overfitting and inefficiency in data processing. To address these problems, this paper proposes a general sampling method for attribute reduction by introducing three-way decisions, namely, three-way sampling (3WS), which is the first sampling method that describes the decision boundary accurately while improving the data quality significantly. To improve the effectiveness and efficiency of attribute reduction, we designed a rapid attribute reduction method based on three-way sampling (3WS-RAR). The 3WS-RAR method consists of three main steps: data sampling, attribute reduction, and model effectiveness evaluation. For data sampling, we define the three regions of the 3WS using support vectors to describe the data and use the boundary region as the sampling results. For the attribute reduction, we compute the neighborhood self-information for each attribute while considering the upper and lower approximations. For the effectiveness evaluation, we conducted experiments on 15 relatively large-scale datasets and analysed the influence of parameters. The experimental results reveal that, compared with state-of-the-art attribute reduction models, 3WS-RAR performs better on public benchmark datasets. (C) 2022 Elsevier Inc. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据