4.8 Article

Distributed Feature Selection for Big Data Using Fuzzy Rough Sets

期刊

IEEE TRANSACTIONS ON FUZZY SYSTEMS
卷 28, 期 5, 页码 846-857

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TFUZZ.2019.2955894

关键词

Rough sets; Feature extraction; Big Data; Servers; Distributed databases; Cloud computing; Heuristic algorithms; Big data; distributed feature selection; dynamic data decomposition; fuzzy rough sets

资金

  1. National Key R&D Program of China [2018YFB1004703]
  2. China NSF [61672349, 61672253, 61728303, 61971030, 61772338, 61872033, 61972254, 61672353]
  3. Humanity and Social Science Youth Foundation of Ministry of Education of China [18YJCZH204]
  4. Open Research Project of the State Key Laboratory of Industrial Control Technology, Zhejiang University, China [ICT1800417]

向作者/读者索取更多资源

Fuzzy rough-set-based feature selection is an important technique for big data analysis. However, the classic fuzzy rough set algorithm takes all the data correlations into account, which leads to the centralized computing mode, requiring high computing and memory space resources. With the increasing amount of data in the big data era, the centralized server cannot afford the computation of fuzzy rough set. To enable the fuzzy rough set for big data analysis, in this article, we propose the novel distributed fuzzy rough set (DFRS)-based feature selection, which separates and assigns the tasks to multiple nodes for parallel computing. The key challenge is to maintain the global information on each distributed node without conserving the entire fuzzy relation matrix. We tackle this challenge by a dynamic data decomposition algorithm and a data summarization process on each distributed node. Extensive experiments based on multiple real datasets demonstrate that DFRS significantly improves the runtime, and its feature selection accuracy is nearly the same as the traditional centralized computing.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据