4.6 Article

RSFD: A rough set-based feature discretization method for meteorological data

期刊

FRONTIERS IN ENVIRONMENTAL SCIENCE
卷 10, 期 -, 页码 -

出版社

FRONTIERS MEDIA SA
DOI: 10.3389/fenvs.2022.1013811

关键词

meteorological data; feature discretization; information gain; rough set; classification accuracy

资金

  1. Hainan Provincial Natural Science Foundation of China
  2. National Key Research and Development Program of China
  3. China Postdoctoral Science Foundation
  4. [2019CXTD400]
  5. [2018YFB1404400]
  6. [2021M701838]

向作者/读者索取更多资源

This study proposes a rough set-based feature discretization method (RSFD) for meteorological data, which optimizes the discretization scheme by calculating information gain, using chi-square test, and considering the variation of the indiscernibility relation. Experimental results show that the RSFD method achieves better overall performance in terms of meteorological data classification accuracy and the number of discrete intervals.
Meteorological data mining aims to discover hidden patterns in a large number of available meteorological data. As one of the most relevant big data preprocessing technologies, feature discretization can transform continuous features into discrete ones to improve the efficiency of meteorological data mining algorithms. Aiming at the problems of high interaction of multiple attributes, noise interference, and difficulty in obtaining prior knowledge in meteorological data, we propose a rough set-based feature discretization method for meteorological data (RSFD). First, we calculate the information gain of each candidate breakpoint in the meteorological attribute to split the intervals. Then, we use chi-square test to merge these discrete intervals. Finally, we take the variation of indiscernibility relation in rough set as the evaluation criterion for the discretization scheme. We scan each attribute in turn by using the strategy of splitting first and then merging, thus obtaining the optimal discrete feature set. We compare RSFD with the state-of-the-art discretization methods on meteorological data. Experiments show that our method achieves better results in the classification accuracy of meteorological data, and obtains a smaller number of discrete intervals while ensuring data consistency.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据