4.8 Article

Hybrid Missing Value Imputation Algorithms Using Fuzzy C-Means and Vaguely Quantified Rough Set

期刊

IEEE TRANSACTIONS ON FUZZY SYSTEMS
卷 30, 期 5, 页码 1396-1408

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TFUZZ.2021.3058643

关键词

Clustering algorithms; Machine learning algorithms; Approximation algorithms; Task analysis; Prediction algorithms; Vegetation; Classification algorithms; Fuzzy C-Means (FCM) clustering imputation; fuzzy membership relations; missing value imputation (MVI); nearest neighbor imputation; rough set

资金

  1. National Key R&D Program of China [2019YFB2101802]
  2. Key Laboratory of Pattern Recognition and Intelligent Information Processing, Institutions of Higher Education of Sichuan Province, through the High-End Talent Introduction Project From Abroad in 2020 [G20200236016]
  3. Sichuan Science and Technology Program [2021YFH0107]
  4. Building Skills 4.0 through University and Enterprise Collaboration (Erasmus+ Shyfte 4.0) Project (Erasmus+Programme) [598649-EPP-1-2018-1FR-EPPKA2-CBHE-JP]

向作者/读者索取更多资源

Two algorithms, JFCM-VQNNI and JFCM-FVQNNI, have been proposed in this research to achieve effective data imputation by considering clustering and uncertain information extraction when predicting missing values. Experimental results show that these two algorithms have higher imputation performance and reliability compared to traditional parameter-based imputation algorithms.
In real cases, missing values tend to contain meaningful information that should be acquired or should be analyzed before the incomplete dataset is used for machine learning tasks. In this work, two algorithms named jointly fuzzy C-Means and vaguely quantified nearest neighbor (VQNN) imputation (JFCM-VQNNI) and jointly fuzzy C-Means and fitted VQNN imputation (JFCM-FVQNNI) have been proposed by considering clustering conception and sufficient extraction of uncertain information. In the proposed JFCM-VQNNI and JFCM-FVQNNI algorithm, the missing value is regarded as a decision feature, and then, the prediction is generated for the objects that contain at least one missing value. Specially, as for JFCM-VQNNI algorithm, indistinguishable matrixes, tolerance relations, and fuzzy membership relations are adopted to identify the potential closest filled values based on corresponding similar objects and related clusters. On the basis of JFCM-VQNNI algorithm, JFCM-FVQNNI algorithm synthetic analyzes the fuzzy membership of the dependent features for instances with each cluster. In order to fill the missing values more accurately, JFCM-FVQNNI algorithm performs fuzzy decision membership adjustment in each object with respect to the related clusters by considering highly relevant decision attributes. The experiments have been carried out on five datasets. Based on the analysis of root-mean-square error, mean absolute error, comparison of imputation values with actual values, and classification accuracy results analysis, we can draw the conclusion that the proposed JFCM-FVQNNI and JFCM-VQNNI algorithms yields sufficient and reasonable imputation performance results by comparing with fuzzy C-Means parameter-based imputation algorithm and fuzzy C-Means rough parameter-based imputation algorithm.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据