4.6 Article

Missing value imputation using unsupervised machine learning techniques

Journal

SOFT COMPUTING
Volume 24, Issue 6, Pages 4361-4392

Publisher

SPRINGER
DOI: 10.1007/s00500-019-04199-6

Keywords

K-means; Fuzzy C-means; Rough K-means; Machine learning; Missing values; Imputation

Funding

  1. UGC, New Delhi [F1-17.1/2016-17/RGNF-2015-17-SC-TAM-28324, 43-274/2014]

Ask authors/readers for more resources

In data mining, preprocessing is one of the essential processes which involves data normalization, noise removal, handling missing values, etc. This paper focuses on handling missing values using unsupervised machine learning techniques. Soft computation approaches are combined with the clustering techniques to form a novel method to handle the missing values, which help us to overcome the problems of inconsistency. Rough K-means centroid-based imputation method is proposed and compared with K-means centroid-based imputation method, fuzzy C-means centroid-based imputation method, K-means parameter-based imputation method, fuzzy C-means parameter-based imputation method, and rough K-means parameter-based imputation methods. The experimental analysis is carried out on four benchmark datasets, viz. Dermatology, Pima, Wisconsin, and Yeast datasets, which have taken from UCI data repository. The proposed method proves the efficacy of different datasets, and the results are also promising one.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available