Journal
SOFT COMPUTING
Volume 24, Issue 6, Pages 4361-4392
Publisher
SPRINGER
DOI: 10.1007/s00500-019-04199-6
Keywords
K-means; Fuzzy C-means; Rough K-means; Machine learning; Missing values; Imputation
Funding
- UGC, New Delhi [F1-17.1/2016-17/RGNF-2015-17-SC-TAM-28324, 43-274/2014]
In data mining, preprocessing is an essential step that involves data normalization, noise removal, handling missing values, etc. This paper focuses on handling missing values using unsupervised machine learning techniques. Soft computing approaches are combined with clustering techniques to form a novel method for handling missing values, which helps to overcome problems of inconsistency. A rough K-means centroid-based imputation method is proposed and compared with the K-means centroid-based, fuzzy C-means centroid-based, K-means parameter-based, fuzzy C-means parameter-based, and rough K-means parameter-based imputation methods. The experimental analysis is carried out on four benchmark datasets taken from the UCI data repository: Dermatology, Pima, Wisconsin, and Yeast. The proposed method proves effective across these datasets, and the results are promising.
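To make the idea of centroid-based imputation concrete, the following is a minimal sketch of the plain K-means baseline named in the abstract, not the proposed rough K-means method, whose details are not given here. It assumes that clusters are fitted on complete records only, that an incomplete record is assigned to the nearest centroid using its observed features, and that each missing value is filled with that centroid's value. The function names (`kmeans`, `impute_with_centroids`), the deterministic initialization, and the toy data are all hypothetical choices made for illustration.

```python
import math

def kmeans(rows, k, iters=20):
    """Plain Lloyd's K-means on complete rows.

    Assumption for reproducibility: centroids start at the first k rows.
    """
    centroids = [list(r) for r in rows[:k]]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for r in rows:
            # Assign each row to its nearest centroid (Euclidean distance).
            j = min(range(k), key=lambda c: math.dist(r, centroids[c]))
            clusters[j].append(r)
        for c, members in enumerate(clusters):
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return centroids

def impute_with_centroids(data, k=2):
    """Fill None entries with the matching feature of the nearest centroid."""
    complete = [r for r in data if None not in r]
    centroids = kmeans(complete, k)
    filled = []
    for r in data:
        observed = [i for i, v in enumerate(r) if v is not None]
        # Distance is computed over the observed features only.
        nearest = min(
            centroids,
            key=lambda c: sum((r[i] - c[i]) ** 2 for i in observed),
        )
        filled.append([nearest[i] if v is None else v for i, v in enumerate(r)])
    return filled

# Toy data (hypothetical): two well-separated groups, two incomplete rows.
data = [
    [1.0, 2.0],
    [9.0, 10.0],
    [1.2, 1.8],
    [8.8, 10.2],
    [1.1, None],
    [None, 9.9],
]
result = impute_with_centroids(data, k=2)
```

In this sketch, observed values are left untouched and only the `None` entries are replaced. A fuzzy or rough variant would differ in how a record's cluster membership is determined, which this example does not attempt to reproduce.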