Article

A class center based approach for missing value imputation

Journal

KNOWLEDGE-BASED SYSTEMS
Volume 151, Pages 124-135

Publisher

ELSEVIER SCIENCE BV
DOI: 10.1016/j.knosys.2018.03.026

Keywords

Data mining; Missing value imputation; Incomplete datasets; Machine learning

Funding

  1. Ministry of Science and Technology of Taiwan [MOST 105-2410-H-008-043-MY3, MOST 106-2410-H-182-024]
  2. Chang Gung Memorial Hospital, Linkou [NERPD2G0301T]


Missing value imputation (MVI) is the principal approach to handling incomplete datasets: missing attribute values are replaced with estimates derived from a chosen set of observed data, using either statistical methods, such as mean/mode imputation, or machine learning methods, such as support vector machines. Although machine learning MVI approaches can produce reasonably good imputation results, they usually require longer imputation times than statistical approaches. This paper introduces a Class Center based Missing Value Imputation (CCMVI) approach that produces effective imputation results more efficiently. CCMVI computes the center of each class and then uses the distances between that center and the other observed data to define a threshold for the subsequent imputation. Experiments on numerical, categorical, and mixed datasets show that CCMVI outperforms the other MVI approaches on both numerical and mixed datasets. In addition, it requires much less imputation time than the machine learning MVI methods. © 2018 Elsevier B.V. All rights reserved.
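
The class-center idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not say how the threshold is computed or how it steers the imputation. The sketch assumes numerical features only, takes the class center to be the per-feature mean of the observed values in each class, and defines the threshold (an assumption) as the mean Euclidean distance from the center to the fully observed samples of that class. The function name `class_center_impute` is hypothetical.

```python
import numpy as np

def class_center_impute(X, y):
    """Fill NaNs with the per-class feature mean (the assumed 'class center').

    Returns the imputed copy of X and a dict mapping each class label to a
    distance threshold. The threshold definition (mean distance from the
    center to the complete samples of the class) is an illustrative
    assumption, not taken from the paper.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    X_imputed = X.copy()
    thresholds = {}
    for label in np.unique(y).tolist():
        rows = np.where(y == label)[0]
        block = X[rows]
        # Class center computed from observed (non-NaN) values only.
        center = np.nanmean(block, axis=0)
        # Distances from the center to samples with no missing values.
        complete = block[~np.isnan(block).any(axis=1)]
        if len(complete) > 0:
            distances = np.linalg.norm(complete - center, axis=1)
            thresholds[label] = float(distances.mean())
        else:
            thresholds[label] = float("nan")
        # Replace each missing entry with the matching coordinate of the center.
        X_imputed[rows] = np.where(np.isnan(block), center, block)
    return X_imputed, thresholds
```

One way to use the threshold in such a sketch would be to flag imputed samples whose distance to their class center exceeds it. How CCMVI itself applies the threshold is described in the paper, not here.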
