4.6 Article

Locally linear reconstruction based missing value imputation for supervised learning

Journal

NEUROCOMPUTING
Volume 118, Issue -, Pages 65-78

Publisher

ELSEVIER SCIENCE BV
DOI: 10.1016/j.neucom.2013.02.016

Keywords

Locally linear reconstruction (LLR); Missing value imputation; Supervised learning; Classification; Regression

Funding

  1. Basic Science Research Program through the National Research Foundation of Korea (NRF)
  2. Ministry of Education, Science, and Technology (MEST) [2011-0021893]
  3. National Research Foundation of Korea [2011-0021893] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

Ask authors/readers for more resources

Most learning algorithms generally assume that data is complete so each attribute of all instances is filled with a valid value. However, missing values are very common in real datasets for various reasons. In this paper, we propose a new single imputation method based on locally linear reconstruction (LLR) that improves the prediction performance of supervised learning (classification & regression) with missing values. First, we investigate how missing values degrade the prediction performance with various missing ratios. Next, we compare the proposed missing value imputation method (LLR) with six well-known single imputation methods for five different learning algorithms based on 13 classification and nine regression datasets. The experimental results showed that (1) all imputation methods helped to improve the prediction accuracy, although some were very simple; (2) the proposed LLR imputation method enhanced the modeling performance more than all other imputation methods, irrespective of the learning algorithms and the missing ratios; and (3) LLR was outstanding when the missing ratio was relatively high and its prediction accuracy was similar to that of the complete dataset. (C) 2013 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available