Article

Shell-neighbor method and its application in missing data imputation

Journal

APPLIED INTELLIGENCE
Volume 35, Issue 1, Pages 123-133

Publisher

SPRINGER
DOI: 10.1007/s10489-009-0207-6

Keywords

kNN; Shell-NN; Missing data imputation; Mining incomplete data

Funding

  1. Australian Research Council (ARC) [DP0985456]
  2. Natural Science Foundation (NSF) of China [90718020]
  3. China 973 Program [2008CB317108]
  4. China Ministry of Personnel for Overseas-Return High-level Talents
  5. MOE
  6. Social Sciences at Universities [07JJD720044]
  7. Guangxi NSF

Data preparation is an important step in mining incomplete data. This paper introduces a new imputation approach for that step, called SN (Shell Neighbors) imputation, or simply SNI. SNI fills in an incomplete instance (one with missing values) in a given dataset using only its left and right nearest neighbors with respect to each factor (attribute), referred to as Shell Neighbors. The left and right nearest neighbors are selected from a set of nearest neighbors of the incomplete instance, and the size of this set is determined by cross-validation. SNI is then generalized to handle missing data in datasets with mixed attributes, for example continuous and categorical attributes. Experiments demonstrate that the generalized SNI method outperforms the kNN imputation method in both imputation accuracy and classification accuracy.
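The selection step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how the shell neighbors' values are combined, so the sketch assumes continuous attributes only, Euclidean distance on the observed attributes, and a plain mean over the shell neighbors. The function names (shell_neighbors, sni_impute, choose_k) are hypothetical, and leave-one-out error stands in for the cross-validation procedure, whose details the abstract does not give.

```python
import math

def shell_neighbors(rows, query, observed, k):
    """Return sorted indices of the shell neighbors of `query` among complete `rows`."""
    # Step 1: k nearest neighbors by Euclidean distance on the observed attributes.
    def dist(r):
        return math.sqrt(sum((r[j] - query[j]) ** 2 for j in observed))
    knn = sorted(range(len(rows)), key=lambda i: dist(rows[i]))[:k]

    # Step 2: for each observed attribute, keep the closest neighbor on the
    # left (value <= query) and the closest on the right (value > query).
    shell = set()
    for j in observed:
        left = [i for i in knn if rows[i][j] <= query[j]]
        right = [i for i in knn if rows[i][j] > query[j]]
        if left:
            shell.add(max(left, key=lambda i: rows[i][j]))
        if right:
            shell.add(min(right, key=lambda i: rows[i][j]))
    return sorted(shell)

def sni_impute(rows, query, k):
    """Fill None entries of `query` with the mean over its shell neighbors.

    Averaging is an assumption of this sketch, not taken from the paper.
    """
    observed = [j for j, v in enumerate(query) if v is not None]
    missing = [j for j, v in enumerate(query) if v is None]
    shell = shell_neighbors(rows, query, observed, k)
    if not shell:
        raise ValueError("no shell neighbors: need k >= 1 and an observed attribute")
    filled = list(query)
    for j in missing:
        filled[j] = sum(rows[i][j] for i in shell) / len(shell)
    return filled

def choose_k(rows, target, candidates):
    """Pick k by leave-one-out squared error on attribute `target` (assumed scheme)."""
    def loo_error(k):
        err = 0.0
        for i, row in enumerate(rows):
            rest = rows[:i] + rows[i + 1:]
            query = list(row)
            query[target] = None  # hide the known value, then try to recover it
            err += (sni_impute(rest, query, k)[target] - row[target]) ** 2
        return err
    return min(candidates, key=loo_error)

# Toy data: the second attribute is missing in the query instance.
rows = [[1.0, 10.0], [2.0, 20.0], [4.0, 40.0], [8.0, 80.0]]
print(sni_impute(rows, [3.0, None], k=3))
```

In the toy data, the three nearest neighbors of the query are the rows with first attribute 2.0, 4.0 and 1.0. Only 2.0 (left) and 4.0 (right) bracket the query value 3.0, so only those two rows contribute to the imputed value. Plain kNN imputation with the same k would also average in the row at 1.0.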
