4.7 Article

POP algorithm: Kernel-based imputation to treat missing values in knowledge discovery from databases

期刊

EXPERT SYSTEMS WITH APPLICATIONS
卷 36, 期 2, 页码 2794-2804

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2008.01.059

关键词

Knowledge discovery; Missing value; Random regression imputation; Deterministic regression imputation

资金

  1. ARC [DP0667060]
  2. NSF [60496327, 90718020, 10661003, 60625204]
  3. China 973 Program [2008CB317108]
  4. Chinese Academy of Sciences [06S3011S01]

向作者/读者索取更多资源

To complete missing values a solution is to use correlations between the attributes of the data. The problem is that it is difficult to identify relations within data containing missing values. Accordingly, we develop a kernel-based missing data imputation in this paper. This approach aims at making an optimal inference oil statistical parameters: mean, distribution function and quantile after missing data are imputed. And we refer this approach to parameter optimization method (POP algorithm). We experimentally evaluate our approach, and demonstrate that our POP algorithm (random regression imputation) is much better than deterministic regression imputation in efficiency and generating an inference on the above parameters. (C) 2008 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据