Article

Missing value imputation on missing completely at random data using multilayer perceptrons

NEURAL NETWORKS (2011)

Journal

NEURAL NETWORKS

Volume 24, Issue 1, Pages 121-129

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.neunet.2010.09.008

Keywords

Multilayer perceptron; Hot-deck model; Imputation; Mean/mode model; Missing data; Regression model

Categories

Computer Science, Artificial Intelligence; Neurosciences

Funding

Institute of Statistics of Andalusia (Spain) [2007/00001428]

Abstract

Data mining is based on data files which usually contain errors in the form of missing values. This paper focuses on a methodological framework for the development of an automated data imputation model based on artificial neural networks. Fifteen real and simulated data sets are exposed to a perturbation experiment, based on the random generation of missing values. These data set sizes range from 47 to 1389 records. A perturbation experiment was performed for each data set where the probability of missing value was set to 0.05. Several architectures and learning algorithms for the multilayer perceptron are tested and compared with three classic imputation procedures: mean/mode imputation, regression and hot-deck. The obtained results, considering different performance measures, not only suggest this approach improves the quality of a database with missing values, but also the best results are clearly obtained using the Multilayer Perceptron model in data sets with categorical variables. Three learning rules (Levenberg-Marquardt, BFGS Quasi-Newton and Conjugate Gradient Fletcher-Reeves Update) and a small number of hidden nodes are recommended. (C) 2010 Elsevier Ltd. All rights reserved.
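The perturbation experiment described in the abstract can be illustrated with a minimal sketch. It deletes each cell independently with probability 0.05 (missing completely at random), fills the gaps with the column mean (the mean part of the mean/mode baseline), and scores the imputed cells against the known true values. The synthetic Gaussian data, the function names, the record count (taken from the upper end of the reported size range) and the RMSE score are assumptions made for illustration only; the paper's own data sets, performance measures and multilayer perceptron models are not reproduced here.

```python
import numpy as np


def make_mcar_mask(shape, p_missing, rng):
    """Return a boolean mask; True marks a cell deleted completely at random."""
    return rng.random(shape) < p_missing


def mean_impute(X_missing):
    """Replace NaNs in each numeric column with that column's observed mean."""
    col_means = np.nanmean(X_missing, axis=0)
    return np.where(np.isnan(X_missing), col_means, X_missing)


def run_experiment(n_records=1389, n_features=4, p_missing=0.05, seed=0):
    """Perturb a synthetic numeric data set and score mean imputation."""
    rng = np.random.default_rng(seed)
    # Hypothetical stand-in for a real data set: independent Gaussian columns.
    X_true = rng.normal(loc=10.0, scale=2.0, size=(n_records, n_features))
    # Each cell goes missing independently of every value in the data (MCAR).
    mask = make_mcar_mask(X_true.shape, p_missing, rng)
    X_missing = np.where(mask, np.nan, X_true)
    X_imputed = mean_impute(X_missing)
    # Error is measured only on the cells that were deleted.
    rmse = np.sqrt(np.mean((X_imputed[mask] - X_true[mask]) ** 2))
    return X_true, mask, X_imputed, rmse


if __name__ == "__main__":
    _, mask, _, rmse = run_experiment()
    print(f"fraction missing: {mask.mean():.3f}, mean-imputation RMSE: {rmse:.3f}")
```

Because the true values are known before deletion, any other imputer (regression, hot-deck, or a neural network) could be dropped in place of `mean_impute` and compared on the same masked cells.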
