☆ 3.8 Article

Improving cluster-based missing value estimation of DNA microarray data

BIOMOLECULAR ENGINEERING (2007)

期刊

BIOMOLECULAR ENGINEERING

卷 24, 期 2, 页码 273-282

出版社

ELSEVIER

DOI: 10.1016/j.bioeng.2007.04.003

关键词

missing value estimation; K-nearest neighbours; gene expression data; DNA microarray data

类别

Biochemical Research Methods Biotechnology & Applied Microbiology Genetics & Heredity

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

We present a modification of the weighted K-nearest neighbours imputation method (KNNimpute) for missing values (MVs) estimation in microarray data based on the reuse of estimated data. The method was called iterative KNN imputation (IKNNimpute) as the estimation is performed iteratively using the recently estimated values. The estimation efficiency of lKNNimpme was assessed under different conditions (data type, fraction and structure of missing data) by the normalized root mean squared error (NRMSE) and the correlation coefficients between estimated and true values, and compared with that of other cluster-based estimation methods (KNNimpute and sequential KNN). We further investigated the influence of imputation on the detection of differentially expressed genes using SAM by examining the differentially expressed genes that are lost after MV estimation. The performance measures give consistent results, indicating that the iterative procedure of lKNNimpute can enhance the prediction ability of cluster-based methods in the presence of high missing rates, in non-time series experiments and in data sets comprising both time series and non-time series data, because the information of the genes having MVs is used more efficiently and the iterative procedure allows refining the MV estimates. More importantly, IKNN has a smaller detrimental effect on the detection of differentially expressed genes. (c) 2007 Elsevier B.V. All rights reserved.

Improving cluster-based missing value estimation of DNA microarray data

期刊

BIOMOLECULAR ENGINEERING

出版社

ELSEVIER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Improving cluster-based missing value estimation of DNA microarray data

期刊

BIOMOLECULAR ENGINEERING

出版社

ELSEVIER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文