4.5 Article

PLS model building with missing data: New algorithms and a comparative study

期刊

JOURNAL OF CHEMOMETRICS
卷 31, 期 7, 页码 -

出版社

WILEY
DOI: 10.1002/cem.2897

关键词

imputation; missing data; multivariate calibration; partial least squares regression (PLS); trimmed scores regression (TSR)

资金

  1. Spanish Ministry of Science and Innovation
  2. FEDER
  3. European Union [DPI2011-28112-C04-02, DPI2014-55276-C5-1R]
  4. Spanish Ministry of Economy and Competitiveness [ECO2013-43353-R]

向作者/读者索取更多资源

New algorithms to deal with missing values in predictive modelling are presented in this article. Specifically, 2 trimmed scores regression adaptations are proposed, one from principal component analysis model building with missing data (MD) and other from partial least squares regression model exploitation with missing values. Using these methods, practitioners can impute MD both in the explanatory/predictor and the dependent/response variables. Partial least squares is used here to build the multivariate calibration models; however, any regression method can be used after MD imputation. Four case studies, with different latent structures, are analysed here to compare the trimmed scores regression-based methods against state-of-the-art approaches. The MATLAB code for these methods is also provided for its direct implementation at , under a GNU license. New algorithms to deal with missing values in predictive modelling are presented here. Specifically, two trimmed scores regression (TSR) adaptations are proposed, one from PCA model building with missing data (MD) and other from PLS regression model exploitation with MD. Using these methods, practitioners can impute MD both in the explanatory/predictor and the dependent/response variables. TSR is compared to other state-of-the-art methods in four case studies. The MATLAB code for TSR methods is freely available.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据