期刊
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS
卷 128, 期 -, 页码 17-24出版社
ELSEVIER
DOI: 10.1016/j.chemolab.2013.07.009
关键词
Near infrared; Feature selection; Monte Carlo sampling; Recursive partial least squares
类别
资金
- Qinghai Provincial Natural Science Fund [2012-Z-937Q]
Variable selection is a critical step in data analysis for near infrared spectroscopy. Recently, many studies have been reported on variable selection and researchers have proposed a large number of methods to identify variables (wavelengths) that contribute useful information. In the present study, a key wavelengths selection method named Monte Carlo sampling-recursive partial least squares (MCS-RPLS) is proposed. The method mainly includes three steps: (1) Monte Carlo sampling; (2) feature selection for each subset; and (3) determination of the optimum feature set for the dataset The method has been used for feature selection and multivariate calibration on four near infrared spectroscopic datasets: corn moisture, corn protein, HSA and gamma-globulin of biological samples. And the 10-fold cross validation results are compared with those obtained by full spectra-PLS, Moving Window Partial Least Squares (MWPLS), Monte Carlo-based Uninformative Variable Elimination (MC-UVE) and CARS. The results showed that the data dimensionalities and the RMSECV values of the selected variables are greatly reduced, thus the MCS-RPLS is available for feature selection from NIR data. (C) 2013 Elsevier B.V. All rights reserved.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据