4.7 Article

Application of the combination method based on RF and LE in near infrared spectral modeling

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.saa.2022.122247

关键词

Near -infrared spectral; Laplacian Eigenmaps; Random frog; Partial least squares regression; Support vector regression

向作者/读者索取更多资源

The dimensionality of near-infrared (NIR) spectral data is often large, and dimensionality reduction is crucial for increasing the model's performance. Laplacian Eigenmaps (LE) can preserve local neighborhood information but is disturbed by irrelevant information and multicollinearity. Random Frog (RF) algorithm can eliminate noise and collinearity. Hence, before using LE, RF is used to eliminate irrelevant information and reduce correlation, resulting in improved regression models' prediction accuracy and stability.
The dimensionality of near-infrared (NIR) spectral data is often extremely large. Dimensionality reduction of spectral data can effectively reduce the redundant information and correlation between spectral variables and simplify the model, which is crucial to increasing the model's performance. As a nonlinear feature extraction method, Laplacian Eigenmaps (LE) may preserve the local neighborhood information of the dataset, has high robustness, and is simple to compute. However, when the LE algorithm maps the data from high-dimensional space to low-dimensional space, it is often disturbed by irrelevant information and multicollinearity in the spectral data, which lowers the model's prediction performance. Random Frog (RF) algorithm can eliminate noise and collinearity in the spectrum. Therefore, before using the LE algorithm, we use the RF algorithm to eliminate irrelevant information in the spectrum and reduce the correlation between the spectra variables to increase the efficiency of the LE algorithm. We used the RF + LE algorithm to reduce the dimensionality of two public NIRS datasets (soil datasets and pharmaceutical tablets datasets) and compared it with RF and LE algorithms alone. We utilized Partial Least Squares Regression (PLSR) and Support Vector Regression (SVR) to establish regression models. The experimental findings demonstrate that compared with the RF algorithm and LE algorithm, the RF + LE combination method can reduce the dimension of spectral variables and model complexity, and improve regression models' prediction accuracy and stability. It is an effective dimensionality reduction method for the near-infrared spectrum.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据