4.7 Article

An improved successive projections algorithm version to variable selection in multiple linear regression

Journal

ANALYTICA CHIMICA ACTA
Volume 1274, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.aca.2023.341560

Keywords

Variable selection; Successive projections algorithm; Multilinear regression; Partial least squares; NIR spectrometry

Ask authors/readers for more resources

This study proposes a new algorithm called fSPA-MLR, which enhances the performance of the original SPA-MLR method by adding a filter step to reduce the number of uninformative variables. The fSPA-MLR models demonstrate superior performance compared to PLS and the original SPA-MLR models in both cross-validation and external prediction.
The aim of the successive projections algorithm (SPA) is to enhance the accuracy of multiple linear regressions (MLR) by minimizing the impact of collinearity effects in the calibration data set. Combining SPA with MLR as a variable selection approach has resulted in the SPA-MLR method, which has been reported in literature to produce models with good prediction ability compared to conventional full-spectrum models obtained with partial-least-squares (PLS) in some cases. This paper proposes the addition of a filter step to the current version of the SPA algorithm to reduce the number of uninformative variables before the projection phase and assist the algorithm in selecting the best variables on subsequent steps. The proposed fSPA-MLR algorithm is evaluated in two case studies involving the near-infrared spectrometric analysis of pharmaceutical tablet and diesel/biodiesel mixture samples. Compared to PLS, the fSPA-MLR models demonstrate similar or better performance. Moreover, the fSPA-MLR models outperform the original SPA-MLR in both cross-validation and external prediction. The fSPA-MLR models deliver superior results regardless of the pre-processing algorithm tested, including firstderivative Savitzky-Golay (SG) and Standard Normal Variate (SNV), or even in raw spectra data.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available