4.5 Article

Feature selection using distributions of orthogonal PLS regression vectors in spectral data

Journal

BIODATA MINING
Volume 14, Issue 1, Pages -

Publisher

BMC
DOI: 10.1186/s13040-021-00240-3

Keywords

Feature selection; PLS; Orthogonal signal correction; Regression vector; Permutation test

Ask authors/readers for more resources

Feature selection is crucial in chemometric data analysis to produce predictive models. This paper introduces a method of feature selection through orthogonal PLS regression, which combines signal correction with PLS, to generate empirical distributions and examine the significance of input features on the response variable Y. The performance of this method is demonstrated using simulation studies and real-life data such as NIR spectra and mass spectrometry data.
Feature selection, which is important for successful analysis of chemometric data, aims to produce parsimonious and predictive models. Partial least squares (PLS) regression is one of the main methods in chemometrics for analyzing multivariate data with input X and response Y by modeling the covariance structure in the X and Y spaces. Recently, orthogonal projections to latent structures (OPLS) has been widely used in processing multivariate data because OPLS improves the interpretability of PLS models by removing systematic variation in the X space not correlated to Y. The purpose of this paper is to present a feature selection method of multivariate data through orthogonal PLS regression (OPLSR), which combines orthogonal signal correction with PLS. The presented method generates empirical distributions of features effects upon Y in OPLSR vectors via permutation tests and examines the significance of the effects of the input features on Y. We show the performance of the proposed method using a simulation study in which a three-layer network structure exists in compared with the false discovery rate method. To demonstrate this method, we apply it to both real-life NIR spectra data and mass spectrometry data.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available