Article

Dual-sPLS: A family of Dual Sparse Partial Least Squares regressions for feature selection and prediction with tunable sparsity; evaluation on simulated and near-infrared (NIR) data

Publisher

ELSEVIER
DOI: 10.1016/j.chemolab.2023.104813

Keywords

Partial least squares; Lasso; Ridge; Regression; Sparsity; Dual norm; Chemometrics; Machine learning


Relating variables X to a response y is important in chemometrics. Qualitative interpretation can enhance quantitative prediction by identifying influential features. Projections (e.g. PLS) and variable selection (e.g. lasso) are used for dimension reduction in high-dimensional problems. Dual-sPLS, a generalization of PLS1, balances accurate prediction against efficient interpretation through penalizations inspired by classical regression methods and the notion of the dual norm. It compares favorably with similar regression methods on simulated and real chemical data.
Relating a set of variables X to a response y is crucial in chemometrics. A quantitative prediction objective can be enriched by qualitative data interpretation, for instance by locating the most influential features. When high-dimensional problems arise, dimension reduction techniques can be used. Most notable are projections (e.g. Partial Least Squares, or PLS) and variable selection (e.g. lasso). Sparse partial least squares combines both strategies by blending variable selection into PLS. The variant presented in this paper, Dual-sPLS, generalizes the classical PLS1 algorithm. It provides a balance between accurate prediction and efficient interpretation. It is based on penalizations inspired by classical regression methods (lasso, group lasso, least squares, ridge) and uses the notion of the dual norm. The resulting sparsity is enforced by an intuitive shrinking ratio parameter. Dual-sPLS compares favorably with similar regression methods on simulated and real chemical data.
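
To make the idea of a sparsity-controlling shrinking ratio concrete, the following is a minimal sketch of one sparse PLS1-style weight vector obtained by soft-thresholding the classical direction X^T y. It is not the authors' implementation: the function name `sparse_pls1_weight`, the parameter name `shrink_ratio`, the quantile-based choice of threshold, and the simulated data are all assumptions made for illustration.

```python
import numpy as np

def sparse_pls1_weight(X, y, shrink_ratio):
    """Illustrative sparse weight vector for one PLS1-style component.

    Hypothetical helper: soft-thresholds z = X^T y so that roughly a
    fraction `shrink_ratio` of the coefficients is set to zero.
    """
    z = X.T @ y                          # classical PLS1 direction, up to scaling
    abs_z = np.abs(z)
    # Assumption: the threshold is the empirical quantile of |z|.
    nu = np.quantile(abs_z, shrink_ratio)
    w = np.sign(z) * np.maximum(abs_z - nu, 0.0)   # lasso-like soft-thresholding
    norm = np.linalg.norm(w)
    return w / norm if norm > 0 else w

# Simulated data (assumed): 50 samples, 200 variables, 10 informative ones.
rng = np.random.default_rng(0)
X = rng.standard_normal((50, 200))
beta = np.zeros(200)
beta[:10] = 1.0
y = X @ beta + 0.1 * rng.standard_normal(50)

# Center the data, as is usual before PLS.
X = X - X.mean(axis=0)
y = y - y.mean()

w = sparse_pls1_weight(X, y, shrink_ratio=0.9)
print("non-zero weights:", np.count_nonzero(w), "of", w.size)
```

In this sketch, a larger `shrink_ratio` zeroes more coefficients, so a single parameter in [0, 1) directly sets the proportion of variables discarded for the component.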
