4.7 Article

Cost-efficient unsupervised sample selection for multivariate calibration

出版社

ELSEVIER
DOI: 10.1016/j.chemolab.2021.104352

关键词

Sample selection; Unsupervised learning; Multivariate calibration; PLS regression; VC dimension; NIR

资金

  1. Research Foundation-Flanders (FWO Brussels, Belgium) [1SA0721N]

向作者/读者索取更多资源

This study aims to select the most informative calibration samples in an unsupervised way based on spectral measurements by providing guidelines for addressing challenges in PLSR model building. Recommendations include calculating a sample size exceeding the model complexity by a factor of 12, performing selection in a PCA score space with a sufficient number of principal components, and using methods such as Kennard-Stone.
Indirect quantification of chemical composition through spectral measurements requires the establishment of multivariate calibration models. The reference analyses on the calibration samples typically form a major cost factor in the establishment of these multivariate models. Therefore, the aim of this study was to select the most informative calibration samples in an unsupervised way based on the spectral measurements. To this end, guidelines to address this challenge in PLSR model building have been developed. The recommendations include calculating a sample size that surpasses the model complexity by a factor of 12, performing the selection in the PCA score space spanned by a sufficiently large number of principal components and using methods such as Kennard-Stone, Puchwein, Clustering or D-optimal designs. We provide the data and methodology used in the present study for future use.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据