Journal
JOURNAL OF CHEMOMETRICS
Volume 34, Issue 4, Pages -Publisher
WILEY
DOI: 10.1002/cem.3202
Keywords
factor methods; feature extraction; PCA; PLSR; Raman spectroscopy
Categories
Funding
- Bundesministerium fur Bildung und Forschung [FKZ 03ZZ0444J]
- Deutsche Forschungsgemeinschaft [BO4700/4-1]
- China Scholarship Council
- BMBF [FKZ 01ET1701, FKZ 13N13852]
- Intersept
Ask authors/readers for more resources
Raman spectra of biological samples often exhibit variations originating from changes of spectrometers, measurement conditions, and cultivation conditions. Such unwanted variations make a classification extremely challenging, especially if they are more significant compared with the differences between groups to be separated. A classifier is prone to such unwanted variations (ie, intragroup variations) and can fail to learn the patterns that can help separate different groups (ie, intergroup differences). This often leads to a poor generalization performance and a degraded transferability of the trained model. A natural solution is to separate the intragroup variations from the intergroup differences and build the classifier based on merely the latter information, for example, by a well-designed feature extraction. This forms the idea of this contribution. Herein, we modified two commonly applied feature extraction approaches, principal component analysis (PCA) and partial least squares (PLS), in order to extract merely the features representing the intergroup differences. Both of the methods were verified with two Raman spectral datasets measured from bacterial cultures and colon tissues of mice, respectively. In comparison to ordinary PCA and PLS, the modified PCA was able to improve the prediction on the testing data that bears significant difference to the training data, while the modified PLS could help avoid overfitting and lead to a more stable classification.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available