4.7 Article

Correcting replicate variation in spectroscopic data by machine learning and model-based pre-processing

Journal

Publisher

ELSEVIER
DOI: 10.1016/j.chemolab.2021.104350

Keywords

Spectral pre-processing; Augmentation; EMSC; Machine learning; Deep learning

Funding

  1. Research Council of Norway-FMETEKN Grant [257622]
  2. BIONAER Grant [268305, 305215]
  3. DAAD Grant [309220]
  4. HAVBRUK2 Grant [302543/E40]
  5. MATFONDAVTALE Grant [301834/E50]

Ask authors/readers for more resources

This study introduces a pre-processing and augmentation approach based on EMSC to eliminate replicate variation in spectroscopic data. It compares replicate correction and replicate augmentation as two inverse procedures in terms of classification performance, using different types of spectral data sets for experimentation.
In this study we present a pre-processing and an augmentation approach based on Extended Multiplicative Signal Correction (EMSC) for removing and modelling replicate variation in spectroscopic data. The EMSC replicate correction method estimates replicate variation from replicate samples and integrates the estimated variation into the EMSC model. In the field of deep learning, augmentation is a frequently applied approach to deal with variability in images. In this study, we suggest augmentation of vibrational spectroscopic data with replicate variation. Replicate correction and replicate augmentation can be considered as two inverse procedures which are compared in our study. Three data sets of Fourier Transform Infrared spectra of yeasts, filamentous fungi and bacteria were used in this study to compare classification performance on genus and species level by (1) Random Forest using replicate corrected spectra vs. (2) Deep Learning using augmented spectra. Both technical and biological replicate correction/augmentation were considered.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available