4.7 Article

Chemometric analysis in Raman spectroscopy from experimental design to machine learning-based modeling

Journal

NATURE PROTOCOLS
Volume 16, Issue 12, Pages 5426-+

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41596-021-00620-3

Keywords

-

Funding

  1. Free State of Thuringia [2019 FGR 0083]
  2. European Union via the TAB-FG MorphoTox
  3. BMBF [FKZ 13N15466]
  4. China Scholarship Council (CSS)
  5. German Research Foundation (DFG) [441958208]

Ask authors/readers for more resources

Raman spectroscopy is increasingly used in various fields such as biology, forensics, diagnostics, pharmaceutics, and food science. Chemometric techniques are employed to detect and extract subtle differences in Raman spectra from related samples, aiding in differentiation between different cell types and assessing biological sample chemical variances.
Raman spectroscopy is increasingly being used in biology, forensics, diagnostics, pharmaceutics and food science applications. This growth is triggered not only by improvements in the computational and experimental setups but also by the development of chemometric techniques. Chemometric techniques are the analytical processes used to detect and extract information from subtle differences in Raman spectra obtained from related samples. This information could be used to find out, for example, whether a mixture of bacterial cells contains different species, or whether a mammalian cell is healthy or not. Chemometric techniques include spectral processing (ensuring that the spectra used for the subsequent computational processes are as clean as possible) as well as the statistical analysis of the data required for finding the spectral differences that are most useful for differentiation between, for example, different cell types. For Raman spectra, this analysis process is not yet standardized, and there are many confounding pitfalls. This protocol provides guidance on how to perform a Raman spectral analysis: how to avoid these pitfalls, and strategies to circumvent problematic issues. The protocol is divided into four parts: experimental design, data preprocessing, data learning and model transfer. We exemplify our workflow using three example datasets where the spectra from individual cells were collected in single-cell mode, and one dataset where the data were collected from a raster scanning-based Raman spectral imaging experiment of mice tissue. Our aim is to help move Raman-based technologies from proof-of-concept studies toward real-world applications. Raman spectroscopy is increasingly being used in biological assays and studies. This protocol provides guidance for performing chemometric analysis to detect and extract information relating to the chemical differences between biological samples.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available