4.7 Article

Impact of different classification schemes on discrimination of proteins with noise-contaminated spectra using laboratory-measured fluorescence data

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.saa.2023.122646

关键词

Three-dimensional fluorescence spectrum; Protein; Feature transformation; Multivariate analysis; Classification scheme

向作者/读者索取更多资源

This study investigated the noise-tolerant capability of laboratory-measured fluorescence spectra as a database for detecting and identifying biological agents. The performance of classification schemes and feature descriptors were analyzed under different noise levels. The results showed that robust features extracted with corresponding techniques are critical for enhancing spectral differentiation capabilities and eliminating the noise effect.
Biological agents are important to detect and identify with respect to environmental contamination and public health. Noise contamination in fluorescent spectra is one of the contributors to the uncertainties of identification. In order to investigate the noise-tolerant capability provided by laboratory-measured excitation -emission matrix (EEM) fluorescence spectra that are used as a database, fluorescence properties of four proteinaceous biotoxin samples and ten harmless protein samples were characterized by EEM fluorescence spectra, and the predicting performance of models trained by laboratory-measured fluorescence data was tested and verified from validation data with noise-contaminated spectra. By means of peak signal of noise (PSNR) as an indicator of noise levels, the potential impact of noise contaminations on the characterization and discrimination of these samples was evaluated quantitatively. Different classification schemes utilizing multivariate analysis techniques of Principal Component Analysis (PCA), Random Forest (RF), and Multi -layer Perceptron (MPL) coupled with feature descriptors of differential transform (DT), Fourier transform (FT) and wavelet transform (WT) were conducted under different PSNR values. We systematically analyzed the performance of classification schemes by the case study at 20 PSNR and by statistical analysis from 1-100 PSNR. The results show that the spectral features with EEM-WT decreased the demanding number of input variables while retaining high performances in sample classification. The spectral features with EEM-FT presented the worst performance although having the largest number of features. The distributions of feature importance and contribution were found sensitive to noise contaminations. The classification scheme of PCA prior to MPL with EEM-WT as input presented an improvement in lower PSNR. These results indicate that robust features extracted by corresponding techniques are critical to enhancing the spectral differentiation capabilities among these samples and play an important role in eliminating the noise effect. The study of classification schemes for discriminating protein samples with noise-contaminated spectra presents tremendous potential for future developments in the rapid detection and identification of proteinaceous biotoxins based on three-dimensional fluorescence spectrometry.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据