4.8 Article

Deep Learning Framework for Integrating Multibatch Calibration, Classification, and Pathway Activities

Journal

ANALYTICAL CHEMISTRY
Volume 94, Issue 25, Pages 8937-8946

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/acs.analchem.2c00601

Keywords

-

Ask authors/readers for more resources

By proposing a joint deep learning framework that integrates batch effect removal, classification, and downstream pathway activities, we have validated its effectiveness on metabolomics datasets, achieving higher diagnostic accuracy and notable improvement over other methods.
The amount of available biological data has exploded since the emergence of high-throughput technologies, which is not only revolting the way we recognize molecules and diseases but also bringing novel analytical challenges to bioinformatics analysis. In recent years, deep learning has become a dominant technique in data science. However, classification accuracy is plagued with domain discrepancy. Notably, in the presence of multiple batches, domain discrepancy typically happens between individual batches. Most pairwise adaptation approaches may be suboptimal as they fail to eliminate external factors across multiple batches and take the classification task into account simultaneously. We propose a joint deep learning framework for integrating batch effect removal, classification, and downstream pathway activities upon biological data. To this end, we validate it on two MALDI MS-based metabolomics datasets. We have achieved the highest diagnostic accuracy (ACC), with a notable similar to 10% improvement over other methods. Overall, these results indicate that our approach removes batch effect more effectively than state-of-the-art methods and yields more accurate classification as well as biomarkers for smart diagnosis.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available