期刊
BIOINFORMATICS
卷 38, 期 15, 页码 3759-3767出版社
OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btac407
关键词
-
类别
资金
- Czech Academy of Sciences (CAS) [RVO 68378050, LM2018126]
- Ministry of Education, Youth and Sports of the Czech Republic (MEYS) [CZ.02.1.01/0.0/0.0/16_013/0001789]
- European Structural Investment Funds (ESIF)
- MEYS
Aligning and combining individually preprocessed batches in large-scale multi-batch experiments is a challenging task. We present two methods and demonstrate that retention order swaps are not rare in untargeted LC-MS data.
Motivation: Meticulous selection of chromatographic peak detection parameters and algorithms is a crucial step in preprocessing liquid chromatography-mass spectrometry (LC-MS) data. However, as mass-to-charge ratio and retention time shifts are larger between batches than within batches, finding apt parameters for all samples of a large-scale multi-batch experiment with the aim of minimizing information loss becomes a challenging task. Preprocessing independent batches individually can curtail said problems but requires a method for aligning and combining them for further downstream analysis. Results: We present two methods for aligning and combining individually preprocessed batches in multi-batch LC-MS experiments. Our developed methods were tested on six sets of simulated and six sets of real datasets. Furthermore, by estimating the probabilities of peak insertion, deletion and swap between batches in authentic datasets, we demonstrate that retention order swaps are not rare in untargeted LC-MS data. Availability and implementation: kmersAlignment and rtcorrectedAlignment algorithms are made available as an R package with raw data at https://metabocombiner.img.cas.cz Contact: malinkaf@img.cas.cz Supplementary information: Supplementary data are available at Bioinformatics online.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据