☆ 4.8 Article

Benchmarking of analysis strategies for data-independent acquisition proteomics using a large-scale dataset comprising inter-patient heterogeneity

NATURE COMMUNICATIONS (2022)

期刊

NATURE COMMUNICATIONS

卷 13, 期 1, 页码 -

出版社

NATURE PORTFOLIO

DOI: 10.1038/s41467-022-30094-0

关键词

类别

Multidisciplinary Sciences

资金

state of Baden-Wurttemberg through bwHPC
German Research Foundation (DFG) [INST 35/1134-1 FUGG]
DFG [SFB 992/1 2012]
German Federal Ministry of Education and Research BMBF grant [031 A538A de.NBI-RBC]
Deutsche Forschungsgemeinschaft (DFG) [446058856, 466359513, 444936968, 405351425, 431336276, 438496892, SFB 1453, 441891347, SFB 1479, 423813989, GRK 2606, 322977937, GRK 2344]
ERA PerMed programme (BMBF) [01KU1916, 01KU1915A]
German-Israel Foundation [1444]
German Consortium for Translational Cancer Research (project Impro-Rec)
German Ministry of Education and Research [FKZ031L0080]
Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany's Excellence Strategy [CIBSS-EXC-21892100249960-390939984]
Swiss canton of Grisons [628]
Hans Groeber Foundation

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This study presents a benchmark dataset for evaluating DIA data analysis workflows in clinical settings, using real-world inter-patient heterogeneity. The results demonstrate the effectiveness of gas-phase fractionated spectral libraries and non-parametric permutation-based statistical tests for correctly identifying differentially abundant proteins in DIA analysis.

Numerous software tools exist for data-independent acquisition (DIA) analysis of clinical samples, necessitating their comprehensive benchmarking. We present a benchmark dataset comprising real-world inter-patient heterogeneity, which we use for in-depth benchmarking of DIA data analysis workflows for clinical settings. Combining spectral libraries, DIA software, sparsity reduction, normalization, and statistical tests results in 1428 distinct data analysis workflows, which we evaluate based on their ability to correctly identify differentially abundant proteins. From our dataset, we derive bootstrap datasets of varying sample sizes and use the whole range of bootstrap datasets to robustly evaluate each workflow. We find that all DIA software suites benefit from using a gas-phase fractionated spectral library, irrespective of the library refinement used. Gas-phase fractionation-based libraries perform best against two out of three reference protein lists. Among all investigated statistical tests non-parametric permutation-based statistical tests consistently perform best. Data independent acquisition (DIA) has been gaining momentum in clinical proteomics. Here, the authors create a benchmark dataset comprising inter-patient heterogeneity to compare popular DIA data analysis workflows for identifying differentially abundant proteins.

Benchmarking of analysis strategies for data-independent acquisition proteomics using a large-scale dataset comprising inter-patient heterogeneity

期刊

NATURE COMMUNICATIONS

出版社

NATURE PORTFOLIO

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Benchmarking of analysis strategies for data-independent acquisition proteomics using a large-scale dataset comprising inter-patient heterogeneity

期刊

NATURE COMMUNICATIONS

出版社

NATURE PORTFOLIO

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文