Article

A comprehensive LFQ benchmark dataset on modern day acquisition strategies in proteomics

Journal

SCIENTIFIC DATA
Volume 9, Issue 1

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41597-022-01216-6

Funding

  1. Research Foundation Flanders (FWO) [11B4518N, 1S50918N, 12E9716N]
  2. French Ministry of Research
  3. Investissement d'Avenir Infrastructures Nationales en Biologie et Sante program (ProFi, Proteomics French Infrastructure project) [ANR-10-INBS-08]

In the past decade, there has been a revolution in liquid chromatography-mass spectrometry (LC-MS) based proteomics with the introduction of novel instruments and data acquisition methodologies. However, the lack of a benchmark experimental design hampers the development of algorithms to mine publicly available proteomics datasets. To address this, we present a comprehensive dataset acquired using different instrument platforms and data acquisition methods, allowing for algorithm development and performance assessment.
In the last decade, a revolution in liquid chromatography-mass spectrometry (LC-MS) based proteomics has unfolded with the introduction of dozens of novel instruments that incorporate additional data dimensions through innovative acquisition methodologies, in turn inspiring specialized data analysis pipelines. Simultaneously, a growing number of proteomics datasets have been made publicly available through data repositories such as ProteomeXchange, Zenodo and Skyline Panorama. However, developing algorithms to mine these data and assessing their performance on different platforms is currently hampered by the lack of a single benchmark experimental design. Therefore, we acquired a hybrid proteome mixture on different instrument platforms and in all currently available families of data acquisition. Here, we present a comprehensive Data-Dependent and Data-Independent Acquisition (DDA/DIA) dataset acquired using several of the most commonly used current-day instrumental platforms. The dataset consists of over 700 LC-MS runs, including adequate replicates allowing robust statistics, and covers nearly 10 different data formats, including scanning quadrupole and ion mobility enabled acquisitions. Datasets are available via ProteomeXchange (PXD028735).
