Article

The PRIDE database resources in 2022: a hub for mass spectrometry-based proteomics evidences

Related references

Note: Only some of the references are listed.
Article Biochemical Research Methods

Generation of ENSEMBL-based proteogenomics databases boosts the identification of non-canonical peptides

Husen M. Umer et al.

Summary: The pypgatk package and pgdb workflow have been implemented to create proteogenomics databases based on ENSEMBL resources. The tools can generate protein sequences from different types of transcripts and take into account the impact of genomic variants on protein sequences. Using these tools, researchers have reanalyzed public datasets and identified a significant number of novel protein sequences.

BIOINFORMATICS (2022)

Article Biochemistry & Molecular Biology

UniProt: the universal protein knowledgebase in 2021

Alex Bateman et al.

Summary: The UniProt Knowledgebase aims to provide users with a comprehensive, high-quality set of protein sequences annotated with functional information. Updates over the past two years have increased the number of sequences to approximately 190 million, with new methods to assess proteome completeness and quality. UniProtKB has responded to the COVID-19 pandemic by expertly curating relevant entries and making them rapidly available through a dedicated portal.

NUCLEIC ACIDS RESEARCH (2021)

Article Biochemical Research Methods

BioContainers Registry: Searching Bioinformatics and Proteomics Tools, Packages, and Containers

Jingwen Bai et al.

Summary: BioContainers is an open-source project that provides over 9000 bioinformatics tools, including more than 200 proteomics and mass spectrometry tools. The project aims to standardize software containers and support multiple packaging and container technologies. The BioContainers Registry and RESTful API are designed to make containerized bioinformatics tools more discoverable, accessible, interoperable, and reusable.

JOURNAL OF PROTEOME RESEARCH (2021)

Article Biochemical Research Methods

Deep learning embedder method and tool for mass spectra similarity search

Chunyuan Qin et al.

Summary: Spectral similarity calculation is crucial in proteomics data analysis. Deep learning-based MS/MS spectrum embedding models can improve the similarity calculations used in mass spectral clustering by learning from large-scale training datasets. Benchmark results show that normalized dot product and DLEAMSE have similar accuracy, but DLEAMSE is faster and more efficient for large-scale data comparisons.

JOURNAL OF PROTEOMICS (2021)

Article Biochemical Research Methods

Data Management of Sensitive Human Proteomics Data: Current Practices, Recommendations, and Perspectives for the Future

Nuno Bandeira et al.

Summary: The increase in clinical proteomics studies has raised concerns about managing and disseminating potentially sensitive human proteomics data. Balancing data privacy with efficient use and reuse of research efforts through sharing clinical proteomics data will require development efforts at different levels including bioinformatics infrastructure, policymaking, and mechanisms of oversight.

MOLECULAR & CELLULAR PROTEOMICS (2021)

Editorial Material Multidisciplinary Sciences

The growing need for controlled data access models in clinical proteomics and metabolomics

Thomas M. Keane et al.

Summary: This commentary discusses the current best practices and future perspectives for responsible handling of clinical proteomics and metabolomics data, emphasizing the lack of bioinformatics resources available to manage access to sensitive human datasets in clinical studies.

NATURE COMMUNICATIONS (2021)

Review Multidisciplinary Sciences

A proteomics sample metadata representation for multiomics integration and big data analysis

Chengxin Dai et al.

Summary: The authors proposed a format and software pipeline for presenting and validating metadata of proteomics datasets, integrating them into ProteomeXchange repositories. They implemented MAGE-TAB-Proteomics in a crowdsourcing project to manually curate over 200 public datasets, aiming to improve reproducibility and facilitate reanalysis and integration of public proteomics datasets.

NATURE COMMUNICATIONS (2021)

Article Biochemical Research Methods

Universal Spectrum Explorer: A Standalone (Web-)Application for Cross-Resource Spectrum Comparison

Tobias Schmidt et al.

Summary: The Universal Spectrum Explorer (USE) is a web-based tool for peptide spectrum visualization and comparison. It allows users to manually provide mass spectra or automatically retrieve them from online repositories, as well as request spectra from other resources via a REST interface. The USE supports exporting annotated mirror spectrum plots as editable scalable high-quality vector graphics.

JOURNAL OF PROTEOME RESEARCH (2021)

Article Multidisciplinary Sciences

An integrated landscape of protein expression in human cancer

Andrew F. Jarnuczak et al.

Summary: Utilizing 11 proteomics datasets from the PRIDE database, a reference expression map was constructed for 191 cancer cell lines and 246 clinical tumor samples, revealing unique peptides in tumor samples and highlighting the correlation between baseline expression in cell lines and tumors. Integration of proteomics and transcriptomics data showed a median correlation of 0.58 across cell lines, indicating that mRNA levels are often a poor predictor of changes in protein abundance. This study represents the first meta-analysis focusing on cancer-related public proteomics datasets, emphasizing the shortcomings and limitations of such studies.

SCIENTIFIC DATA (2021)

Article Biotechnology & Applied Microbiology

MaxDIA enables library-based and library-free data-independent acquisition proteomics

Pavel Sinitcyn et al.

Summary: MaxDIA is a software platform specifically designed for analyzing DIA proteomics data within the MaxQuant environment, achieving deep proteome coverage and improved protein quantification accuracy. It provides accurate FDR estimates for hypothesis-free analysis of DIA samples.

NATURE BIOTECHNOLOGY (2021)

Article Biochemical Research Methods

Universal Spectrum Identifier for mass spectra

Eric W. Deutsch et al.

Summary: The Universal Spectrum Identifier (USI) provides a standardized mechanism for encoding virtual paths to mass spectra in public repositories, enabling greater transparency and traceability of spectral evidence. Over 1 billion USI identifications from more than 3 billion spectra are already available through ProteomeXchange repositories, supporting the findings of mass spectrometry proteomics studies.

NATURE METHODS (2021)

Article Biochemistry & Molecular Biology

The COVID-19 Data Portal: accelerating SARS-CoV-2 and COVID-19 research through rapid open access data sharing

Peter W. Harrison et al.

Summary: The global outbreak of SARS-CoV-2 has resulted in significant impacts on human society and millions of deaths. The COVID-19 Data Portal aims to accelerate global research by providing open data sharing and analysis services.

NUCLEIC ACIDS RESEARCH (2021)

Article Biochemistry & Molecular Biology

OpenProt 2021: deeper functional annotation of the coding potential of eukaryotic genomes

Marie A. Brunet et al.

Summary: OpenProt is the first proteogenomic resource that supports a polycistronic annotation model for eukaryotic genomes, providing deeper annotation of open reading frames (ORFs) with supporting evidence from experimental data. The platform re-analyzes ribosome profiling and mass spectrometry datasets to report non-AUG initiation sites and to verify the uniqueness of detected peptides. In addition, detectability statistics and protein relationships are now reported for each protein, and a data analysis platform lets users submit their own datasets for analysis and access the results.

NUCLEIC ACIDS RESEARCH (2021)

Article Biochemistry & Molecular Biology

MatrisomeDB: the ECM-protein knowledge database

Xinhao Shao et al.

NUCLEIC ACIDS RESEARCH (2020)

Letter Biochemical Research Methods

The ELIXIR Core Data Resources: fundamental infrastructure for the life sciences

Rachel Drysdale et al.

BIOINFORMATICS (2020)

Article Biotechnology & Applied Microbiology

The functional landscape of the human phosphoproteome

David Ochoa et al.

NATURE BIOTECHNOLOGY (2020)

Review Biochemical Research Methods

Scalable Data Analysis in Proteomics and Metabolomics Using BioContainers and Workflows Engines

Yasset Perez-Riverol et al.

PROTEOMICS (2020)

Article Biochemical Research Methods

Toward a Sample Metadata Standard in Public Proteomics Repositories

Yasset Perez-Riverol

JOURNAL OF PROTEOME RESEARCH (2020)

Article Biochemical Research Methods

Scop3P: A Comprehensive Resource of Human Phosphosites within Their Full Context

Pathmanaban Ramasamy et al.

JOURNAL OF PROTEOME RESEARCH (2020)

Article Biochemical Research Methods

MassIVE.quant: a community resource of quantitative mass spectrometry-based proteomics datasets

Meena Choi et al.

NATURE METHODS (2020)

Article Biochemistry & Molecular Biology

Toward Increased Reliability, Transparency, and Accessibility in Cross-linking Mass Spectrometry

Alexander Leitner et al.

STRUCTURE (2020)

Article Biochemistry & Molecular Biology

The jPOST environment: an integrated proteomics data repository and database

Yuki Moriya et al.

NUCLEIC ACIDS RESEARCH (2019)

Article Biochemistry & Molecular Biology

iProX: an integrated proteome resource

Jie Ma et al.

NUCLEIC ACIDS RESEARCH (2019)

Article Biotechnology & Applied Microbiology

Co-regulation map of the human proteome enables identification of protein functions

Georg Kustatscher et al.

NATURE BIOTECHNOLOGY (2019)

Article Multidisciplinary Sciences

Quantifying the impact of public omics data

Yasset Perez-Riverol et al.

NATURE COMMUNICATIONS (2019)

Article Biochemical Research Methods

Protein Inference Using PIA Workflows and PSI Standard File Formats

Julian Uszkoreit et al.

JOURNAL OF PROTEOME RESEARCH (2019)

Article Biochemistry & Molecular Biology

The PRIDE database and related tools and resources in 2019: improving support for quantification data

Yasset Perez-Riverol et al.

NUCLEIC ACIDS RESEARCH (2019)

Article Biochemical Research Methods

Panorama Public: A Public Repository for Quantitative Data Sets Processed in Skyline

Vagisha Sharma et al.

MOLECULAR & CELLULAR PROTEOMICS (2018)

Article Biotechnology & Applied Microbiology

OpenMS - A platform for reproducible analysis of mass spectrometry data

Julianus Pfeuffer et al.

JOURNAL OF BIOTECHNOLOGY (2017)

Article Biochemical Research Methods

The mzIdentML Data Standard Version 1.2, Supporting Advances in Proteome Informatics

Juan Antonio Vizcaino et al.

MOLECULAR & CELLULAR PROTEOMICS (2017)

Article Biochemical Research Methods

OLS Client and OLS Dialog: Open Source Tools to Annotate Public Omics Datasets

Yasset Perez-Riverol et al.

PROTEOMICS (2017)

Article Biochemical Research Methods

ProtVista: visualization of protein sequence annotations

Xavier Watkins et al.

BIOINFORMATICS (2017)

Article Biochemical Research Methods

Recognizing millions of consistently unidentified spectra across hundreds of shotgun proteomics datasets

Johannes Griss et al.

NATURE METHODS (2016)

Article Biochemical Research Methods

ms-data-core-api: an open-source, metadata-oriented library for computational proteomics

Yasset Perez-Riverol et al.

BIOINFORMATICS (2015)

Article Biochemical Research Methods

PIA: An Intuitive Protein Inference Engine with a Web-Based User Interface

Julian Uszkoreit et al.

JOURNAL OF PROTEOME RESEARCH (2015)

Editorial Material Biochemical Research Methods

Identifying novel biomarkers through data mining - A realistic scenario?

Johannes Griss et al.

PROTEOMICS CLINICAL APPLICATIONS (2015)

Letter Biotechnology & Applied Microbiology

ProteomeXchange provides globally coordinated proteomics data submission and dissemination

Juan A. Vizcaino et al.

NATURE BIOTECHNOLOGY (2014)

Article Biochemical Research Methods

How to submit MS proteomics data to ProteomeXchange via the PRIDE database

Tobias Ternent et al.

PROTEOMICS (2014)

Article Biochemical Research Methods

jmzTab: A Java interface to the mzTab data standard

Qing-Wei Xu et al.

PROTEOMICS (2014)

Article Biochemistry & Molecular Biology

The BioSample Database (BioSD) at the European Bioinformatics Institute

Mikhail Gostev et al.

NUCLEIC ACIDS RESEARCH (2012)

Article Biochemical Research Methods

jmzIdentML API: A Java interface to the mzIdentML standard for peptide and protein identification data

Florian Reisinger et al.

PROTEOMICS (2012)

Article Biochemical Research Methods

PASSEL: The PeptideAtlas SRM experiment library

Terry Farrah et al.

PROTEOMICS (2012)

Article Biochemical Research Methods

mzML - a Community Standard for Mass Spectrometry Data

Lennart Martens et al.

MOLECULAR & CELLULAR PROTEOMICS (2011)

Review Biochemistry & Molecular Biology

PeptideAtlas: a resource for target selection for emerging targeted proteomics workflows

Eric W. Deutsch et al.

EMBO REPORTS (2008)

Article Biochemical Research Methods

Clinical proteomics: A need to define the field and to begin to set adequate standards

Harald Mischak et al.

PROTEOMICS CLINICAL APPLICATIONS (2007)