4.8 Article

OpenProt 2021: deeper functional annotation of the coding potential of eukaryotic genomes

Journal

NUCLEIC ACIDS RESEARCH
Volume 49, Issue D1, Pages D380-D388

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/nar/gkaa1036

Keywords

-

Funding

  1. Canada Research Chairs
  2. Canadian Institutes of Health Research (CIHR) [MOP-137056, MOP-136962]

Ask authors/readers for more resources

OpenProt is the first proteogenomic resource that supports a polycistronic annotation model for eukaryotic genomes, providing deeper annotation of open reading frames (ORFs) with supporting evidence from experimental data. The platform re-analyzes ribosome profiling and mass spectrometry datasets to report non-AUG initiation starts and control the unicity of detected peptides. In addition, detectability statistics and protein relationships are now reported for each protein, and a data analysis platform is offered for users to submit their datasets for analysis and access the results.
OpenProt (www.openprot.org) is the first proteogenomic resource supporting a polycistronic annotation model for eukaryotic genomes. It provides a deeper annotation of open reading frames (ORFs) while mining experimental data for supporting evidence using cutting-edge algorithms. This update presents the major improvements since the initial release of OpenProt. All species support recent NCBI RefSeq and Ensembl annotations, with changes in annotations being reported in OpenProt. Using the 131 ribosome profiling datasets re-analysed by OpenProt to date, non-AUG initiation starts are reported alongside a confidence score of the initiating codon. From the 177 mass spectrometry datasets re-analysed by OpenProt to date, the unicity of the detected peptides is controlled at each implementation. Furthermore, to guide the users, detectability statistics and protein relationships (isoforms) are now reported for each protein. Finally, to foster access to deeper ORF annotation independently of one's bioinformatics skills or computational resources, OpenProt now offers a data analysis platform. Users can submit their dataset for analysis and receive the results from the analysis by OpenProt. All data on OpenProt are freely available and downloadable for each species, the release-based format ensuring a continuous access to the data. Thus, OpenProt enables a more comprehensive annotation of eukaryotic genomes and fosters functional proteomic discoveries.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available