4.6 Article

Efficient Detection of the Alternative Spliced Human Proteome Using Translatome Sequencing

Journal

FRONTIERS IN MOLECULAR BIOSCIENCES
Volume 9, Issue -, Pages -

Publisher

FRONTIERS MEDIA SA
DOI: 10.3389/fmolb.2022.895746

Keywords

alternative splicing; translatome sequencing; mass spectrometry; proteome; isoform; human proteome project

Funding

  1. Ministry of Science and Technology of China, National Key Research and Development Program [2017YFA0505001/2017YFA0505101/2018YFC0 910201/2018YFC0910202]
  2. National Natural Science Funds of China [81802916/82002949]
  3. Guangdong Key RD Program [2019B020226001]
  4. State Key Laboratory of Respiratory Disease, Guangdong-Hong Kong-Macao Joint Laboratory of Respiratory Infectious Disease [02-000-2101-5061]
  5. Fundamental Research Funds for the Central Universities

Ask authors/readers for more resources

In this study, a full-length RNC-seq strategy was demonstrated to construct a protein database containing all AS isoforms, and it showed higher efficiency and accuracy in identifying protein isoforms.
Alternative splicing (AS) isoforms create numerous proteoforms, expanding the complexity of the genome. Highly similar sequences, incomplete reference databases and the insufficient sequence coverage of mass spectrometry limit the identification of AS proteoforms. Here, we demonstrated full-length translating mRNAs (ribosome nascent-chain complex-bound mRNAs, RNC-mRNAs) sequencing (RNC-seq) strategy to sequence the entire translating mRNA using next-generation sequencing, including short-read and long-read technologies, to construct a protein database containing all translating AS isoforms. Taking the advantage of read length, short-read RNC-seq identified up to 15,289 genes and 15,906 AS isoforms in a single human cell line, much more than the Ribo-seq. The single-molecule long-read RNC-seq supplemented 4,429 annotated AS isoforms that were not identified by short-read datasets, and 4,525 novel AS isoforms that were not included in the public databases. Using such RNC-seq-guided database, we identified 6,766 annotated protein isoforms and 50 novel protein isoforms in mass spectrometry datasets. These results demonstrated the potential of full-length RNC-seq in investigating the proteome of AS isoforms.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available