4.8 Article

Accurate annotation of human protein-coding small open reading frames

期刊

NATURE CHEMICAL BIOLOGY
卷 16, 期 4, 页码 458-+

出版社

NATURE PUBLISHING GROUP
DOI: 10.1038/s41589-019-0425-0

关键词

-

资金

  1. NIGMS NIH HHS [R01 GM102491] Funding Source: Medline

向作者/读者索取更多资源

Functional protein-coding small open reading frames (smORFs) are emerging as an important class of genes. However, the number of translated smORFs in the human genome is unclear because proteogenomic methods are not sensitive enough, and, as we show, Ribo-seq strategies require additional measures to ensure comprehensive and accurate smORF annotation. Here, we integrate de novo transcriptome assembly and Ribo-seq into an improved workflow that overcomes obstacles with previous methods, to more confidently annotate thousands of smORFs. Evolutionary conservation analyses suggest that hundreds of smORF-encoded microproteins are likely functional. Additionally, many smORFs are regulated during fundamental biological processes, such as cell stress. Peptides derived from smORFs are also detectable on human leukocyte antigen complexes, revealing smORFs as a source of antigens. Thus, by including additional validation into our smORF annotation workflow, we accurately identify thousands of unannotated translated smORFs that will provide a rich pool of unexplored, functional human genes. An improved workflow combining de novo transcriptome assembly and Ribo-seq validated by cellular antigen display is developed to maximize small peptide discovery, leading to identification of thousands of unannotated protein-coding smORFs.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据