4.5 Review

Understanding small ORF diversity through a comprehensive transcription feature classification

Journal

DNA RESEARCH
Volume 28, Issue 5, Pages -

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/dnares/dsab007

Keywords

genome annotation; smORF peptides; long non-coding RNA; dual functional RNA; alternative ORFs

Funding

  1. Coordenacao de Aperfeicoamento de Pessoal de Nivel Superior -CAPES
  2. Fundacao de Amparo a Pesquisa do Estado do Rio de Janeiro -FAPERJ [E-26/202.736/2019, E-26/211.169/2019, E-26/202.605/2019, E-210.264/2018]
  3. Conselho Nacional de Desenvolvimento Cientifico e Tecnologico CNPq

Ask authors/readers for more resources

This review proposes a classification of smORFs based on transcriptional features and discusses promising approaches to investigate them. SmORFs are potentially important coding sequences that have been historically overlooked, but are now gaining attention in systems biology.
Small open reading frames (small ORFs/sORFs/smORFs) are potentially coding sequences smaller than 100 codons that have historically been considered junk DNA by gene prediction software and in annotation screening; however, the advent of next-generation sequencing has contributed to the deeper investigation of junk DNA regions and their transcription products, resulting in the emergence of smORFs as a new focus of interest in systems biology. Several smORF peptides were recently reported in non-canonical mRNAs as new players in numerous biological contexts; however, their relevance is still overlooked in coding potential analysis. Hence, this review proposes a smORF classification based on transcriptional features, discussing the most promising approaches to investigate smORFs based on their different characteristics. First, smORFs were divided into non-expressed (intergenic) and expressed (genic) smORFs. Second, genic smORFs were classified as smORFs located in non-coding RNAs (ncRNAs) or canonical mRNAs. Finally, smORFs in ncRNAs were further subdivided into sequences located in small or long RNAs, whereas smORFs located in canonical mRNAs were subdivided into several specific classes depending on their localization along the gene. We hope that this review provides new insights into large-scale annotations and reinforces the role of smORFs as essential components of a hidden coding DNA world.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available