Review

Short open reading frames (sORFs) and microproteins: an update on their identification and validation measures

Related references

Note: Only some of the references are listed.
Review Microbiology

A Practical Guide to Small Protein Discovery and Characterization Using Mass Spectrometry

Christian H. Ahrens et al.

Summary: Small proteins are abundant biomolecules, but they are often missed in genome annotations and difficult to identify using standard experimental approaches. Mass spectrometry has great potential for small protein discovery and characterization, but current methods have limitations. This review discusses the challenges and adjustments needed for small protein analysis using mass spectrometry, as well as future directions for improving their detection and characterization.

JOURNAL OF BACTERIOLOGY (2022)

Review Biochemistry & Molecular Biology

Revisiting sORFs: overcoming challenges to identify and characterize functional microproteins

Dorte Schlesinger et al.

Summary: Short ORFs (sORFs), which contain a start and stop codon within 100 codons, can be found in organisms across all domains of life and often outnumber annotated protein-coding ORFs. Recent advancements in technology have led to the identification of thousands of potential coding sORFs, shedding light on the overlooked coding potential of these small proteins. The emerging field of microproteins in eukaryotes shows promise for uncovering new functional small proteins encoded in the genome.

FEBS JOURNAL (2022)

Article Biochemical Research Methods

DeepCPP: a deep neural network based on nucleotide bias information and minimum distribution similarity feature selection for RNA coding potential prediction

Yu Zhang et al.

Summary: The development of deep sequencing technologies has led to the discovery of novel transcripts. Many methods have been developed for assessing the coding potential of these transcripts, with DeepCPP being a deep learning method that outperforms other state-of-the-art methods, especially on sORF type data. The use of discontinuous k-mer, nucleotide bias, and minimum distribution similarity feature selection methods proved crucial in improving the accuracy of coding potential prediction for RNA.

BRIEFINGS IN BIOINFORMATICS (2021)

Review Spectroscopy

Widening the bottleneck of phosphoproteomics: Evolving strategies for phosphopeptide enrichment

Teck Yew Low et al.

Summary: Phosphorylation is a crucial posttranslational modification of proteins, and phosphoproteomics is the study of the phosphorylated proteome using mass spectrometry. This field faces challenges because phosphorylated proteins are abundant in the human proteome. The future of phosphoproteomics involves exploring the noncanonical phosphoproteome and unraveling the dark phosphoproteome.

MASS SPECTROMETRY REVIEWS (2021)

Article Biochemistry & Molecular Biology

A platform for curated products from novel open reading frames prompts reinterpretation of disease variants

Matthew D. C. Neville et al.

Summary: Recent studies have shown that there are significant numbers of uncharacterized open reading frames (ORFs) in eukaryotic genomes, particularly in humans, which are mostly distributed in diverse regions of the genome. It is important to evaluate the potential functional importance of these unannotated transcripts and proteins at a larger scale. The creation of a valuable nORFs data set with experimental evidence of translation, along with measures of heritability and selection, has implications for reinterpreting genetic variants previously classified as benign or of uncertain significance.

GENOME RESEARCH (2021)

Editorial Material Multidisciplinary Sciences

A wealth of discovery built on the Human Genome Project - by the numbers

Alexander J. Gates et al.

Summary: The new analysis examines the impact of the draft genome on genomics since 2001, highlighting its effects on publications, drug approvals, and understanding of diseases.

NATURE (2021)

Review Genetics & Heredity

Minireview: Novel Micropeptide Discovery by Proteomics and Deep Sequencing Methods

Ravi Tharakan et al.

Summary: Micropeptides, a novel class of small proteins shorter than 100 amino acids, have been found to play important roles in physiological and cellular systems. Ongoing research is focused on discovering and characterizing more micropeptides using -omics methods such as proteomics, RNA sequencing, and ribosome profiling. Future endeavors in the micropeptide field will face methodological and conceptual challenges.

FRONTIERS IN GENETICS (2021)

Review Biochemistry & Molecular Biology

Recent progress in mass spectrometry-based strategies for elucidating protein-protein interactions

Teck Yew Low et al.

Summary: This review discusses the underlying principles, advantages, limitations and experimental considerations of emerging MS-based proteomics techniques for characterising protein-protein interactions. In addition, it briefly accounts for how these techniques are used to investigate the structural and functional properties of protein complexes, including their topology, stoichiometry, copy number and dynamics.

CELLULAR AND MOLECULAR LIFE SCIENCES (2021)

Review Biotechnology & Applied Microbiology

An overview on miRNA-encoded peptides in plant biology research

Ankita Yadav et al.

Summary: This review discusses the importance of miPEPs as regulators of miRNAs, which enhance the activity of miRNAs by increasing their accumulation and subsequently downregulating target genes to improve plant growth and plant-microbe interaction. miPEPs are considered novel and effective tools for enhancing desired plant traits.

GENOMICS (2021)

Article Mathematical & Computational Biology

MetamORF: a repository of unique short open reading frames identified by both experimental and computational approaches for gene and metagene analyses

Sebastien A. Choteau et al.

Summary: High-throughput technologies have revealed the presence of non-canonical short open reading frames (sORFs) on most eukaryotic ribonucleic acids. MetamORF provides a repository of unique sORFs identified in the human and mouse genomes for future investigations. The database offers new analyses at locus, gene, transcript, and ORF levels and is accessible through a user-friendly web interface.

DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION (2021)

Article Biochemistry & Molecular Biology

OpenProt 2021: deeper functional annotation of the coding potential of eukaryotic genomes

Marie A. Brunet et al.

Summary: OpenProt is the first proteogenomic resource that supports a polycistronic annotation model for eukaryotic genomes, providing deeper annotation of open reading frames (ORFs) with supporting evidence from experimental data. The platform re-analyzes ribosome profiling and mass spectrometry datasets to report non-AUG initiation starts and control the unicity of detected peptides. In addition, detectability statistics and protein relationships are now reported for each protein, and a data analysis platform is offered for users to submit their datasets for analysis and access the results.

NUCLEIC ACIDS RESEARCH (2021)

Article Oncology

The hidden world of membrane microproteins

Catherine A. Makarewich

EXPERIMENTAL CELL RESEARCH (2020)

Review Oncology

The hunt for sORFs: A multidisciplinary strategy

Marlies K. R. Peeters et al.

EXPERIMENTAL CELL RESEARCH (2020)

Review Biochemistry & Molecular Biology

When Long Noncoding Becomes Protein Coding

Corrine Corrina R. Hartford et al.

MOLECULAR AND CELLULAR BIOLOGY (2020)

Review Biochemistry & Molecular Biology

Emerging role of tumor-related functional peptides encoded by lncRNA and circRNA

Pan Wu et al.

MOLECULAR CANCER (2020)

Article Biochemistry & Molecular Biology

Accurate annotation of human protein-coding small open reading frames

Thomas F. Martinez et al.

NATURE CHEMICAL BIOLOGY (2020)

Article Multidisciplinary Sciences

Pervasive functional translation of noncanonical human open reading frames

Jin Chen et al.

SCIENCE (2020)

Letter Biotechnology & Applied Microbiology

PsORF: a database of small ORFs in plants

Yanjun Chen et al.

PLANT BIOTECHNOLOGY JOURNAL (2020)

Article Biochemical Research Methods

Comparative Proteomic Profiling of Unannotated Microproteins and Alternative Proteins in Human Cell Lines

Xiongwen Cao et al.

JOURNAL OF PROTEOME RESEARCH (2020)

Editorial Material Cell Biology

The functions of short ORFs and their microproteins

Eytan Zlotorynski

NATURE REVIEWS MOLECULAR CELL BIOLOGY (2020)

Article Chemistry, Analytical

Proteomics Using Protease Alternatives to Trypsin Benefits from Sequential Digestion with Trypsin

Therese Dau et al.

ANALYTICAL CHEMISTRY (2020)

Article Biochemistry & Molecular Biology

Translation of small downstream ORFs enhances translation of canonical main open reading frames

Qiushuang Wu et al.

EMBO JOURNAL (2020)

Article Biochemical Research Methods

Using AnABlast for intergenic sORF prediction in the Caenorhabditis elegans genome

C. S. Casimiro-Soriguer et al.

BIOINFORMATICS (2020)

Review Oncology

The microproteome of cancer: From invisibility to relevance

Inaki Merino-Valverde et al.

EXPERIMENTAL CELL RESEARCH (2020)

Review Oncology

Evolution of new proteins from translated sORFs in long non-coding RNAs

Jorge Ruiz-Orera et al.

EXPERIMENTAL CELL RESEARCH (2020)

Article Biochemical Research Methods

smORFunction: a tool for predicting functions of small open reading frames and microproteins

Xiangwen Ji et al.

BMC BIOINFORMATICS (2020)

Article Biochemical Research Methods

Optimized Proteomics Workflow for the Detection of Small Proteins

Juergen Bartel et al.

JOURNAL OF PROTEOME RESEARCH (2020)

Review Oncology

Some like it translated: small ORFs in the 5′UTR

Peter F. Renz et al.

EXPERIMENTAL CELL RESEARCH (2020)

Review Biochemical Research Methods

The small peptide world in long noncoding RNAs

Seo-Won Choi et al.

BRIEFINGS IN BIOINFORMATICS (2019)

Article Genetics & Heredity

MOTS-c peptide regulates adipose homeostasis to prevent ovariectomy-induced metabolic dysfunction

Huanyu Lu et al.

JOURNAL OF MOLECULAR MEDICINE-JMM (2019)

Article Biochemistry & Molecular Biology

CPPred: coding potential prediction based on the global description of RNA sequence

Xiaoxue Tong et al.

NUCLEIC ACIDS RESEARCH (2019)

Article Biochemistry & Molecular Biology

The Translational Landscape of the Human Heart

Sebastiaan van Heesch et al.

Article Biochemistry & Molecular Biology

A hidden human proteome encoded by 'non-coding' genes

Shaohua Lu et al.

NUCLEIC ACIDS RESEARCH (2019)

Article Biochemical Research Methods

MiPepid: MicroPeptide identification tool using machine learning

Mengmeng Zhu et al.

BMC BIOINFORMATICS (2019)

Article Multidisciplinary Sciences

Regulation of the ER stress response by a mitochondrial microprotein

Qian Chu et al.

NATURE COMMUNICATIONS (2019)

Article Biochemical Research Methods

Mining for Small Translated ORFs

Anastasia Chugunova et al.

JOURNAL OF PROTEOME RESEARCH (2018)

Article Biochemistry & Molecular Biology

An update on sORFs.org: a repository of small ORFs identified by ribosome profiling

Volodimir Olexiouk et al.

NUCLEIC ACIDS RESEARCH (2018)

Review Biochemistry & Molecular Biology

Approaches to identify and characterize microProteins and their potential uses in biotechnology

Kaushal Kumar Bhati et al.

CELLULAR AND MOLECULAR LIFE SCIENCES (2018)

Article Biochemical Research Methods

Comprehensive Peptide Analysis of Mouse Brain Striatum Identifies Novel sORF-Encoded Polypeptides

Harshavardhan Budamgunta et al.

PROTEOMICS (2018)

Article Multidisciplinary Sciences

Discovery of coding regions in the human genome by integrated proteogenomics analysis workflow

Yafeng Zhu et al.

NATURE COMMUNICATIONS (2018)

Review Pharmacology & Pharmacy

Peptides/Proteins Encoded by Non-coding RNA: A Novel Resource Bank for Drug Targets and Biomarkers

Song Zhu et al.

FRONTIERS IN PHARMACOLOGY (2018)

Article Biochemical Research Methods

SmProt: a database of small proteins encoded by annotated coding and non-coding RNA loci

Yajing Hao et al.

BRIEFINGS IN BIOINFORMATICS (2018)

Article Biochemistry & Molecular Biology

Ribonuclease selection for ribosome profiling

Maxim V. Gerashchenko et al.

NUCLEIC ACIDS RESEARCH (2017)

Review Cell Biology

Non-AUG translation: a new start for protein synthesis in eukaryotes

Michael G. Kearse et al.

GENES & DEVELOPMENT (2017)

Article Evolutionary Biology

Cross-Species Genome-Wide Identification of Evolutionary Conserved MicroProteins

Daniel Straub et al.

GENOME BIOLOGY AND EVOLUTION (2017)

Article Multidisciplinary Sciences

Regulation of DNA repair pathway choice in S and G2 phases by the NHEJ inhibitor CYREN

Nausica Arnoult et al.

NATURE (2017)

Review Cell Biology

Classification and function of small open reading frames

Juan-Pablo Couso et al.

NATURE REVIEWS MOLECULAR CELL BIOLOGY (2017)

Article Biochemistry & Molecular Biology

CPC2: a fast and accurate coding potential calculator based on sequence intrinsic features

Yu-Jian Kang et al.

NUCLEIC ACIDS RESEARCH (2017)

Article Biochemical Research Methods

Spectral Libraries for SWATH-MS Assays for Drosophila melanogaster and Solanum lycopersicum

Bertrand Fabre et al.

PROTEOMICS (2017)

Article Biochemistry & Molecular Biology

Identification of Microprotein-Protein Interactions via APEX Tagging

Qian Chu et al.

BIOCHEMISTRY (2017)

Article Biochemical Research Methods

ARA-PEPs: a repository of putative sORF-encoded peptides in Arabidopsis thaliana

Rashmi R. Hazarika et al.

BMC BIOINFORMATICS (2017)

Article Biochemical Research Methods

Detecting actively translated open reading frames in ribosome profiling data

Lorenzo Calviello et al.

NATURE METHODS (2016)

Article Chemistry, Analytical

Improved Identification and Analysis of Small Open Reading Frame Encoded Polypeptides

Jiao Ma et al.

ANALYTICAL CHEMISTRY (2016)

Article Biochemistry & Molecular Biology

Ribosome Footprint Profiling of Translation throughout the Genome

Nicholas T. Ingolia

Review Biochemistry & Molecular Biology

Reconciling proteomics with next generation sequencing

Teck Yew Low et al.

CURRENT OPINION IN CHEMICAL BIOLOGY (2016)

Article Biochemistry & Molecular Biology

Upstream ORFs are prevalent translational repressors in vertebrates

Timothy G. Johnstone et al.

EMBO JOURNAL (2016)

Review Biochemistry & Molecular Biology

MOTS-c: A novel mitochondrial-derived peptide regulating muscle and fat metabolism

Changhan Lee et al.

FREE RADICAL BIOLOGY AND MEDICINE (2016)

Article Biochemical Research Methods

Six alternative proteases for mass spectrometry-based proteomics beyond trypsin

Piero Giansanti et al.

NATURE PROTOCOLS (2016)

Article Biochemical Research Methods

The MaxQuant computational platform for mass spectrometry-based shotgun proteomics

Stefka Tyanova et al.

NATURE PROTOCOLS (2016)

Review Genetics & Heredity

Open questions in the study of de novo genes: what, how and why

Aoife McLysaght et al.

NATURE REVIEWS GENETICS (2016)

Review Biochemistry & Molecular Biology

Decoding sORF translation - from small proteins to gene regulation

Luis Enrique Cabrera-Quio et al.

RNA BIOLOGY (2016)

Review Plant Sciences

The Emerging World of Small ORFs

Roger P. Hellens et al.

TRENDS IN PLANT SCIENCE (2016)

Article Biochemistry & Molecular Biology

A Micropeptide Encoded by a Putative Long Noncoding RNA Regulates Muscle Performance

Douglas M. Anderson et al.

Review Biochemistry & Molecular Biology

Proteomics beyond trypsin

Liana Tsiatsiani et al.

FEBS JOURNAL (2015)

Review Cell Biology

Ribosome profiling reveals the what, when, where and how of protein synthesis

Gloria A. Brar et al.

NATURE REVIEWS MOLECULAR CELL BIOLOGY (2015)

Review Biochemistry & Molecular Biology

Toxic proteins in plants

Liuyi Dang et al.

PHYTOCHEMISTRY (2015)

Article Biochemistry & Molecular Biology

PROTEOFORMER: deep proteome coverage through ribosome profiling and MS integration

Jeroen Crappe et al.

NUCLEIC ACIDS RESEARCH (2015)

Article Biotechnology & Applied Microbiology

Extensive identification and analysis of conserved small ORFs in animals

Sebastian D. Mackowiak et al.

GENOME BIOLOGY (2015)

Article Biochemical Research Methods

uPEPperoni: An online tool for upstream open reading frame location and analysis of transcript conservation

Adam Skarshewski et al.

BMC BIOINFORMATICS (2014)

Article Biochemistry & Molecular Biology

An Integrated Approach Reveals Regulatory Controls on Bacterial Translation Elongation

Arvind R. Subramaniam et al.

Article Biochemistry & Molecular Biology

Identification of small ORFs in vertebrates using ribosome footprinting and evolutionary conservation

Ariel A. Bazzini et al.

EMBO JOURNAL (2014)

Article Biochemistry & Molecular Biology

A Human Short Open Reading Frame (sORF)-encoded Polypeptide That Stimulates DNA End Joining

Sarah A. Slavoff et al.

JOURNAL OF BIOLOGICAL CHEMISTRY (2014)

Article Biochemical Research Methods

HiRIEF LC-MSMS enables deep proteome coverage and unbiased proteogenomics

Rui M. M. Branca et al.

NATURE METHODS (2014)

Review Biochemical Research Methods

Proteogenomics: concepts, applications and computational strategies

Alexey I. Nesvizhskii

NATURE METHODS (2014)

Article Biochemical Research Methods

Detecting small plant peptides using SPADA (Small Peptide Alignment Discovery Application)

Peng Zhou et al.

BMC BIOINFORMATICS (2013)

Review Biochemistry & Molecular Biology

lincRNAs: Genomics, Evolution, and Mechanisms

Igor Ulitsky et al.

Article Biochemistry & Molecular Biology

Ribosome Profiling Provides Evidence that Large Noncoding RNAs Do Not Encode Proteins

Mitchell Guttman et al.

Article Biochemistry & Molecular Biology

Peptidomic discovery of short open reading frame-encoded peptides in human cells

Sarah A. Slavoff et al.

NATURE CHEMICAL BIOLOGY (2013)

Article Multidisciplinary Sciences

Small open reading frames associated with morphogenesis are hidden in plant genomes

Kousuke Hanada et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2013)

Article Biochemistry & Molecular Biology

Expressed Pseudogenes in the Transcriptional Landscape of Human Cancers

Shanker Kalyana-Sundaram et al.

Article Biochemical Research Methods

Estimation of Absolute Protein Quantities of Unlabeled Samples by Selected Reaction Monitoring Mass Spectrometry

Christina Ludwig et al.

MOLECULAR & CELLULAR PROTEOMICS (2012)

Review Oncology

Functional phosphoproteomic mass spectrometry-based approaches

Elena Lopez et al.

CLINICAL AND TRANSLATIONAL MEDICINE (2012)

Article Biochemical Research Methods

PhyloCSF: a comparative genomics method to distinguish protein coding and non-coding regions

Michael F. Lin et al.

BIOINFORMATICS (2011)

Review Biochemistry & Molecular Biology

Small Open Reading Frames: Current Prediction Techniques and Future Prospect

Haoyu Cheng et al.

CURRENT PROTEIN & PEPTIDE SCIENCE (2011)

Article Biotechnology & Applied Microbiology

Hundreds of putatively functional small open reading frames in Drosophila

Emmanuel Ladoukakis et al.

GENOME BIOLOGY (2011)

Article Biochemical Research Methods

sORF finder: a program package to identify small open reading frames with high coding potential

Kousuke Hanada et al.

BIOINFORMATICS (2010)

Article Biochemical Research Methods

Value of Using Multiple Proteases for Large-Scale Mass Spectrometry-Based Proteomics

Danielle L. Swaney et al.

JOURNAL OF PROTEOME RESEARCH (2010)

Review Biochemical Research Methods

A guided tour of the Trans-Proteomic Pipeline

Eric W. Deutsch et al.

PROTEOMICS (2010)

Article Multidisciplinary Sciences

Cell-type-specific isolation of ribosome-associated mRNA from complex tissues

Elisenda Sanz et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2009)

Article Multidisciplinary Sciences

Genome-Wide Analysis in Vivo of Translation with Nucleotide Resolution Using Ribosome Profiling

Nicholas T. Ingolia et al.

SCIENCE (2009)

Article Biochemistry & Molecular Biology

A Translational Profiling Approach for the Molecular Characterization of CNS Cell Types

Myriam Heiman et al.

Review Biochemical Research Methods

Differentiating Protein-Coding and Noncoding RNA: Challenges and Ambiguities

Marcel E. Dinger et al.

PLOS COMPUTATIONAL BIOLOGY (2008)

Article Biochemistry & Molecular Biology

CPC: assess the protein-coding potential of transcripts using sequence features and support vector machine

Lei Kong et al.

NUCLEIC ACIDS RESEARCH (2007)

Article Biochemistry & Molecular Biology

Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes

A Siepel et al.

GENOME RESEARCH (2005)

Article Biochemistry & Molecular Biology

Cytoprotective peptide humanin binds and inhibits proapoptotic Bcl-2/Bax family protein BimEL

F Luciano et al.

JOURNAL OF BIOLOGICAL CHEMISTRY (2005)

Article Multidisciplinary Sciences

Reinitiation involving upstream ORFs regulates ATF4 mRNA translation in mammalian cells

KM Vattem et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2004)

Article Biochemistry & Molecular Biology

The two upstream open reading frames of oncogene mdm2 have different translational regulatory properties

XP Jin et al.

JOURNAL OF BIOLOGICAL CHEMISTRY (2003)

Review Multidisciplinary Sciences

Initial sequencing and analysis of the human genome

ES Lander et al.

NATURE (2001)

Article Biochemistry & Molecular Biology

Regulated translation initiation controls stress-induced gene expression in mammalian cells

HP Harding et al.

MOLECULAR CELL (2000)