4.7 Review

A simple guide to de novo transcriptome assembly and annotation

Journal

BRIEFINGS IN BIOINFORMATICS
Volume 23, Issue 2, Pages -

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbab563

Keywords

de novo; transcriptome; assembly; annotation; tools; RNA-seq

Funding

  1. SPP DECRyPT 2125 funding program

Ask authors/readers for more resources

A transcriptome constructed from short-read RNA sequencing is a proxy catalog of protein-coding genes. De novo assembly and annotation of the transcriptome can be challenging due to the lack of standardized workflows and authoritative literature.
A transcriptome constructed from short-read RNA sequencing (RNA-seq) is an easily attainable proxy catalog of protein-coding genes when genome assembly is unnecessary, expensive or difficult. In the absence of a sequenced genome to guide the reconstruction process, the transcriptome must be assembled de novo using only the information available in the RNA-seq reads. Subsequently, the sequences must be annotated in order to identify sequence-intrinsic and evolutionary features in them (for example, protein-coding regions). Although straightforward at first glance, de novo transcriptome assembly and annotation can quickly prove to be challenging undertakings. In addition to familiarizing themselves with the conceptual and technical intricacies of the tasks at hand and the numerous pre- and post-processing steps involved, those interested must also grapple with an overwhelmingly large choice of tools. The lack of standardized workflows, fast pace of development of new tools and techniques and paucity of authoritative literature have served to exacerbate the difficulty of the task even further. Here, we present a comprehensive overview of de novo transcriptome assembly and annotation. We discuss the procedures involved, including pre- and post-processing steps, and present a compendium of corresponding tools.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available