4.6 Article

Rascaf: Improving Genome Assembly with RNA Sequencing Data

Journal

PLANT GENOME
Volume 9, Issue 3, Pages -

Publisher

CROP SCIENCE SOC AMER
DOI: 10.3835/plantgenome2016.03.0027

Keywords

-

Funding

  1. National Science Foundation [IOS-1339134]
  2. Division Of Integrative Organismal Systems
  3. Direct For Biological Sciences [1339134] Funding Source: National Science Foundation

Ask authors/readers for more resources

Abundant but short second-generation sequencing reads make assembly difficult, leading to fragmented genomes and gene annotations. Gene structure information from RNA sequences can be used to improve the completeness and contiguity of an assembly, but bioinformatics methods have been lacking. Rascaf is a highly efficient tool leveraging long-range continuity information from intron spanning RNA sequencing (RNA-seq) read pairs to detect new contig connections. It determines a heaviest path in an exon block graph that simultaneously represents a gene and the underlying contig relationships. Rascaf is more accurate than its competitors, highly precise, and finds thousands of new verifiable connections in several draft Rosaceae genomes. Lightweight and practical, it can be readily incorporated into sequencing pipelines to improve an assembly and its gene annotations.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available