Article

Phased diploid genome assembly with single-molecule real-time sequencing

Journal

NATURE METHODS
Volume 13, Issue 12, Pages 1050-+

Publisher

NATURE PORTFOLIO
DOI: 10.1038/NMETH.4035

Funding

  1. National Institutes of Health [R01-HG006677]
  2. National Science Foundation [DBI-1350041, IOS-1237880, MCB-0929402, MCB-1122246]
  3. J. Lohr Vineyards and Wines
  4. National Science Foundation, Directorate for Biological Sciences
  5. National Science Foundation, Division of Biological Infrastructure [1627442]

Abstract

While genome assembly projects have been successful in many haploid and inbred species, the assembly of noninbred or rearranged heterozygous genomes remains a major challenge. To address this challenge, we introduce the open-source FALCON and FALCON-Unzip algorithms (https://github.com/PacificBiosciences/FALCON/) to assemble long-read sequencing data into highly accurate, contiguous, and correctly phased diploid genomes. We generate new reference sequences for heterozygous samples, including an F1 hybrid of Arabidopsis thaliana, the widely cultivated Vitis vinifera cv. Cabernet Sauvignon, and the coral fungus Clavicorona pyxidata, all samples that have challenged short-read assembly approaches. The FALCON-based assemblies are substantially more contiguous and complete than alternative short- or long-read approaches. The phased diploid assembly enabled the study of haplotype structure and heterozygosities between homologous chromosomes, including the identification of widespread heterozygous structural variation within coding sequences.
