4.3 Article

Improvement of the Threespine Stickleback Genome Using a Hi-C-Based Proximity-Guided Assembly

Journal

JOURNAL OF HEREDITY
Volume 108, Issue 6, Pages 693-700

Publisher

OXFORD UNIV PRESS INC
DOI: 10.1093/jhered/esx058

Keywords

chromosome conformation capture; de novo genome assembly; Gasterosteus aculeatus

Funding

  1. Evolutionary, Ecological, or Conservation Genomics Research Award from the American Genetic Association
  2. Office of the Vice President of Research at the University of Georgia
  3. National Institutes of Health [R01 GM116853]
  4. Fred Hutchinson Cancer Research Center Division of Basic Sciences

Ask authors/readers for more resources

Scaffolding genomes into complete chromosome assemblies remains challenging even with the rapidly increasing sequence coverage generated by current next-generation sequence technologies. Even with scaffolding information, many genome assemblies remain incomplete. The genome of the threespine stickleback (Gasterosteus aculeatus), a fish model system in evolutionary genetics and genomics, is not completely assembled despite scaffolding with high-density linkage maps. Here, we first test the ability of a Hi-C based proximity-guided assembly (PGA) to perform a de novo genome assembly from relatively short contigs. Using Hi-C based PGA, we generated complete chromosome assemblies from a distribution of short contigs (20-100 kb). We found that 96.40% of contigs were correctly assigned to linkage groups (LGs), with ordering nearly identical to the previous genome assembly. Using available bacterial artificial chromosome (BAC) end sequences, we provide evidence that some of the few discrepancies between the Hi-C assembly and the existing assembly are due to structural variation between the populations used for the 2 assemblies or errors in the existing assembly. This Hi-C assembly also allowed us to improve the existing assembly, assigning over 60% (13.35 Mb) of the previously unassigned (similar to 21.7 Mb) contigs to LGs. Together, our results highlight the potential of the Hi-C based PGA method to be used in combination with short read data to perform relatively inexpensive de novo genome assemblies. This approach will be particularly useful in organisms in which it is difficult to perform linkage mapping or to obtain high molecular weight DNA required for other scaffolding methods.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available