4.6 Article Proceedings Paper

Toward a more holistic method of genome assembly assessment

Journal

BMC BIOINFORMATICS
Volume 21, Issue -, Pages -

Publisher

BMC
DOI: 10.1186/s12859-020-3382-4

Keywords

Genome assembly; Contiguity; Completeness; Correctness; N50

Funding

  1. USDA Agricultural Research Service [58-6066-6-046, 58-6066-6-059]
  2. USDA National Institute of Food and Agriculture Multi-State Hatch project [17810]
  3. National Science Foundation [DEB-1354147]
  4. EPSCoR RII Track-2 FEC [1736026]
  5. Mississippi State University

Ask authors/readers for more resources

Background A key use of high throughput sequencing technology is the sequencing and assembly of full genome sequences. These genome assemblies are commonly assessed using statistics relating to contiguity of the assembly. Measures of contiguity are not strongly correlated with information about the biological completion or correctness of the assembly, and a commonly reported metric, N50, can be misleading. Over the years, multiple research groups have rejected the overuse of N50 and sought to develop more informative metrics. Results This paper presents a review of problems that arise from relying solely on contiguity as a measure of genome assembly quality as well as current alternative methods. Alternative methods are compared on the basis of how informative they are about the biological quality of the assembly and how easy they are to use. A comprehensive method for using multiple metrics of measuring assembly quality is presented. Conclusions This study aims to report on the status of assembly assessment methods and compare them, as well as to offer a comprehensive method that incorporates multiple facets of quality assessment. Weaknesses and strengths of varying methods are presented and explained, with recommendations based on speed of analysis and user friendliness.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available