4.8 Article

Automated correction of genome sequence errors

Journal

NUCLEIC ACIDS RESEARCH
Volume 32, Issue 2, Pages 562-569

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/nar/gkh216

Funding

  1. NIAID NIH HHS [N01AI15447] Funding Source: Medline
  2. NLM NIH HHS [R01 LM006845, R01-LM06845] Funding Source: Medline
  3. NATIONAL LIBRARY OF MEDICINE [R01LM006845] Funding Source: NIH RePORTER

By using information from an assembly of a genome, a new program called AutoEditor significantly improves base calling accuracy over that achieved by previous algorithms. This in turn improves the overall accuracy of genome sequences and facilitates the use of these sequences for polymorphism discovery. We describe the algorithm and its application in a large set of recent genome sequencing projects. The number of erroneous base calls in these projects was reduced by 80%. In an analysis of over one million corrections, we found that AutoEditor made just one error per 8828 corrections. By substantially increasing the accuracy of base calling, AutoEditor can dramatically accelerate the process of finishing genomes, which involves closing all gaps and ensuring minimum quality standards for the final sequence. It also greatly improves our ability to discover single nucleotide polymorphisms (SNPs) between closely related strains and isolates of the same species.
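The accuracy figures quoted above can be put on a common scale with a little arithmetic. The sketch below is only an illustration of those numbers, not part of AutoEditor: the function names are hypothetical, the Phred-style conversion (Q = -10 log10 p) is a standard base-calling convention that the abstract itself does not use, and the initial error count passed to `remaining_errors` is an arbitrary example value.

```python
import math

# Figures taken from the abstract.
CORRECTIONS_PER_ERROR = 8828   # one wrong correction per 8828 corrections
ERROR_REDUCTION = 0.80         # erroneous base calls reduced by 80%


def correction_error_rate(corrections_per_error):
    """Fraction of corrections that are themselves wrong."""
    return 1.0 / corrections_per_error


def phred_equivalent(error_rate):
    """Phred-style score for an error probability (assumed convention)."""
    return -10.0 * math.log10(error_rate)


def expected_miscorrections(n_corrections, corrections_per_error):
    """Expected number of wrong corrections among n_corrections."""
    return n_corrections / corrections_per_error


def remaining_errors(initial_errors, reduction):
    """Erroneous base calls left after a fractional reduction."""
    return initial_errors * (1.0 - reduction)


if __name__ == "__main__":
    rate = correction_error_rate(CORRECTIONS_PER_ERROR)
    print(f"miscorrection rate: {rate:.6%}")
    print(f"Phred-style equivalent: Q{phred_equivalent(rate):.1f}")
    print(f"expected miscorrections in 1,000,000: "
          f"{expected_miscorrections(1_000_000, CORRECTIONS_PER_ERROR):.0f}")
    # 1000 is a made-up starting count, used only to show the 80% reduction.
    print(f"errors left from 1000: {remaining_errors(1000, ERROR_REDUCTION):.0f}")
```

Read this way, the reported rate corresponds to roughly a hundred wrong corrections per million, which is why the corrections can be applied automatically without manual review of each one.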
