4.2 Review

Using comparative genome analysis to identify problems in annotated microbial genomes

Journal

MICROBIOLOGY-SGM
Volume 156, Issue -, Pages 1909-1917

Publisher

SOC GENERAL MICROBIOLOGY
DOI: 10.1099/mic.0.033811-0

Keywords

-

Categories

Funding

  1. NASA Exobiology [NNX08AQ10G]
  2. Applied Information System Research Program [NNG04GP90G]
  3. NASA [NNX08AQ10G, 95541] Funding Source: Federal RePORTER
  4. Direct For Biological Sciences
  5. Division Of Environmental Biology [0830024] Funding Source: National Science Foundation

Ask authors/readers for more resources

Genome annotation is a tedious task that is mostly done by automated methods; however, the accuracy of these approaches has been questioned since the beginning of the sequencing era. Genome annotation is a multilevel process, and errors can emerge at different stages: during sequencing, as a result of gene-calling procedures, and in the process of assigning gene functions. Missed or wrongly annotated genes differentially impact different types of analyses. Here we discuss and demonstrate how the methods of comparative genome analysis can refine annotations by locating missing orthologues. We also discuss possible reasons for errors and show that the second-generation annotation systems, which combine multiple gene-calling programs with similarity-based methods, perform much better than the first annotation tools. Since old errors may propagate to the newly sequenced genomes, we emphasize that the problem of continuously updating popular public databases is an urgent and unresolved one. Due to the progress in genome-sequencing technologies, automated annotation techniques will remain the main approach in the future. Researchers need to be aware of the existing errors in the annotation of even well-studied genomes, such as Escherichia coli, and consider additional quality control for their results.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.2
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available