4.7 Article

Identifying bacterial genes and endosymbiont DNA with Glimmer

Journal

BIOINFORMATICS
Volume 23, Issue 6, Pages 673-679

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btm009

Keywords

-

Funding

  1. NIGMS NIH HHS [R01 GM083873] Funding Source: Medline
  2. NLM NIH HHS [R01 LM007938-04, R01-LM006845, R01 LM006845, R01-LM007938, R01 LM007938, R01 LM006845-08] Funding Source: Medline
  3. PHS HHS [HHSN266200400038C] Funding Source: Medline

Ask authors/readers for more resources

Motivation: The Glimmer gene-finding software has been successfully used for finding genes in bacteria, arch ae a and viruses representing hundreds of species. We describe several major changes to the Glimmer system, including improved methods for identifying both coding regions and start codons. We also describe a new module of Glimmer that can distinguish host and endosymbiont DNA. This module was developed in response to the discovery that eukaryotic genome sequencing projects sometimes inadvertently capture the DNA of intracellular bacteria living in the host. Results: The new methods dramatically reduce the rate of false-positive predictions, while maintaining Glimmer's 99% sensitivity rate at detecting genes in most species, and they find substantially more correct start sites, as measured by comparisons to known and well-curated genes. We show that our interpolated Markov model (IMM) DNA discriminator correctly separated 99% of the sequences in a recent genome project that produced a mixture of sequences from the bacterium Prochloron didemni and its sea squirt host, Lissoclinum patella.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available