4.6 Article

Predicting coding potential from genome sequence: Application to betaherpesviruses infecting rats and mice

Journal

JOURNAL OF VIROLOGY
Volume 79, Issue 12, Pages 7570-7596

Publisher

AMER SOC MICROBIOLOGY
DOI: 10.1128/JVI.79.12.7570-7596.2005

Keywords

-

Categories

Funding

  1. NIAID NIH HHS [R01 AI030363, R01AI30363] Funding Source: Medline
  2. NIGMS NIH HHS [2 R01 GM010452, R01 GM010452] Funding Source: Medline
  3. PHS HHS [R01A120211] Funding Source: Medline

Ask authors/readers for more resources

Prediction of protein-coding regions and other features of primary DNA sequence have greatly contributed to experimental biology. Significant challenges remain in genome annotation methods, including the identification of small or overlapping genes and the assessment of mRNA splicing or unconventional translation signals in expression. We have employed a combined analysis of compositional biases and conservation together with frame-specific G + C representation to reevaluate and annotate the genome sequences of mouse and rat cytomegaloviruses. Our analysis predicts that there are at least 34 protein-coding regions in these genomes that were not apparent in earlier annotation efforts. These include 17 single-exon genes, three new exons of previously identified genes, a newly identified four-exon gene for a lectin-like protein (in rat cytomegalovirus), and 10 probable frameshift extensions of previously annotated genes. This expanded set of candidate genes provides an additional basis for investigation in cytornegallovirus biology and pathogenesis.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available