4.7 Article

Genome reannotation of the lizard Anolis carolinensis based on 14 adult and embryonic deep transcriptomes

Journal

BMC GENOMICS
Volume 14, Issue -, Pages -

Publisher

BMC
DOI: 10.1186/1471-2164-14-49

Keywords

Annotation; Lizard; Anolis carolinensis; Transcriptome; Genome; RNA-Seq; Gene; Vertebrate; Embryo; Tissue-specific

Funding

  1. National Center for Research Resources [R21 RR031305]
  2. Office of Research Infrastructure Programs (ORIP) of the National Institutes of Health [R21 RR031305]
  3. Arizona Biomedical Research Commission [1113]
  4. Extreme Science and Engineering Discovery Environment (XSEDE)
  5. Arizona State University Advanced Computing Center (A2C2)
  6. NHGRI

Ask authors/readers for more resources

Background: The green anole lizard, Anolis carolinensis, is a key species for both laboratory and field-based studies of evolutionary genetics, development, neurobiology, physiology, behavior, and ecology. As the first non-avian reptilian genome sequenced, A. carolinesis is also a prime reptilian model for comparison with other vertebrate genomes. The public databases of Ensembl and NCBI have provided a first generation gene annotation of the anole genome that relies primarily on sequence conservation with related species. A second generation annotation based on tissue-specific transcriptomes would provide a valuable resource for molecular studies. Results: Here we provide an annotation of the A. carolinensis genome based on de novo assembly of deep transcriptomes of 14 adult and embryonic tissues. This revised annotation describes 59,373 transcripts, compared to 16,533 and 18,939 currently for Ensembl and NCBI, and 22,962 predicted protein-coding genes. A key improvement in this revised annotation is coverage of untranslated region (UTR) sequences, with 79% and 59% of transcripts containing 5' and 3' UTRs, respectively. Gaps in genome sequence from the current A. carolinensis build (Anocar2.0) are highlighted by our identification of 16,542 unmapped transcripts, representing 6,695 orthologues, with less than 70% genomic coverage. Conclusions: Incorporation of tissue-specific transcriptome sequence into the A. carolinensis genome annotation has markedly improved its utility for comparative and functional studies. Increased UTR coverage allows for more accurate predicted protein sequence and regulatory analysis. This revised annotation also provides an atlas of gene expression specific to adult and embryonic tissues.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available