4.8 Article

Genome annotation with long RNA reads reveals new patterns of gene expression and improves single-cell analyses in an ant brain

Journal

BMC BIOLOGY
Volume 19, Issue 1, Pages -

Publisher

BMC
DOI: 10.1186/s12915-021-01188-w

Keywords

Iso-Seq; Long-read RNA-seq; Harpegnathos saltator; Ants; Genome annotation; 3 ' UTR annotation; Single-cell sequencing; Alternative splicing

Categories

Funding

  1. NIH [R21MH123841, R01AG071818, R01AG058762]
  2. Max Planckvon Humboldt Research Award
  3. Naito Foundation Grant for Studying Overseas
  4. Human Frontier Science Program [LT000010/2020-L]

Ask authors/readers for more resources

The study utilized Iso-Seq to comprehensively annotate the transcriptome of the ant Harpegnathos saltator, resulting in additional splice isoforms and extended 3' untranslated regions for more than 4000 genes. This improved annotation allowed for more accurate identification of cell type markers in the ant brain and discovery of genes differentially expressed across castes in specific cell types. The findings highlight the efficiency and effectiveness of Iso-Seq in enhancing genome annotations and maximizing information obtained from genomic datasets in various organisms.
Background: Functional genomic analyses rely on high-quality genome assemblies and annotations. Highly contiguous genome assemblies have become available for a variety of species, but accurate and complete annotation of gene models, inclusive of alternative splice isoforms and transcription start and termination sites, remains difficult with traditional approaches. Results: Here, we utilized full-length isoform sequencing (Iso-Seq), a long-read RNA sequencing technology, to obtain a comprehensive annotation of the transcriptome of the ant Harpegnathos saltator. The improved genome annotations include additional splice isoforms and extended 3' untranslated regions for more than 4000 genes. Reanalysis of RNA-seq experiments using these annotations revealed several genes with caste-specific differential expression and tissue- or caste-specific splicing patterns that were missed in previous analyses. The extended 3' untranslated regions afforded great improvements in the analysis of existing single-cell RNA-seq data, resulting in the recovery of the transcriptomes of 18% more cells. The deeper single-cell transcriptomes obtained with these new annotations allowed us to identify additional markers for several cell types in the ant brain, as well as genes differentially expressed across castes in specific cell types. Conclusions: Our results demonstrate that Iso-Seq is an efficient and effective approach to improve genome annotations and maximize the amount of information that can be obtained from existing and future genomic datasets in Harpegnathos and other organisms.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available