4.7 Article

Deep landscape update of dispersed and tandem repeats in the genome model of the red jungle fowl, Gallus gallus, using a series of de novo investigating tools

期刊

BMC GENOMICS
卷 17, 期 -, 页码 -

出版社

BMC
DOI: 10.1186/s12864-016-3015-5

关键词

Satellite DNA; Transposable elements; Bioinformatics; Benchmarking; Repeat

资金

  1. Region Centre Val de Loire (AviGeS Project)
  2. C.N.R.S.
  3. I.N.R.A.
  4. Ministere de l'Education Nationale, de la Recherche et de la Technologie
  5. I.N.R.A. (PHASE department)/Region Centre Val de Loire
  6. Universitary PRES Centre Val de Loire
  7. STUDIUM
  8. Groupement de Recherche CNRS 3546
  9. Groupement de Recherche CNRS 3604

向作者/读者索取更多资源

Background: The program RepeatMasker and the database Repbase-ISB are part of the most widely used strategy for annotating repeats in animal genomes. They have been used to show that avian genomes have a lower repeat content (8-12 %) than the sequenced genomes of many vertebrate species (30-55 %). However, the efficiency of such a library-based strategies is dependent on the quality and completeness of the sequences in the database that is used. An alternative to these library based methods are methods that identify repeats de novo. These alternative methods have existed for a least a decade and may be more powerful than the library based methods. We have used an annotation strategy involving several complementary de novo tools to determine the repeat content of the model genome galGal4 (1.04 Gbp), including identifying simple sequence repeats (SSRs), tandem repeats and transposable elements (TEs). Results: We annotated over one Gbp. of the galGal4 genome and showed that it is composed of approximately 19 % SSRs and TEs repeats. Furthermore, we estimate that the actual genome of the red jungle fowl contains about 31-35 % repeats. We find that library-based methods tend to overestimate TE diversity. These results have a major impact on the current understanding of repeats distributions throughout chromosomes in the red jungle fowl. Conclusions: Our results are a proof of concept of the reliability of using de novo tools to annotate repeats in large animal genomes. They have also revealed issues that will need to be resolved in order to develop gold-standard methodologies for annotating repeats in eukaryote genomes.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据