4.7 Article

Genome-wide identification of Reverse Transcriptase domains of recently inserted endogenous plant pararetrovirus (Caulimoviridae)

Journal

FRONTIERS IN PLANT SCIENCE
Volume 13, Issue -, Pages -

Publisher

FRONTIERS MEDIA SA
DOI: 10.3389/fpls.2022.1011565

Keywords

pararetrovirus; Reverse Transcriptase (RT); Caulimoviridae; endogenous; virus

Endogenous pararetroviruses (EPRVs) are viral sequences integrated into plant genomes and derived from DNA viruses of the family Caulimoviridae. This study analyzed a large number of plant genome sequences and identified 57 clusters and 13 genera of EPRVs, including a newly proposed genus called Wendovirus. The abundance and distribution of EPRVs differ markedly among plant families and genera, with florendoviruses being the most abundant and widespread.
Endogenous viral elements (EVEs) are viral sequences that have been integrated into nuclear chromosomes. Endogenous pararetroviruses (EPRVs) are a class of EVEs derived from DNA viruses of the family Caulimoviridae. Previous work based on a limited number of genome assemblies demonstrated that EPRVs are abundant in plants and are present in several species. The availability of genome sequences has increased immensely in recent years, and we took advantage of these resources to obtain a more extensive view of the presence of EPRVs in plant genomes. We analyzed 278 genome assemblies corresponding to 267 species (254 from Viridiplantae) using tBLASTn against a collection of conserved domains of the Reverse Transcriptases (RT) of Caulimoviridae. We concentrated our search on complete and well-conserved RT domains with an uninterrupted ORF encoding at least 300 amino acids. We obtained 11,527 sequences from the genomes of 202 species spanning the whole Tracheophyta clade. These elements were grouped into 57 clusters and classified into 13 genera, including a newly proposed genus we called Wendovirus. Wendoviruses are characterized by the presence of four open reading frames, two of which encode aspartic proteinases. Comparing plant genomes, we observed important differences between plant families and genera in the number and type of EPRVs found. In general, florendoviruses are the most abundant and widely distributed EPRVs. The presence of multiple identical RT domain sequences in some of the genomes suggests their recent amplification.
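
The filtering criterion described in the abstract (an uninterrupted ORF of at least 300 amino acids) and the observation about identical RT copies can be illustrated with a minimal Python sketch. The abstract does not describe the authors' actual pipeline, so everything here is an assumption made for illustration: the function names (`translate`, `is_uninterrupted_orf`, `count_identical_copies`) are hypothetical, the standard genetic code is used, and each input is assumed to be a nucleotide hit region already extracted in the correct reading frame from tBLASTn coordinates.

```python
from collections import Counter

# Standard genetic code, built from the conventional TCAG ordering.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in _BASES for b in _BASES for c in _BASES), _AMINO
    )
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; unknown codons become 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


def is_uninterrupted_orf(dna: str, min_aa: int = 300) -> bool:
    """True if the translation has no stop codon and at least min_aa residues.

    min_aa=300 mirrors the threshold stated in the abstract; how the authors
    applied it in practice is not specified there.
    """
    protein = translate(dna)
    return len(protein) >= min_aa and "*" not in protein


def count_identical_copies(sequences: list[str]) -> dict[str, int]:
    """Return sequences that occur more than once, with their copy counts."""
    counts = Counter(seq.upper() for seq in sequences)
    return {seq: n for seq, n in counts.items() if n > 1}
```

For example, a hit region of 300 alanine codons passes the filter, while the same region with an internal `TAA` stop codon does not. Counting exact duplicates among the retained sequences is one simple way to flag candidate recently amplified elements.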
