4.0 Article

Broadly sampled orthologous groups of eukaryotic proteins for the phylogenetic study of plastid-bearing lineages

期刊

BMC RESEARCH NOTES
卷 14, 期 1, 页码 -

出版社

SPRINGERNATURE
DOI: 10.1186/s13104-021-05553-4

关键词

Orthology; Phylogenomics; Algae; CASH; Proteomes; Eukaryotic evolution; Contamination; Organelles; Endosymbiotic gene transfer (EGT); Horizontal or lateral gene transfer (HGT; LGT); Kleptoplasty

资金

  1. University of Liege [SFRD-12/04]
  2. FRS-FNRS [CDR J.0080.15]

向作者/读者索取更多资源

Identifying orthology relationships among sequences is crucial for understanding evolution, diversity of life, and ancestral relationships among organisms. This study assembled a dataset tailored for phylogenomic research on algae and photosynthetic eukaryotes, integrating organellar data and distinguishing between gene paralogs. Through OrthoFinder analysis, top-quality and taxonomically diverse proteomes were used to identify orthogroups, functionally annotated with eggNOG, and aligned for phylogenetic tree computation, with a focus on plastid-bearing organisms.
Objectives Identifying orthology relationships among sequences is essential to understand evolution, diversity of life and ancestry among organisms. To build alignments of orthologous sequences, phylogenomic pipelines often start with all-vs-all similarity searches, followed by a clustering step. For the protein clusters (orthogroups) to be as accurate as possible, proteomes of good quality are needed. Here, our objective is to assemble a data set especially suited for the phylogenomic study of algae and formerly photosynthetic eukaryotes, which implies the proper integration of organellar data, to enable distinguishing between several copies of one gene (paralogs), taking into account their cellular compartment, if necessary. Data description We submitted 73 top-quality and taxonomically diverse proteomes to OrthoFinder. We obtained 47,266 orthogroups and identified 11,775 orthogroups with at least two algae. Whenever possible, sequences were functionally annotated with eggNOG and tagged after their genomic and target compartment(s). Then we aligned and computed phylogenetic trees for the orthogroups with IQ-TREE. Finally, these trees were further processed by identifying and pruning the subtrees exclusively composed of plastid-bearing organisms to yield a set of 31,784 clans suitable for studying photosynthetic organism genome evolution.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.0
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据