4.8 Article

A comprehensive vertebrate phylogeny using vector representations of protein sequences from whole genomes

期刊

MOLECULAR BIOLOGY AND EVOLUTION
卷 19, 期 4, 页码 554-562

出版社

OXFORD UNIV PRESS
DOI: 10.1093/oxfordjournals.molbev.a004111

关键词

genomics; mitochondrial DNA; molecular phylogenetics; molecular systematics; sequence analysis; singular value decomposition

向作者/读者索取更多资源

We recently developed a method for producing comprehensive gene and species phylogenies from unaligned whole,genome data using singular value decomposition (SVD) to analyze character string frequencies. This work provides an integrated gene and species phylogeny for 64 vertebrate mitochondrial genomes composed of 832 total proteins. In addition, to provide a theoretical basis for the method, we present a graphical interpretation of both the original frequency matrix and the SVD-derived matrix. These large matrices describe high-dimensional Euclidean spaces within which biomolecular sequences can be uniquely represented as vectors. In particular, the SVD-derived vector space describes each protein relative to a restricted set of newly defined, independent axes, each of which represents a novel form of conserved motif, termed a correlated peptide motif. A quantitative comparison of the relative orientations of protein vectors in this space provides accurate and straightforward estimates of sequence similarity, which can in turn be used to produce comprehensive gene trees. Alternatively. the vector representations of genes from individual species can be summed, allowing species trees to be produced.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据