4.1 Article

Construction of non-symmetric substitution matrices derived from proteomes with biased amino acid distributions

期刊

COMPTES RENDUS BIOLOGIES
卷 328, 期 5, 页码 445-453

出版社

centre Mersenne pour ldition scientifique ouverte
DOI: 10.1016/j.crvi.2005.02.002

关键词

substitution matrix; BLOSUM; biased genome; Plasmodium falciparum; information theory; mutual information

类别

向作者/读者索取更多资源

Automatic comparison of compositionally biased genomes, such as that of the malarial causative agent Plasmodium falciparum (82% adenosine + thymidine), with genomes of average composition, is currently limited. Indeed, popular tools such as BLAST require that amino acid distributions be similar in aligned sequences. However, the P. falciparum genome is so biased that six amino acids account for more than 50% of the protein composition. One reason for the comparison methods failure lies in the compositional difference between the query and the subject proteomes, which is not taken into account in the amino acid substitution matrices. This paper introduces a method to derive substitution matrices, in particular BLOSUM 62, in the frame of the information theory. It allows the construction of non-symmetrical matrices, taking into account the non-symmetric amino acid distributions. The dirAtPf family of matrices allowing the comparison of P. falciparum and A. thaliana is given as an example. This paper further provides an analysis of the obtained matrices in the frame of the information theory, supporting the discrimination advantage they bring. (c) 2005 Academie des sciences. Published by Elsevier SAS. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.1
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据