Article

Family of skewed distributions associated with the gene expression and proteome evolution

Journal

SIGNAL PROCESSING
Volume 83, Issue 4, Pages 889-910

Publisher

ELSEVIER
DOI: 10.1016/S0165-1684(02)00481-4

Keywords

gene expression; protein domains; evolution; birth-death stochastic processes; Pareto distribution; Waring distribution

We study statistical distributions appearing in various genome-related phenomena, including the distribution of the transcript copy number in the transcriptome of eukaryotic cells and the distribution of the number of proteins containing a protein domain in proteomes of species. We found that the empirical distributions for all studied data sets are well fitted by a family of Pareto-like distribution functions whose shape depends in a predictable manner on the sample size. Such distributions are generated as limiting distributions in a Markov random process where the birth and death intensities are linear functions of events. We also propose a novel model of progressive evolution of a population in terms of the increase of the numbers of distinct components and their links in the system and we study evolution of the probability distribution of these links. Estimating two unknown parameters of this model allows us to describe the progressive evolution of the number of distinct protein domain sets and the number of proteins containing a given protein domain in the proteomes of 70 fully sequenced genome organisms. This model also predicts trends in proteome complexity evolution. Published by Elsevier Science B.V.
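
As an illustration of how a birth-death process with linear intensities can produce a Pareto-like limiting distribution, the following is a minimal sketch and not the authors' model. It assumes a hypothetical parametrization that the abstract does not give: birth rate lam * (k + a) and death rate mu * (k + b) in state k, with theta = lam / mu, restricted to states 1..k_max. The stationary probabilities follow from the standard detailed-balance relation for birth-death chains, p[k+1] * death(k+1) = p[k] * birth(k). The names `stationary_distribution` and `tail_slope` are illustrative only.

```python
import math


def stationary_distribution(a, b, theta, k_max):
    """Stationary distribution on states 1..k_max of a birth-death chain.

    Assumed (hypothetical) rates: birth(k) = lam * (k + a),
    death(k) = mu * (k + b), with theta = lam / mu.
    Element i of the returned list is the probability of state i + 1.
    """
    weights = [1.0]  # unnormalised weight of state k = 1
    for k in range(1, k_max):
        # Detailed balance: p[k+1] * death(k+1) = p[k] * birth(k)
        weights.append(weights[-1] * theta * (k + a) / (k + 1 + b))
    total = sum(weights)
    return [w / total for w in weights]


def tail_slope(p, k1, k2):
    """Log-log slope of the distribution between states k1 and k2."""
    return math.log(p[k2 - 1] / p[k1 - 1]) / math.log(k2 / k1)


if __name__ == "__main__":
    # With theta = 1 the tail decays roughly as k ** -(1 + b - a).
    p = stationary_distribution(a=0.5, b=1.5, theta=1.0, k_max=5000)
    print("estimated tail exponent:", -tail_slope(p, 1000, 2000))
```

Under these assumptions, theta = 1 gives a power-law tail with exponent close to 1 + b - a, while theta < 1 adds a geometric cutoff. This is one simple way to see how the shape of such a family can vary with its parameters.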
