4.7 Article

Measure representation and multifractal analysis of complete genomes

期刊

PHYSICAL REVIEW E
卷 64, 期 3, 页码 -

出版社

AMER PHYSICAL SOC
DOI: 10.1103/PhysRevE.64.031903

关键词

-

向作者/读者索取更多资源

This paper introduces the notion of measure representation of DNA sequences. Spectral analysis and multifractal analysis are then performed on the measure representations of a large number of complete genomes. The main aim of this paper is to discuss the multifractal property of the measure representation and the classification of bacteria. From the measure representations and the values of the D-q spectra and related C-q curves, it is concluded that these complete genomes are not random sequences. In fact, spectral analyses performed indicate that these measure representations, considered as time series, exhibit strong long-range correlation. Here the long-range correlation is for the K-strings with dictionary ordering, and it is different from the base pair correlations introduced by other people. For substrings with length K = 8, the D-q spectra of all organisms studied are multifractal-like and sufficiently smooth for the C-q curves to be meaningful. With the decreasing value of K, the multifractality lessens. The C-q curves of all bacteria resemble a classical phase transition at a critical point. But the analogous phase transitions of chromosomes of nonbacteria organisms are different. Apart from chromosome 1 of C. elegans, they exhibit the shape of double-peaked specific heat function. A classification of genomes of bacteria by assigning to each sequence a point in two-dimensional space (D-1, D-1) and in three-dimensional space (D-1,D-1 D-2) was given. Bacteria that are close phylogenetically are almost close in the spaces (D-1, D-1) and (D-1 D-1, D-2).

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据