4.6 Article

Use of 6 Nucleotide Length Words to Study the Complexity of Gene Sequences from Different Organisms

期刊

ENTROPY
卷 24, 期 5, 页码 -

出版社

MDPI
DOI: 10.3390/e24050632

关键词

cds; genome; bacteria; plants; metazoa; Gini coefficient

资金

  1. Ministry of Science and Higher Education of the Russian Federation [075-15-2021-1071]

向作者/读者索取更多资源

This paper examines the irregularity of 6-base occurrences in bacterial genomes using a probabilistic mathematical method, and reveals that bacterial genomes can be divided into two groups, with the coding sequences in the second group being less susceptible to evolutionary changes than those in the first group.
In this paper, we attempted to find a relation between bacteria living conditions and their genome algorithmic complexity. We developed a probabilistic mathematical method for the evaluation of k-words (6 bases length) occurrence irregularity in bacterial gene coding sequences. For this, the coding sequences from different bacterial genomes were analyzed and as an index of k-words occurrence irregularity, we used W, which has a distribution similar to normal. The research results for bacterial genomes show that they can be divided into two uneven groups. First, the smaller one has W in the interval from 170 to 475, while for the second it is from 475 to 875. Plants, metazoan and virus genomes also have W in the same interval as the first bacterial group. We suggested that second bacterial group coding sequences are much less susceptible to evolutionary changes than the first group ones. It is also discussed to use the W index as a biological stress value.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据