4.2 Article

How many protein-coding genes are there in the Saccharomyces cerevisiae genome?

期刊

YEAST
卷 19, 期 7, 页码 619-629

出版社

WILEY
DOI: 10.1002/yea.865

关键词

Saccharomyces cerevisiae; gene number; hypothetical ORFs; questionable ORFs; coding probability; smORFs

向作者/读者索取更多资源

We have compared the results of estimations of the total number of protein-coding genes in the Saccharomyces cerevisiae genome, which have been obtained by many laboratories since the yeast genome sequence was published in 1996. We propose that there are 5300-5400 genes in the genome. This makes the first estimation of the number of intronless ORFs longer than 100 codons, based on the features of the set of genes with phenotypes known in 1997 to be correct. This estimation assumed that the set of the first 2300 genes with known phenotypes was representative for the whole set of protein-coding genes in the genome. The same method used in this paper for the approximation of the total number of protein-coding sequences among more than 40 000 ORFs longer than 20 codons gives a result that is only slightly higher. This suggests that there are still some non-coding ORFs in the databases and a few dozen small ORFs, not yet annotated, which probably code for proteins. Copyright (C) 2002 John Wiley Sons, Ltd.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.2
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据