4.5 Article

Absent from DNA and protein: genomic characterization of nullomers and nullpeptides across functional categories and evolution

期刊

GENOME BIOLOGY
卷 22, 期 1, 页码 -

出版社

BMC
DOI: 10.1186/s13059-021-02459-z

关键词

Nullomers; Nullpeptides; Primes; Negative selection; Transposable elements; Human population; Phylogenetics

资金

  1. National Human Genome Research Institute [1UM1HG009408, R01HG010333]
  2. National Institute of Mental Health [1R01MH109907, 1U01MH116438]
  3. National Heart Lung and Blood Institute [R35HL145235]
  4. Wellcome Trust

向作者/读者索取更多资源

This study identifies nullomers and nullpeptides in different organisms, showing that specific functional categories may be more likely to be selected against, and that nullpeptides can be used as markers to distinguish different human populations and for phylogenetic classification.
Nullomers and nullpeptides are short DNA or amino acid sequences that are absent from a genome or proteome, respectively. One potential cause for their absence could be their having a detrimental impact on an organism. Results Here, we identify all possible nullomers and nullpeptides in the genomes and proteomes of thirty eukaryotes and demonstrate that a significant proportion of these sequences are under negative selection. We also identify nullomers that are unique to specific functional categories: coding sequences, exons, introns, 5 ' UTR, 3 ' UTR, promoters, and show that coding sequence and promoter nullomers are most likely to be selected against. By analyzing all protein sequences across the tree of life, we further identify 36,081 peptides up to six amino acids in length that do not exist in any known organism, termed primes. We next characterize all possible single base pair mutations that can lead to the appearance of a nullomer in the human genome, observing a significantly higher number of mutations than expected by chance for specific nullomer sequences in transposable elements, likely due to their suppression. We also annotate nullomers that appear due to naturally occurring variants and show that a subset of them can be used to distinguish between different human populations. Analysis of nullomers and nullpeptides across vertebrate evolution shows they can also be used as phylogenetic classifiers. Conclusions We provide a catalog of nullomers and nullpeptides in distinct functional categories, develop methods to systematically study them, and highlight the use of variability in these sequences in other analyses

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据