4.7 Article

mebipred: identifying metal-binding potential in protein sequence

期刊

BIOINFORMATICS
卷 38, 期 14, 页码 3532-3540

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btac358

关键词

-

资金

  1. National Aeronautics and Space Administration (NASA) Astrobiology Institute [80NSSC18M0093]
  2. National Science Foundation (NSF) CAREER award [1553289]
  3. National Institutes of Health (NIH) [R01 GM115486]
  4. European Research Council (ERC) under the European Union [948972]
  5. European Research Council (ERC) [948972] Funding Source: European Research Council (ERC)

向作者/读者索取更多资源

Metal-binding proteins play a central role in maintaining life processes, but experimental annotation is lacking and computational techniques are often imprecise. In this study, we developed a machine learning-based method, mebipred, which accurately identifies metal-binding proteins and annotates metal ions. Mebipred can also identify metal-binding capabilities from short sequence stretches, making it useful for metagenomic samples. Analysis of microbiome data revealed different metal preferences and the impact of ion concentration changes on metal-binding protein abundance.
Motivation: metal-binding proteins have a central role in maintaining life processes. Nearly one-third of known protein structures contain metal ions that are used for a variety of needs, such as catalysis, DNA/RNA binding, protein structure stability, etc. Identifying metal-binding proteins is thus crucial for understanding the mechanisms of cellular activity. However, experimental annotation of protein metal-binding potential is severely lacking, while computational techniques are often imprecise and of limited applicability. Results: we developed a novel machine learning-based method, mebipred, for identifying metal-binding proteins from sequence-derived features. This method is over 80% accurate in recognizing proteins that bind metal ion-containing ligands; the specific identity of 11 ubiquitously present metal ions can also be annotated. mebipred is reference-free, i.e. no sequence alignments are involved, and is thus faster than alignment-based methods; it is also more accurate than other sequence-based prediction methods. Additionally, mebipred can identify protein metal-binding capabilities from short sequence stretches, e.g. translated sequencing reads, and, thus, may be useful for the annotation of metal requirements of metagenomic samples. We performed an analysis of available microbiome data and found that ocean, hot spring sediments and soil microbiomes use a more diverse set of metals than human host-related ones. For human microbiomes, physiological conditions explain the observed metal preferences. Similarly, subtle changes in ocean sample ion concentration affect the abundance of relevant metal-binding proteins. These results highlight mebipred's utility in analyzing microbiome metal requirements.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据