4.7 Article

PTML Model of Enzyme Subclasses for Mining the Proteome of Biofuel Producing Microorganisms

期刊

JOURNAL OF PROTEOME RESEARCH
卷 18, 期 7, 页码 2735-2746

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.jproteome.8b00949

关键词

microorganism proteome; enzymes; biofuel production; Spathaspora sp.; perturbation theory; machine learning

资金

  1. Fundacao para a Ciencia e a Tecnologia (FCT/MEC)
  2. European Union (FEDER funds) [UID/QUI/50006/2013, POCI/01/0145/FEDER/007265, SOE1/P1/E0215]
  3. FCT [SFRH/BPD/80605/2011]
  4. European Social Fund [SFRH/BPD/80605/2011]
  5. Ministry of Economy and Competitiveness, MINECO, Spain [FEDER CTQ2016-74881-P]
  6. Basque government [IT1045-16]
  7. IKERBASQUE, Basque Foundation for Sciences
  8. Fundação para a Ciência e a Tecnologia [SFRH/BPD/80605/2011] Funding Source: FCT

向作者/读者索取更多资源

Predicting enzyme function and enzyme subclasses is always a key objective in fields such as biotechnology, biochemistry, medicinal chemistry, physiology, and so on. The Protein Data Bank (PDB) is the largest information archive of biological macromolecular structures, with more than 150 000 entries for proteins, nucleic acids, and complex assemblies. Among these entries, there are more than 4000 proteins whose functions remain unknown because no detectable homology to proteins whose functions are known has been found. The problem is that our ability to isolate proteins and identify their sequences far exceeds our ability to assign them a defined function. As a result, there is a growing interest in this topic, and several methods have been developed to identify protein function based on these innovative approaches. In this work, we have applied perturbation theory to an original data set consisting of 19 187 enzymes representing all 59 subclasses present in the protein data bank. In addition, we developed a series of artificial neural network models able to predict enzyme-enzyme pairs of query-template sequences with accuracy, specificity, and sensitivity greater than 90% in both training and validation series. As a likely application of this methodology and to further validate our approach, we used our novel model to predict a set of enzymes belonging to the yeast Pichia stipites. This yeast has been widely studied because it is commonly present in nature and produces a high ethanol yield by converting lignocellulosic biomass into bioethanol through the xylose reductase enzyme. Using this premise, we tested our model on 222 enzymes including xylose reductase, that is, the enzyme responsible for the conversion of biomass into bioethanol.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据