☆ 4.7 Article

PTML Model of Enzyme Subclasses for Mining the Proteome of Biofuel Producing Microorganisms

JOURNAL OF PROTEOME RESEARCH (2019)

期刊

JOURNAL OF PROTEOME RESEARCH

卷 18, 期 7, 页码 2735-2746

出版社

AMER CHEMICAL SOC

DOI: 10.1021/acs.jproteome.8b00949

关键词

microorganism proteome; enzymes; biofuel production; Spathaspora sp.; perturbation theory; machine learning

类别

Biochemical Research Methods

资金

Fundacao para a Ciencia e a Tecnologia (FCT/MEC)
European Union (FEDER funds) [UID/QUI/50006/2013, POCI/01/0145/FEDER/007265, SOE1/P1/E0215]
FCT [SFRH/BPD/80605/2011]
European Social Fund [SFRH/BPD/80605/2011]
Ministry of Economy and Competitiveness, MINECO, Spain [FEDER CTQ2016-74881-P]
Basque government [IT1045-16]
IKERBASQUE, Basque Foundation for Sciences
Fundação para a Ciência e a Tecnologia [SFRH/BPD/80605/2011] Funding Source: FCT

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Predicting enzyme function and enzyme subclasses is always a key objective in fields such as biotechnology, biochemistry, medicinal chemistry, physiology, and so on. The Protein Data Bank (PDB) is the largest information archive of biological macromolecular structures, with more than 150 000 entries for proteins, nucleic acids, and complex assemblies. Among these entries, there are more than 4000 proteins whose functions remain unknown because no detectable homology to proteins whose functions are known has been found. The problem is that our ability to isolate proteins and identify their sequences far exceeds our ability to assign them a defined function. As a result, there is a growing interest in this topic, and several methods have been developed to identify protein function based on these innovative approaches. In this work, we have applied perturbation theory to an original data set consisting of 19 187 enzymes representing all 59 subclasses present in the protein data bank. In addition, we developed a series of artificial neural network models able to predict enzyme-enzyme pairs of query-template sequences with accuracy, specificity, and sensitivity greater than 90% in both training and validation series. As a likely application of this methodology and to further validate our approach, we used our novel model to predict a set of enzymes belonging to the yeast Pichia stipites. This yeast has been widely studied because it is commonly present in nature and produces a high ethanol yield by converting lignocellulosic biomass into bioethanol through the xylose reductase enzyme. Using this premise, we tested our model on 222 enzymes including xylose reductase, that is, the enzyme responsible for the conversion of biomass into bioethanol.

PTML Model of Enzyme Subclasses for Mining the Proteome of Biofuel Producing Microorganisms

期刊

JOURNAL OF PROTEOME RESEARCH

出版社

AMER CHEMICAL SOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

PTML Model of Enzyme Subclasses for Mining the Proteome of Biofuel Producing Microorganisms

期刊

JOURNAL OF PROTEOME RESEARCH

出版社

AMER CHEMICAL SOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文