4.1 Review

Data-driven enzyme engineering to identify function-enhancing enzymes

Related references

Note: Only part of the references are listed.
Article Chemistry, Medicinal

EnzyHTP: A High-Throughput Computational Platform for Enzyme Modeling

Qianzhen Shao et al.

Summary: Molecular simulations have been widely used in understanding enzyme catalysis and designing new enzymes. However, manual operation is usually required throughout the simulation process, making it challenging to simulate enzymes in a high-throughput manner. In this work, a Python software called EnzyHTP was developed to automate the entire workflow of enzyme modeling, improving the efficiency of computational enzyme modeling.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2022)

Article Biotechnology & Applied Microbiology

Learning protein fitness models from evolutionary and assay-labeled data

Chloe Hsu et al.

Summary: This study proposes a simple machine learning algorithm that combines evolutionary and experimental data for improved protein fitness prediction. They find that using ridge regression on site-specific amino acid features combined with a probability density feature from modeling the evolutionary data performs well on this task.

NATURE BIOTECHNOLOGY (2022)

Article Multidisciplinary Sciences

Therapeutic enzyme engineering using a generative neural network

Andrew Giessel et al.

Summary: Enhancing the potency of mRNA therapeutics is crucial for treating rare diseases, and enzyme engineering can play a significant role in achieving this by improving the expression, half-life, and catalytic efficiency of the mRNA-encoded enzymes. In this study, a novel engineering method combining deep latent variable modeling, automated protein library design, and construction was used to rapidly identify more thermally stable and catalytically active metabolic enzyme variants.

SCIENTIFIC REPORTS (2022)

Article Biochemical Research Methods

Machine learning modeling of family wide enzyme-substrate specificity screens

Samuel Goldman et al.

Summary: Biocatalysis is a promising method for sustainable synthesis of pharmaceuticals, natural products, and chemicals. However, the selection of enzymes for catalyzing non-natural substrates is currently limited. This study curates multiple enzyme family screens and compares machine learning models for predicting compound-protein interactions. The results suggest that current models are incapable of learning interactions between compounds and proteins. A new structure-based strategy is proposed to improve predictive modeling.

PLOS COMPUTATIONAL BIOLOGY (2022)

Article Chemistry, Physical

Deep learning-based kcat prediction enables improved enzyme-constrained model reconstruction

Feiran Li et al.

Summary: The turnover numbers (k(cat)) of enzymes are crucial for understanding cellular metabolism and physiological diversity. In this study, a deep learning approach (DLKcat) was developed to predict k(cat) values for metabolic enzymes using only substrate structures and protein sequences. The predicted k(cat) values showed good performance in a genome-scale analysis of more than 300 yeast species, and were able to explain phenotypic differences.

NATURE CATALYSIS (2022)

Article Biochemistry & Molecular Biology

ProThermDB: thermodynamic database for proteins and mutants revisited after 15 years

Rahul Nikam et al.

Summary: ProThermDB is an updated version of a thermodynamic database for proteins and mutants, containing a vast amount of data on protein stability and providing a graphical interface for mutation visualization. It also includes a large amount of thermodynamic data from different organisms and cell lines, obtained using recent high throughput techniques.

NUCLEIC ACIDS RESEARCH (2021)

Article Multidisciplinary Sciences

Anomalous collapses of Nares Strait ice arches leads to enhanced export of Arctic sea ice

G. W. K. Moore et al.

Summary: Ice arches at the northern and southern ends of Nares Strait, a key passage in the Arctic, are forming for shorter durations, leading to increased ice transport and accelerating the export of multi-year ice.

NATURE COMMUNICATIONS (2021)

Article Chemistry, Physical

Machine-Learning-Guided Library Design Cycle for Directed Evolution of Enzymes: The Effects of Training Data Composition on Sequence Space Exploration

Yutaka Saito et al.

Summary: The study shows that machine learning is a useful tool in designing proteins with desired functions in protein engineering. Depending on the presence or absence of highly positive variants in the training data, machine learning-guided directed evolution can lead to improved variants in different regions of sequence space.

ACS CATALYSIS (2021)

Article Multidisciplinary Sciences

Ancestral lysosomal enzymes with increased activity harbor therapeutic potential for treatment of Hunter syndrome

Natalie M. Hendrikse et al.

Summary: Ancestral sequence reconstruction can enhance the activity of iduronate-2-sulfatase, potentially improving treatment outcomes for Hunter syndrome. Ancestral variants showed up to 2-fold higher activity than human IDS in vitro and could offer a more effective therapeutic effect, reducing treatment burden.

ISCIENCE (2021)

Article Biochemistry & Molecular Biology

Deep learning allows genome-scale prediction of Michaelis constants from structural features

Alexander Kroll et al.

Summary: The study developed a model using machine and deep learning methods to predict KM values for natural enzyme-substrate combinations, providing genome-scale KM predictions that can help relate metabolite concentrations to cellular physiology.

PLOS BIOLOGY (2021)

Article Chemistry, Multidisciplinary

Rational Enzyme Design without Structural Knowledge: A Sequence-Based Approach for Efficient Generation of Transglycosylases

David Teze et al.

Summary: This method describes a straightforward strategy involving rapid in silico analysis of protein sequences to identify single-mutant candidates for improving transglycosylation yields. Requiring minimal prior knowledge of the target enzyme, the method is generic and can validate mutations in one enzyme for transposition to others, even distantly related enzymes.

CHEMISTRY-A EUROPEAN JOURNAL (2021)

Article Multidisciplinary Sciences

Near-complete depolymerization of polyesters with nano-dispersed enzymes

Christopher Delre et al.

Summary: By dispersing enzymes with deep active sites, semi-crystalline polyesters can be efficiently degraded in a short period of time, with up to 98% conversion of polyester to small molecules. This method allows for complete elimination of the need to separate and landfill products in compost facilities, with degradation achieved in standard soil composts and household tap water.

NATURE (2021)

Article Biochemical Research Methods

Low-N protein engineering with data-efficient deep learning

Surojit Biswas et al.

Summary: The approach introduced in this study utilizes machine learning to build accurate virtual fitness landscapes and screen millions of sequences via in silico directed evolution using minimal functionally assayed mutant sequences. This method not only helps in quickly identifying enhanced protein variants, but also efficiently utilizes resources for high-throughput screening.

NATURE METHODS (2021)

Article Biochemistry & Molecular Biology

Advances in machine learning for directed evolution

Bruce J. Wittmann et al.

Summary: Machine learning can accelerate directed evolution by reducing expensive experimental screens, but collecting data for training ML models remains costly, while raw protein sequence data is readily available. Recent ML advances utilize protein sequences to enhance limited sequence-function data, aiming to efficiently explore vast protein space.

CURRENT OPINION IN STRUCTURAL BIOLOGY (2021)

Article Chemistry, Physical

Machine-Learning-Assisted Free Energy Simulation of Solution-Phase and Enzyme Reactions

Xiaoliang Pan et al.

Summary: This study presents a protocol for machine-learning-assisted free energy simulation of solution-phase and enzyme reactions at the ab initio quantum-mechanical/molecular-mechanical (ai-QM/MM) level. The developed MLP and Delta MLP exhibit promising accuracy in reproducing energy and forces, offering a cost-effective alternative for studying enzymatic reactions.

JOURNAL OF CHEMICAL THEORY AND COMPUTATION (2021)

Article Chemistry, Physical

Rate-Perturbing Single Amino Acid Mutation for Hydrolases: A Statistical Profiling

Bailu Yan et al.

Summary: Statistical profiling was conducted to identify mutations that enhance catalytic efficiency in hydrolases, revealing that mutations to bulky nonpolar residues are more likely to accelerate reaction rates. The analyses of structure-kinetics relationship showed that the propensity for rate enhancement in hydrolases is independent of protein sizes, and distal mutations have greater potential for inducing efficiency neutrality and avoiding efficiency deletion while still showing similar propensity for rate enhancement.

JOURNAL OF PHYSICAL CHEMISTRY B (2021)

Article Multidisciplinary Sciences

Highly accurate protein structure prediction with AlphaFold

John Jumper et al.

Summary: Proteins are essential for life, and accurate prediction of their structures is a crucial research problem. Current experimental methods are time-consuming, highlighting the need for accurate computational approaches to address the gap in structural coverage. Despite recent progress, existing methods fall short of atomic accuracy in protein structure prediction.

NATURE (2021)

Article Multidisciplinary Sciences

Revealing enzyme functional architecture via high-throughput microfluidic enzyme kinetics

C. J. Markin et al.

Summary: A high-throughput microfluidic enzyme kinetics platform, HT-MEK, was studied for simultaneous expression, purification, and characterization of over 1500 enzyme variants. Through more than 670,000 reactions on 1036 mutants of alkaline phosphatase PafA, over 5000 kinetic and physical constants were determined, uncovering underlying enzyme architecture.

SCIENCE (2021)

Article Chemistry, Physical

Spectroscopically Guided Simulations Reveal Distinct Strategies for Positioning Substrates to Achieve Selectivity in Nonheme Fe(II)/α-Ketoglutarate-Dependent Halogenases

Rimsha Mehmood et al.

Summary: Researchers investigate substrate/active-site dynamics of nonheme iron halogenases like BesD and WelO5 using experimental and computational methods, revealing the importance of active-site configurational isomerization for selective halogenation in WelO5. They also find distinct patterns of substrate-protein interactions for these enzymes, and discuss how optimal substrate/active-site geometry plays a crucial role in facilitating chemoselectivity in halogenases. Their work demonstrates different substrate-dependent strategies used to promote selectivity in halogenases.

ACS CATALYSIS (2021)

Article Biochemistry & Molecular Biology

Informed training set design enables efficient machine learning-assisted directed protein evolution

Bruce J. Wittmann et al.

Summary: The study investigates and optimizes a path-independent machine learning-assisted directed evolution protocol, finding that reducing inclusion of minimally informative protein variants in training data is crucial for improving the outcome of the evolution process.

CELL SYSTEMS (2021)

Review Chemistry, Multidisciplinary

Recent trends in biocatalysis

Dong Yi et al.

Summary: Biocatalysis has made revolutionary progress in the past century, benefiting from the integration of multidisciplinary technologies and the development of robust biocatalysts through protein engineering. The network of natural enzymatic synthesis pathways and artificially designed enzymatic cascades have been gradually constructed. Future development will move towards the integration of new technologies, intelligent manufacturing, and enzymatic total synthesis.

CHEMICAL SOCIETY REVIEWS (2021)

Review Biochemistry & Molecular Biology

Revolutionizing enzyme engineering through artificial intelligence and machine learning

Nitu Singh et al.

Summary: The combinatorial space of enzyme sequences is vast and exploring it with traditional experimental techniques is challenging; Artificial Intelligence and Machine Learning offer potential for revolutionizing enzyme engineering, overcoming limitations of traditional methods.

EMERGING TOPICS IN LIFE SCIENCES (2021)

Article Multidisciplinary Sciences

Multistable inflatable origami structures at the metre scale

Christopher Delre et al.

Summary: Using inspiration from origami, rigid-walled deployable structures that are multistable and inflatable have been designed. A library of bistable origami shapes created and then combined to build meter-scale functional structures.

NATURE (2021)

Article Computer Science, Artificial Intelligence

Expanding functional protein sequence spaces using generative adversarial networks

Donatas Repecka et al.

Summary: De novo protein design for catalysis of any desired chemical reaction has long been a goal in protein engineering. ProteinGAN, a self-attention-based variant of generative adversarial networks, has been developed to learn natural protein sequence diversity and generate functional protein sequences. This AI approach shows potential in rapidly generating diverse functional proteins within biological constraints.

NATURE MACHINE INTELLIGENCE (2021)

Article Biochemistry & Molecular Biology

Machine learning-based prediction of enzyme substrate scope: Application to bacterial nitrilases

Zhongyu Mou et al.

Summary: Predicting the substrate scope of enzymes is challenging, but utilizing machine learning models and experimental data can lead to accurate predictions for related enzymes.

PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS (2021)

Article Chemistry, Multidisciplinary

Directed Evolution of a Halide Methyltransferase Enables Biocatalytic Synthesis of Diverse SAM Analogs

M. Eng. Qingyun Tang et al.

Summary: Biocatalytic alkylations play a crucial role in obtaining chemo-, regio- and stereoselectively alkylated compounds. Recent research has shown that a halide methyltransferase from Chloracidobacterium thermophilum can synthesize SAM, leading to the development of a method for directed evolution of an HMT from Arabidopsis thaliana and identification of a variant that can produce various SAM analogs.

ANGEWANDTE CHEMIE-INTERNATIONAL EDITION (2021)

Article Chemistry, Physical

Kemp Elimination Reaction Catalyzed by Electric Fields

Carles Acosta-Silva et al.

CHEMPHYSCHEM (2020)

Article Chemistry, Physical

Machine Learning in Enzyme Engineering

Stanislav Mazurenko et al.

ACS CATALYSIS (2020)

Review Biotechnology & Applied Microbiology

Enzyme engineering: Reshaping the biocatalytic functions

Misha Ali et al.

BIOTECHNOLOGY AND BIOENGINEERING (2020)

Article Chemistry, Medicinal

Deep Dive into Machine Learning Models for Protein Engineering

Yuting Xu et al.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2020)

Article Multidisciplinary Sciences

An engineered PET depolymerase to break down and recycle plastic bottles

V. Tournier et al.

NATURE (2020)

Article Biochemistry & Molecular Biology

EnzymeMiner: automated mining of soluble enzymes with diverse structures, catalytic properties and stabilities

Jiri Hon et al.

NUCLEIC ACIDS RESEARCH (2020)

Article Multidisciplinary Sciences

Characterization and engineering of a two-enzyme system for plastics depolymerization

Brandon C. Knott et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2020)

Article Biochemical Research Methods

Discovery of Novel Gain-of-Function Mutations Guided by Structure-Based Deep Learning

Raghav Shroff et al.

ACS SYNTHETIC BIOLOGY (2020)

Article Chemistry, Multidisciplinary

Hydrogen-Deuterium Exchange within Adenosine Deaminase, a TIM Barrel Hydrolase, Identifies Networks for Thermal Activation of Catalysis

Shuaihua Gao et al.

JOURNAL OF THE AMERICAN CHEMICAL SOCIETY (2020)

Article Biotechnology & Applied Microbiology

Substrate specificity of 2-deoxy-D-ribose 5-phosphate aldolase (DERA) assessed by different protein engineering and machine learning methods

Sanni Voutilainen et al.

APPLIED MICROBIOLOGY AND BIOTECHNOLOGY (2020)

Article Multidisciplinary Sciences

An evolution-based model for designing chorismate mutase enzymes

William P. Russ et al.

SCIENCE (2020)

Article Chemistry, Multidisciplinary

Machine Learning Identifies Chemical Characteristics That Promote Enzyme Catalysis

Brian M. Bonk et al.

JOURNAL OF THE AMERICAN CHEMICAL SOCIETY (2019)

Article Multidisciplinary Sciences

Machine learning-assisted directed protein evolution with combinatorial libraries

Zachary Wu et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2019)

Review Chemistry, Applied

Enzyme-Catalysed Synthesis of Secondary and Tertiary Amides

Mark R. Petchey et al.

ADVANCED SYNTHESIS & CATALYSIS (2019)

Article Chemistry, Physical

Quantum Mechanical Description of Electrostatics Provides a Unified Picture of Catalytic Action Across Methyltransferases

Zhongyue Yang et al.

JOURNAL OF PHYSICAL CHEMISTRY LETTERS (2019)

Article Chemistry, Physical

Finding Reactive Configurations: A Machine Learning Approach for Estimating Energy Barriers Applied to Sirtuin 5

Beatriz von der Esch et al.

JOURNAL OF CHEMICAL THEORY AND COMPUTATION (2019)

Article Biochemical Research Methods

Unified rational protein engineering with sequence-based deep representation learning

Ethan C. Alley et al.

NATURE METHODS (2019)

Article Biotechnology & Applied Microbiology

Engineering CRISPR/ Lb Cas12a for highly efficient, temperature‐tolerant plant gene editing

Patrick Schindele et al.

PLANT BIOTECHNOLOGY JOURNAL (2019)

Article Biochemistry & Molecular Biology

UniProt: a worldwide hub of protein knowledge

Alex Bateman et al.

NUCLEIC ACIDS RESEARCH (2019)

Article Biochemistry & Molecular Biology

BRENDA in 2019: a European ELIXIR core data resource

Lisa Jeske et al.

NUCLEIC ACIDS RESEARCH (2019)

Article Chemistry, Multidisciplinary

Revealing quantum mechanical effects in enzyme catalysis with large-scale electronic structure simulation

Zhongyue Yang et al.

REACTION CHEMISTRY & ENGINEERING (2019)

Article Chemistry, Physical

Combining Reclaimed PET with Bio-based Monomers Enables Plastics Upcycling

Nicholas A. Rorrer et al.

JOULE (2019)

Article Biochemistry & Molecular Biology

Mechanism and Catalytic Site Atlas (M-CSA): a database of enzyme reaction mechanisms and active sites

Antonio J. M. Ribeiro et al.

NUCLEIC ACIDS RESEARCH (2018)

Article Biochemistry & Molecular Biology

Speeding up enzyme discovery and engineering with ultrahigh-throughput methods

Hans Adrian Bunzel et al.

CURRENT OPINION IN STRUCTURAL BIOLOGY (2018)

Article Biochemistry & Molecular Biology

ProtaBank: A repository for protein design and engineering data

Connie Y. Wang et al.

PROTEIN SCIENCE (2018)

Review Biochemistry & Molecular Biology

Directed Evolution of Protein Catalysts

Cathleen Zeymer et al.

ANNUAL REVIEW OF BIOCHEMISTRY, VOL 87 (2018)

Article Biochemistry & Molecular Biology

Automated Design of Efficient and Functionally Diverse Enzyme Repertoires

Olga Khersonsky et al.

MOLECULAR CELL (2018)

Article Biochemical Research Methods

Deep generative models of genetic variation capture the effects of mutations

Adam J. Riesselman et al.

NATURE METHODS (2018)

Article Chemistry, Physical

Large-scale QM/MM free energy simulations of enzyme catalysis reveal the influence of charge transfer

Heather J. Kulik

PHYSICAL CHEMISTRY CHEMICAL PHYSICS (2018)

Article Biochemistry & Molecular Biology

Functional and informatics analysis enables glycosyltransferase activity prediction

Min Yang et al.

NATURE CHEMICAL BIOLOGY (2018)

Article Biotechnology & Applied Microbiology

Predicting novel substrates for enzymes with minimal experimental effort with active learning

Dante A. Pertusi et al.

METABOLIC ENGINEERING (2017)

Article Biotechnology & Applied Microbiology

Mutation effects predicted from sequence co-variation

Thomas A. Hopf et al.

NATURE BIOTECHNOLOGY (2017)

Review Biochemical Research Methods

An introduction to deep learning on biological sequence data: examples and solutions

Vanessa Isabell Jurtz et al.

BIOINFORMATICS (2017)

Article Biochemistry & Molecular Biology

Coevolutionary Landscape Inference and the Context-Dependence of Mutations in Beta-Lactamase TEM-1

Matteo Figliuzzi et al.

MOLECULAR BIOLOGY AND EVOLUTION (2016)

Article Biochemical Research Methods

Semisupervised Gaussian Process for Automated Enzyme Search

Joseph Mellor et al.

ACS SYNTHETIC BIOLOGY (2016)

Article Chemistry, Multidisciplinary

Engineering of Kuma030: A Gliadin Peptidase That Rapidly Degrades Immunogenic Gliadin Peptides in Gastric Conditions

Clancey Wolf et al.

JOURNAL OF THE AMERICAN CHEMICAL SOCIETY (2015)

Article Multidisciplinary Sciences

Continuous Distributed Representation of Biological Sequences for Deep Proteomics and Genomics

Ehsaneddin Asgari et al.

PLOS ONE (2015)

Article Biochemical Research Methods

Deep mutational scanning: a new style of protein science

Douglas M. Fowler et al.

NATURE METHODS (2014)

Review Biochemistry & Molecular Biology

Design of Protein Catalysts

Donald Hilvert

ANNUAL REVIEW OF BIOCHEMISTRY, VOL 82 (2013)

Review Biochemistry & Molecular Biology

De novo enzymes by computational design

Hajo Kries et al.

CURRENT OPINION IN CHEMICAL BIOLOGY (2013)

Article Biochemistry & Molecular Biology

Many Pathways in Laboratory Evolution Can Lead to Improved Enzymes: How to Escape from Local Minima

Yosephine Gumulya et al.

CHEMBIOCHEM (2012)

Article Chemistry, Multidisciplinary

Computational Design of an α-Gliadin Peptidase

Sydney R. Gordon et al.

JOURNAL OF THE AMERICAN CHEMICAL SOCIETY (2012)

Article Biochemistry & Molecular Biology

SABIO-RK-database for biochemical reaction kinetics

Ulrike Wittig et al.

NUCLEIC ACIDS RESEARCH (2012)

Article Multidisciplinary Sciences

Bridging the gaps in design methodologies by evolutionary optimization of the stability and proficiency of designed Kemp eliminase KE59

Olga Khersonsky et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2012)

Article Multidisciplinary Sciences

RosettaScripts: A Scripting Language Interface to the Rosetta Macromolecular Modeling Suite

Sarel J. Fleishman et al.

PLOS ONE (2011)

Article Multidisciplinary Sciences

Direct-coupling analysis of residue coevolution captures native contacts across many protein families

Faruck Morcos et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2011)

Review Biotechnology & Applied Microbiology

Deep mutational scanning: assessing protein function on a massive scale

Carlos L. Araya et al.

TRENDS IN BIOTECHNOLOGY (2011)

Editorial Material Biochemistry & Molecular Biology

An exciting but challenging road ahead for computational enzyme design

David Baker

PROTEIN SCIENCE (2010)

Review Cell Biology

Exploring protein fitness landscapes by directed evolution

Philip A. Romero et al.

NATURE REVIEWS MOLECULAR CELL BIOLOGY (2009)

Article Multidisciplinary Sciences

Bimodal protein solubility distribution revealed by an aggregation analysis of the entire ensemble of Escherichia coli proteins

Tatsuya Niwa et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2009)

Article Biotechnology & Applied Microbiology

Improving catalytic function by ProSAR-driven enzyme evolution

Richard J. Fox et al.

NATURE BIOTECHNOLOGY (2007)

Article Biochemistry & Molecular Biology

A new set of amino acid descriptors and its application in peptide QSARs

H Mei et al.

BIOPOLYMERS (2005)

Article Biochemistry & Molecular Biology

UniProt: the Universal Protein knowledgebase

R Apweiler et al.

NUCLEIC ACIDS RESEARCH (2004)

Review Chemistry, Applied

Concepts of nature in organic synthesis: Cascade catalysis and multistep conversions in concert

A Bruggink et al.

ORGANIC PROCESS RESEARCH & DEVELOPMENT (2003)

Article Biochemistry & Molecular Biology

Optimizing the search algorithm for protein engineering by directed evolution

R Fox et al.

PROTEIN ENGINEERING (2003)

Article Biochemistry & Molecular Biology

BRENDA, enzyme data and metabolic information

I Schomburg et al.

NUCLEIC ACIDS RESEARCH (2002)

Article Biochemistry & Molecular Biology

Analysis and prediction of functional sub-types from protein sequence alignments

SS Hannenhalli et al.

JOURNAL OF MOLECULAR BIOLOGY (2000)

Article Biochemistry & Molecular Biology

The Protein Data Bank

HM Berman et al.

NUCLEIC ACIDS RESEARCH (2000)