4.7 Article

Reducing false positive rate of docking-based virtual screening by active learning

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Chemistry, Medicinal

TocoDecoy: A New Approach to Design Unbiased Datasets for Training and Benchmarking Machine-Learning Scoring Functions

Xujun Zhang et al.

Summary: The development of accurate machine-learning-based scoring functions for virtual screening requires unbiased and diverse datasets. However, most existing datasets may suffer from hidden biases and data insufficiency. In this study, we developed a new approach named TocoDecoy to generate unbiased and expandable datasets, and evaluated its performance compared to other datasets.

JOURNAL OF MEDICINAL CHEMISTRY (2022)

Review Chemistry, Multidisciplinary

Featurization strategies for protein-ligand interactions and their applications in scoring function development

Guoli Xiong et al.

Summary: Classical scoring functions have reached a plateau in predictive performance, while machine learning scoring functions relying on sophisticated techniques show great potential in binding affinity prediction. Automated-extraction features are emerging as a new trend in featurization for protein-ligand interactions, helping capture important physical processes.

WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE (2022)

Review Biochemical Research Methods

The impact of compound library size on the performance of scoring functions for structure-based virtual screening

Louison Fresnais et al.

Summary: Increasing the size of training datasets improves the accuracy of machine learning-based scoring functions in structure-based virtual screening, and using massive test sets can lead to fast discovery of drug leads with low-nanomolar potency. Screening larger compound libraries results in the identification of more potent actives, and ranking molecules with more accurate machine learning-based scoring functions can further enhance their potency. Additionally, classical and machine learning-based scoring functions often find different actives, suggesting the benefit of using both types of scoring functions on multiple targets.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Biochemical Research Methods

Improving structure-based virtual screening performance via learning from scoring function components

Guo-Li Xiong et al.

Summary: Scoring functions based on new protein-ligand interaction representations and advanced ML algorithms, such as the energy auxiliary terms learning (EATL) method, outperform classical SFs in terms of ROC and BEDROC, showing comparable performance with other advanced ML-based methods on the diverse subset of Directory of Useful Decoys: Enhanced (DUD-E). This approach demonstrates effectiveness in improving screening power and can be extended to other docking programs and SFs available.

BRIEFINGS IN BIOINFORMATICS (2021)

Review Biochemistry & Molecular Biology

Recent progress on the prospective application of machine learning to structure-based virtual screening

Ghita Ghislat et al.

Summary: With the availability of more bioactivity and protein structure data, scoring functions using machine learning are becoming more accurate and widely applicable. Improvements in selecting suitable decoys and training and evaluating ML-based scoring functions have enhanced their performance for virtual screening. Recent applications have shown promising results, indicating potential for future advancements in structure-based virtual screening studies.

CURRENT OPINION IN CHEMICAL BIOLOGY (2021)

Article Chemistry, Medicinal

Property-Unmatched Decoys in Docking Benchmarks

Reed M. Stein et al.

Summary: The enrichment of ligands compared to property-matched decoys is commonly used in docking library screens, but over-optimizing for enrichment alone can lead to false confidence in prospective performance. By adding decoys representing charge extrema and overall characteristics of the library being docked, one can sample molecules not represented by the ligands and property-matched decoys, improving the accuracy of future screening results.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2021)

Article Chemistry, Physical

Efficient Exploration of Chemical Space with Docking and Deep Learning

Ying Yang et al.

Summary: The increasing availability of purchasable compounds for virtual screening and assay has led to the development of a machine learning-enhanced molecular docking protocol, which drastically improves throughput and preserves the diversity of experimentally confirmed hit compounds. This protocol successfully identifies high scoring compounds while exploring a large region of chemical space, demonstrating superior performance compared to traditional methods.

JOURNAL OF CHEMICAL THEORY AND COMPUTATION (2021)

Article Chemistry, Medicinal

Active Learning for Drug Design: A Case Study on the Plasma Exposure of Orally Administered Drugs

Xiaoyu Ding et al.

Summary: The research introduces a two-phase active learning pipeline that is successfully applied to the prediction of drug oral plasma exposure. The first phase model demonstrates excellent sampling capability in a noisy data set, while the second phase model achieves improved accuracy and confident predictions through exploring a large diverse chemical space.

JOURNAL OF MEDICINAL CHEMISTRY (2021)

Article Chemistry, Medicinal

InteractionGraphNet: A Novel and Efficient Deep Graph Representation Learning Framework for Accurate Protein-Ligand Interaction Predictions

Dejun Jiang et al.

Summary: The study proposed a novel deep learning framework named InteractionGraphNet (IGN) to learn protein-ligand interactions from 3D structures. IGN utilizes two independent graph convolution modules to sequentially learn intramolecular and intermolecular interactions, achieving better performance in binding affinity prediction, virtual screening, and pose prediction experiments.

JOURNAL OF MEDICINAL CHEMISTRY (2021)

Article Biochemical Research Methods

Generating property-matched decoy molecules using deep learning

Fergus Imrie et al.

Summary: This study introduces a deep learning method (DeepCoy) that generates decoys to remove biases and improve virtual screening performance, significantly reducing the risk of misclassification.

BIOINFORMATICS (2021)

Article Biochemical Research Methods

Forman persistent Ricci curvature (FPRC)-based machine learning models for protein-ligand binding affinity prediction

JunJie Wee et al.

Summary: Artificial intelligence techniques have been applied to the entire drug design process, with molecular featurization being a central challenge for AI-based drug design success.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Multidisciplinary Sciences

Persistent spectral-based machine learning (PerSpect ML) for protein-ligand binding affinity prediction

Zhenyu Meng et al.

Summary: Molecular descriptors are crucial for quantitative structure-activity relationship (QSAR) models and machine learning-based data analysis. The proposed PerSpect ML models utilize a novel filtration process to generate spectral models at various scales, showing potential to greatly improve learning models in molecular data analysis. Results demonstrate superior performance in protein-ligand binding affinity prediction compared to existing models across commonly used databases.

SCIENCE ADVANCES (2021)

Review Chemistry, Multidisciplinary

From machine learning to deep learning: Advances in scoring functions for protein-ligand docking

Chao Shen et al.

WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE (2020)

Article Chemistry, Medicinal

Application of Negative Design To Design a More Desirable Virtual Screening Library

Zi-Yi Yang et al.

JOURNAL OF MEDICINAL CHEMISTRY (2020)

Article Chemistry, Medicinal

LIT-PCBA: An Unbiased Data Set for Machine Learning and Virtual Screening

Viet-Khoa Tran-Nguyen et al.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2020)

Article Chemistry, Medicinal

Improving Docking-Based Virtual Screening Ability by Integrating Multiple Energy Auxiliary Terms from Molecular Docking Scoring

Wen-Ling Ye et al.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2020)

News Item Biotechnology & Applied Microbiology

Drug-induced protein degradation heats up

Michael Eisenstein

NATURE BIOTECHNOLOGY (2020)

Article Multidisciplinary Sciences

Machine learning classification can reduce false positives in structure-based virtual screening

Yusuf O. Adeshina et al.

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (2020)

Article Chemistry, Medicinal

In Need of Bias Control: Evaluating Chemical Data for Machine Learning in Structure-Based Virtual Screening

Jochen Sieg et al.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2019)

Article Chemistry, Medicinal

Practical Model Selection for Prospective Virtual Screening

Shengchao Liu et al.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2019)

Article Biochemical Research Methods

Representability of algebraic topology for biomolecules in machine learning based scoring and virtual screening

Zixuan Cang et al.

PLOS COMPUTATIONAL BIOLOGY (2018)

Review Pharmacology & Pharmacy

Decoys Selection in Benchmarking Datasets: Overview and Perspectives

Manon Reau et al.

FRONTIERS IN PHARMACOLOGY (2018)

Article Computer Science, Artificial Intelligence

Machine learning in computational docking

Mohamed A. Khamis et al.

ARTIFICIAL INTELLIGENCE IN MEDICINE (2015)

Article Automation & Control Systems

Comparative assessment of machine-learning scoring functions on PDBbind 2013

Mohamed A. Khamis et al.

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE (2015)

Review Chemistry, Multidisciplinary

Machine-learning scoring functions to improve structure-based binding affinity prediction and virtual screening

Qurrat Ul Ain et al.

WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE (2015)

Article Chemistry, Medicinal

Does a More Precise Chemical Description of Protein-Ligand Complexes Lead to More Accurate Prediction of Binding Affinity?

Pedro J. Ballester et al.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2014)

Article Chemistry, Medicinal

Evaluation and Optimization of Virtual Screening Workflows with DEKOIS 2.0-A Public Library of Challenging Docking Benchmark Sets

Matthias R. Bauer et al.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2013)

Article Chemistry, Medicinal

Directory of Useful Decoys, Enhanced (DUD-E): Better Ligands and Decoys for Better Benchmarking

Michael M. Mysinger et al.

JOURNAL OF MEDICINAL CHEMISTRY (2012)

Article Chemistry, Medicinal

A Machine Learning-Based Method To Improve Docking Scoring Functions and Its Application to Drug Repurposing

Sarah L. Kinnings et al.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2011)

Review Chemistry, Multidisciplinary

Outstanding challenges in protein-ligand docking and structure-based virtual screening

Bohdan Waszkowycz et al.

WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE (2011)

Article Chemistry, Medicinal

Virtual screening system for finding structurally diverse hits by active learning

Yukiko Fujiwara et al.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2008)

Article Chemistry, Medicinal

Benchmarking sets for molecular docking

Niu Huang et al.

JOURNAL OF MEDICINAL CHEMISTRY (2006)

Article Chemistry, Medicinal

Prediction of protein-ligand interactions. Docking and scoring: Successes and gaps

Andrew R. Leach et al.

JOURNAL OF MEDICINAL CHEMISTRY (2006)