☆ 4.5 Article

Data driven analysis of aromatase inhibitors through machine learning, database mining and library generation

CHEMICAL PHYSICS (2024)

Journal

CHEMICAL PHYSICS

Volume 577, Issue -, Pages -

Publisher

ELSEVIER

DOI: 10.1016/j.chemphys.2023.112143

Keywords

Aromatase inhibitors; Drug design; Library enumeration; Data mining; Machine learning

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

Designing novel drugs using data-driven and virtual screening approaches, such as machine learning and data mining, is a popular research topic in the pharmaceutical industry. In this study, ML models were trained using data collected from academic research articles, and molecular descriptors were utilized. The best ML models were selected and optimized to identify potential compounds for aromatase inhibitors. These models accurately predicted the inhibition values of compounds in a database, and new compounds were designed based on the predictions. Overall, this study demonstrates the potential significance of data-driven and virtual screening approaches in pharmaceutical research.

Designing of novel drugs using data-driven and virtual screening approaches is a popular research topic in the pharmaceutical industry. Machine learning (ML) and data mining have recently emerged as useful tools for finding potent compounds and predicting their biological activities. In this study, data was collected from academic research articles to train ML models. Molecular descriptors were utilized for training over forty ML models. The best two models (Decision Tree regressor and Extra Tree regressor) were selected based on statistical parameters, and their hyperparameters were optimized to identify the best compounds with high pIC50 values for aromatase inhibitors. A database of more than 5000 compounds was extracted from PubChem, and the best ML model was used to predict their aromatase inhibition values. The top three reference compounds from the database were elected, and new compounds were designed using the library enumeration methodology. The two best ML models (Decision Tree regressor and Extra Tree regressor) were able to accurately predict the aromatase inhibition values of the compounds in our database. In conclusion, our study shows that data-driven and virtual screening approaches using machine learning and data mining can be used to design novel molecules as drugs, specifically in the case of aromatase inhibitors. The results of our study have the potential to contribute significantly to the field of pharmaceutical research and development.

Data driven analysis of aromatase inhibitors through machine learning, database mining and library generation

Journal

CHEMICAL PHYSICS

Publisher

ELSEVIER

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Data driven analysis of aromatase inhibitors through machine learning, database mining and library generation

Journal

CHEMICAL PHYSICS

Publisher

ELSEVIER

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper