4.5 Article

Data driven analysis of aromatase inhibitors through machine learning, database mining and library generation

Journal

CHEMICAL PHYSICS
Volume 577, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.chemphys.2023.112143

Keywords

Aromatase inhibitors; Drug design; Library enumeration; Data mining; Machine learning

Ask authors/readers for more resources

Designing novel drugs using data-driven and virtual screening approaches, such as machine learning and data mining, is a popular research topic in the pharmaceutical industry. In this study, ML models were trained using data collected from academic research articles, and molecular descriptors were utilized. The best ML models were selected and optimized to identify potential compounds for aromatase inhibitors. These models accurately predicted the inhibition values of compounds in a database, and new compounds were designed based on the predictions. Overall, this study demonstrates the potential significance of data-driven and virtual screening approaches in pharmaceutical research.
Designing of novel drugs using data-driven and virtual screening approaches is a popular research topic in the pharmaceutical industry. Machine learning (ML) and data mining have recently emerged as useful tools for finding potent compounds and predicting their biological activities. In this study, data was collected from academic research articles to train ML models. Molecular descriptors were utilized for training over forty ML models. The best two models (Decision Tree regressor and Extra Tree regressor) were selected based on statistical parameters, and their hyperparameters were optimized to identify the best compounds with high pIC50 values for aromatase inhibitors. A database of more than 5000 compounds was extracted from PubChem, and the best ML model was used to predict their aromatase inhibition values. The top three reference compounds from the database were elected, and new compounds were designed using the library enumeration methodology. The two best ML models (Decision Tree regressor and Extra Tree regressor) were able to accurately predict the aromatase inhibition values of the compounds in our database. In conclusion, our study shows that data-driven and virtual screening approaches using machine learning and data mining can be used to design novel molecules as drugs, specifically in the case of aromatase inhibitors. The results of our study have the potential to contribute significantly to the field of pharmaceutical research and development.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available