4.5 Article

Overcoming class imbalance in drug discovery problems: Graph neural networks and balancing approaches

期刊

出版社

ELSEVIER SCIENCE INC
DOI: 10.1016/j.jmgm.2023.108627

关键词

Graph neural networks; Unbalanced dataset; Drug discovery

向作者/读者索取更多资源

This research explores the application of Graph Neural Networks (GNNs) in drug development to overcome cost and time limitations. By comparing different datasets and GNN architectures, it provides benchmarking research to guide future investigations and enhance the effectiveness of GNNs in drug discovery and design. While oversampling technique shows potential on imbalanced datasets, further research is needed to optimize its usage and explore alternative solutions for class imbalance.
This research investigates the application of Graph Neural Networks (GNNs) to enhance the cost-effectiveness of drug development, addressing the limitations of cost and time. Class imbalances within classification datasets, such as the discrepancy between active and inactive compounds, give rise to difficulties that can be resolved through strategies like oversampling, undersampling, and manipulation of the loss function. A comparison is conducted between three distinct datasets using three different GNN architectures. This benchmarking research can steer future investigations and enhance the efficacy of GNNs in drug discovery and design. Three hundred models for each combination of architecture and dataset were trained using hyperparameter tuning techniques and evaluated using a range of metrics. Notably, the oversampling technique outperforms eight experiments, showcasing its potential. While balancing techniques boost imbalanced dataset models, their efficacy depends on dataset specifics and problem type. Although oversampling aids molecular graph datasets, more research is needed to optimize its usage and explore other class imbalance solutions.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据