4.7 Article

New Workflow for QSAR Model Development from Small Data Sets: Small Dataset Curator and Small Dataset Modeler. Integration of Data Curation, Exhaustive Double Cross-Validation, and a Set of Optimal Model Selection Techniques

期刊

JOURNAL OF CHEMICAL INFORMATION AND MODELING
卷 59, 期 10, 页码 4070-4076

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.jcim.9b00476

关键词

-

资金

  1. FCT/MCTES [UID/QUI/50006/2019, PTDC/QUI-QIN/30649/2017]
  2. Polish National Science Center [UMO-2016/23/D/NZ7/03973 (SONATA-12)]
  3. University Grants Commission, New Delhi

向作者/读者索取更多资源

Quantitative structure activity relationship (QSAR) modeling is a well-known in silico technique with extensive applications in several major fields such as drug design, predictive toxicology, materials science, food science, etc. Handling small-sized datasets due to the lack of experimental data for specialized end points is a crucial task for the QSAR researcher. In the present study, we propose an integrated workflow/scheme capable of dealing with small dataset modeling that integrates dataset curation, exhaustive double cross-validation and a set of optimal model selection techniques including consensus predictions. We have developed two software tools, namely, Small Dataset Curator, version 1.0.0, and Small Dataset Modeler, version 1.0.0, to effortlessly execute the proposed workflow. These tools are freely available for download from https://dtclab.webs.com/software-tools. We have performed case studies employing seven diverse datasets to demonstrate the performance of the proposed scheme (including data curation) for small dataset QSAR modeling. The case studies also confirm the usability and stability of the developed software tools.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据