4.6 Article

ExCAPE-DB: an integrated large scale dataset facilitating Big Data analysis in chemogenomics

期刊

JOURNAL OF CHEMINFORMATICS
卷 9, 期 -, 页码 -

出版社

BMC
DOI: 10.1186/s13321-017-0203-5

关键词

Big Data; Bioactivity; Chemogenomics; Chemical structure; Molecular fingerprints; Search engine; QSAR

资金

  1. European Union [671555]

向作者/读者索取更多资源

Chemogenomics data generally refers to the activity data of chemical compounds on an array of protein targets and represents an important source of information for building in silico target prediction models. The increasing volume of chemogenomics data offers exciting opportunities to build models based on Big Data. Preparing a high quality data set is a vital step in realizing this goal and this work aims to compile such a comprehensive chemogenomics dataset. This dataset comprises over 70 million SAR data points from publicly available databases (PubChem and ChEMBL) including structure, target information and activity annotations. Our aspiration is to create a useful chemogenomics resource reflecting industry-scale data not only for building predictive models of in silico polypharmacology and offtarget effects but also for the validation of cheminformatics approaches in general.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据