4.7 Article

Benchmark Data Set for in Silico Prediction of Ames Mutagenicity

Journal

JOURNAL OF CHEMICAL INFORMATION AND MODELING
Volume 49, Issue 9, Pages 2077-2081

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/ci900161g

Keywords

-

Funding

  1. FP7-ICT Programme of the European Community [ICT-216886]
  2. DFG [MU 987/4-1]

Ask authors/readers for more resources

Up to now, publicly available data sets to build and evaluate Ames mutagenicity prediction tools have been very limited in terms of size and chemical space covered. In this report we describe a new unique public Ames mutagenicity data set comprising about 6500 nonconfidential compounds (available as SMILES strings and SDF) together with their biological activity. Three commercial tools (DEREK, MultiCASE, and an off-the-shelf Bayesian machine learner in Pipeline Pilot) are compared with four noncommercial machine learning implementations (Support Vector Machines, Random Forests, k-Nearest Neighbors, and Gaussian Processes) on the new benchmark data set.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available