Article

Hit Dexter: A Machine-Learning Model for the Prediction of Frequent Hitters

Journal

CHEMMEDCHEM
Volume 13, Issue 6, Pages 564-571

Publisher

WILEY-V C H VERLAG GMBH
DOI: 10.1002/cmdc.201700673

Keywords

cheminformatics; compound promiscuity; frequent hitters; PAINS; high-throughput screening

Funding

  1. Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) [KI 2085/1-1]
  2. Ministry of Education of the Czech Republic [NPU I-LO1220, LM2015063]
  3. Erasmus+ Programme of the European Commission


False-positive assay readouts caused by badly behaving compounds, among them frequent hitters, pan-assay interference compounds (PAINS), and aggregators, continue to pose a major challenge to experimental screening. Only a few in silico methods allow the prediction of such problematic compounds. We report the development of Hit Dexter, two extremely randomized trees classifiers for the prediction of compounds likely to trigger positive assay readouts either by true promiscuity or by assay interference. The models were trained on a well-prepared dataset extracted from the PubChem Bioassay database, consisting of approximately 311,000 compounds tested for activity on at least 50 proteins. On an independent test set, Hit Dexter reached MCC and AUC values of up to 0.67 and 0.96, respectively. The models are expected to be of high value, in particular to medicinal chemists and biochemists, who can use Hit Dexter to identify compounds for which positive assay readouts should be treated with extra caution. Hit Dexter is available as a free web service.
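To make the two reported figures of merit concrete, the following minimal sketch computes the Matthews correlation coefficient (MCC) and an AUC for a binary classifier. It assumes that AUC here means the area under the ROC curve, and it uses a handful of made-up labels and scores purely for illustration. The function names, the 0.5 threshold, and the toy data are hypothetical; this is not the authors' evaluation code and it does not reproduce the 0.67 and 0.96 values.

```python
import math


def matthews_corrcoef(y_true, y_pred):
    """MCC for binary labels (1 = flagged as frequent hitter, 0 = not flagged)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention, return 0 when any marginal count is zero.
    return 0.0 if denom == 0 else (tp * tn - fp * fn) / denom


def roc_auc(y_true, scores):
    """ROC AUC as the probability that a random positive outranks a random
    negative, counting ties as one half."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy data (hypothetical): 3 frequent hitters and 5 well-behaved compounds.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.6, 0.3, 0.2, 0.1, 0.05]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed decision threshold

print(f"MCC = {matthews_corrcoef(y_true, y_pred):.3f}")
print(f"AUC = {roc_auc(y_true, scores):.3f}")
```

MCC is computed from thresholded predictions and stays informative when the classes are imbalanced, while the ROC AUC depends only on how the scores rank positives against negatives. That difference is one reason the two values for the same model can sit far apart.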
