Article

Comparative Assessment of Scoring Functions on an Updated Benchmark: 1. Compilation of the Test Set

Journal

JOURNAL OF CHEMICAL INFORMATION AND MODELING
Volume 54, Issue 6, Pages 1700-+

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/ci500080q

Funding

  1. Chinese National Natural Science Foundation [81172984, 21072213, 21002117, 21102168, 21102165]
  2. Chinese Ministry of Science and Technology (863 High-Tech Grant) [2012AA020308]
  3. Science and Technology Development Fund of Macao SAR [0330]
  4. Chinese Academy of Sciences
  5. MSD China Postdoctoral Research Fellowship

Abstract

Scoring functions are often applied in combination with molecular docking methods to predict ligand binding poses and ligand binding affinities or to identify active compounds through virtual screening. An objective benchmark for assessing the performance of current scoring functions is expected to give users practical guidance in choosing among the available methods. It can also reveal the common weaknesses of current methods and so point to future improvements. The primary goal of our comparative assessment of scoring functions (CASF) project is to provide a high-standard, publicly accessible benchmark of this type. Our latest study, CASF-2013, evaluated 20 popular scoring functions on an updated set of protein-ligand complexes. This data set was selected from the 8302 protein-ligand complexes recorded in the PDBbind database (version 2013) through a multistep process. Samples were selected by considering the quality of both the complex structures and the binding data. The qualified complexes were then clustered by 90% similarity in protein sequences, and three representative complexes were chosen from each cluster to control sample redundancy. The final outcome, the PDBbind core set (version 2013), consists of 195 protein-ligand complexes in 65 clusters, with binding constants spanning nearly 10 orders of magnitude. In this data set, 82% of the ligand molecules are druglike and 78% of the protein molecules are validated or potential drug targets. Correlations between binding constants and several key properties of the ligands are discussed. Methods and results of the scoring function evaluation are described in a companion work in this issue (doi: 10.1021/ci500081m).
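
The last selection step, choosing three representatives from each sequence cluster, can be illustrated with a minimal Python sketch. The abstract does not state how the three complexes are chosen, so the rule used here (the lowest-, middle-, and highest-affinity member of each cluster) is an assumption made only for illustration, as are the function name `pick_representatives`, the `min_size` parameter, and the toy identifiers and affinity values. Cluster assignment is taken as given; the 90% sequence-similarity clustering itself is not reproduced.

```python
from collections import defaultdict


def pick_representatives(complexes, min_size=3):
    """Pick three representatives per cluster (illustrative rule only).

    complexes: iterable of (complex_id, cluster_id, affinity) tuples, where
    affinity is a binding constant on a log scale (e.g. -log Kd).
    Assumption: clusters with fewer than `min_size` members are skipped, and
    the representatives are the lowest-, middle-, and highest-affinity members.
    """
    clusters = defaultdict(list)
    for complex_id, cluster_id, affinity in complexes:
        clusters[cluster_id].append((affinity, complex_id))

    selected = {}
    for cluster_id, members in clusters.items():
        if len(members) < min_size:
            continue  # too few members to supply three distinct picks
        members.sort()  # ascending by affinity
        n = len(members)
        picks = [members[0], members[n // 2], members[-1]]
        selected[cluster_id] = [complex_id for _, complex_id in picks]
    return selected


# Hypothetical toy data: (complex id, cluster id, -log Kd)
toy = [
    ("a", 1, 2.0), ("b", 1, 5.0), ("c", 1, 9.0), ("d", 1, 7.0),
    ("e", 2, 4.0), ("f", 2, 6.0),
]
core = pick_representatives(toy)
```

In this toy input, cluster 1 contributes three complexes and cluster 2 is skipped because it has only two members. Applied to 65 clusters, three picks per cluster give the 195 complexes reported for the core set.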
