4.7 Article

Comparison of Nonbinary Similarity Coefficients for Similarity Searching, Clustering and Compound Selection

期刊

JOURNAL OF CHEMICAL INFORMATION AND MODELING
卷 49, 期 5, 页码 1193-1201

出版社

AMER CHEMICAL SOC
DOI: 10.1021/ci8004644

关键词

-

资金

  1. University of Sheffield through the Jacques-Emile Dubois Grant
  2. Tripos Inc.

向作者/读者索取更多资源

Several recent studies have compared the relative performance of a selection of similarity coefficients when applied to chemical databases represented by binary fingerprints. Considerable variation in performance, when used for (dis)similarity-based techniques, such as similarity searching, database clustering, and dissimilarity-based compound selection, has been reported, the reasons for which are closely related to molecular size. For many of these similarity coefficients, an alternative form can be derived which is applicable to sets of nonbinary data, such as calculated or measured physicochemical properties, or counts of substructural fragments. Here we report on several studies which have been undertaken to investigate the relative performance of twelve coefficients when applied to nonbinary data using such (dis)similarity-based techniques. Results suggest that no single coefficient is appropriate for all methodologies investigated and that the size bias detected with binary data is not as apparent when the data and, hence, coefficient are nonbinary in nature.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据