4.6 Article

Analysis of the effects of related fingerprints on molecular similarity using an eigenvalue entropy approach

期刊

JOURNAL OF CHEMINFORMATICS
卷 13, 期 1, 页码 -

出版社

BMC
DOI: 10.1186/s13321-021-00506-2

关键词

Structure-activity relationship; Similarity-based virtual screening; 2D fingerprint; Unsupervised feature selection; Chemoinformatics

资金

  1. King Abdullah University of Science and Technology (KAUST) Office of Sponsored Research (OSR) [BAS/1/1624-01, URF/1/3412-01, URF/1/3450-01, FCC/1/1976-18, FCC/1/197623, FCC/1/1976-25, FCC/1/1976-26, FCS/1/4102-02]

向作者/读者索取更多资源

The study identified 2D fingerprints with little to no contribution to the eigenvalue distribution of the feature matrix using an eigenvalue-based entropy approach, showing that their presence can substantially affect molecular similarity scores and bias the outcome of molecular similarity analysis. This has implications for the optimal selection of 2D fingerprints for compound similarity analysis and identification of potential hits for compounds with target biological activity in virtual screening.
Two-dimensional (2D) chemical fingerprints are widely used as binary features for the quantification of structural similarity of chemical compounds, which is an important step in similarity-based virtual screening (VS). Here, using an eigenvalue-based entropy approach, we identified 2D fingerprints with little to no contribution to shaping the eigenvalue distribution of the feature matrix as related ones and examined the degree to which these related 2D fingerprints influenced molecular similarity scores calculated with the Tanimoto coefficient. Our analysis identified many related fingerprints in publicly available fingerprint schemes and showed that their presence in the feature set could have substantial effects on the similarity scores and bias the outcome of molecular similarity analysis. Our results have implication in the optimal selection of 2D fingerprints for compound similarity analysis and the identification of potential hits for compounds with target biological activity in VS.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据