期刊
BIOINFORMATICS
卷 23, 期 22, 页码 3001-3008出版社
OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btm470
关键词
-
Motivation: The large-scale comparison of proteinligand binding sites is problematic, in that measures of structural similarity are difficult to quantify and are not easily understood in terms of statistical similarity that can ultimately be related to structure and function. We present a binding site matching score the Poisson Index (PI) based upon a well-defined statistical model. PI requires only the number of matching atoms between two sites and the size of the two sitesthe same information used by the Tanimoto Index (TI), a comparable and widely used measure for molecular similarity. We apply PI and TI to a previously automatically extracted set of binding sites to determine the robustness and usefulness of both scores. Results: We found that PI outperforms TI; moreover, site similarity is poorly defined for TI at values around the 99.5 confidence level for which PI is well defined. A difference map at this confidence level shows that PI gives much more meaningful information than TI. We show individual examples where TI fails to distinguish either a false or a true site paring in contrast to PI, which performs much better. TI cannot handle large or small sites very well, or the comparison of large and small sites, in contrast to PI that is shown to be much more robust. Despite the difficulty of determining a biological ground truth for binding site similarity we conclude that PI is a suitable measure of binding site similarity and could form the basis for a binding site classification scheme comparable to existing protein domain classification schema. Availability: Pl is implemented in SitesBase www.modelling.leeds.ac.uk/sb/ Contact: r.m.jackson@leeds.ac.uk.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据