Journal
MOLECULAR INFORMATICS
Volume 30, Issue 9, Pages 779-789
Publisher
WILEY-V C H VERLAG GMBH
DOI: 10.1002/minf.201100053
Keywords
Subspace mapping; Applicability domain; QSAR; Chemoinformatics; Bayesian estimation
Funding
- MINCyT-BMBF [AL0811-ARG 08/016]
Abstract
This work describes a methodology for assisting the virtual screening of drugs during the early stages of the drug development process. The methodology is proposed to improve the reliability of in silico property prediction and is structured in two steps. First, a transformation is sought that maps a high-dimensional space, defined by potentially redundant or irrelevant molecular descriptors, into a low-dimensional, application-related space. For this task we evaluate three different target-driven subspace mapping methods, of which we highlight the recent Correlative Matrix Mapping (CMM) as the most stable. Second, we apply an applicability domain model to the low-dimensional space to assess the confidence of compound classification. Using a probabilistic framework, the applicability domain approach identifies compounds that are poorly represented in the training set (extrapolation problems) and regions of the space where the uncertainty about the correct class is higher than normal (interpolation problems). This two-step approach represents an important contribution to the development of reliable prediction tools in chemoinformatics, a field in need of both interpretable models and methods that estimate the confidence of predictions.
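The two-step idea can be illustrated with a minimal sketch. It is not the authors' implementation: CMM is not reproduced here, and a one-dimensional Fisher discriminant is swapped in as a stand-in target-driven mapping. The class-conditional Gaussian model, the synthetic descriptor data, and the thresholds `density_floor` and `margin` are all assumptions made for illustration only.

```python
import numpy as np

def fisher_direction(X, y):
    """Stand-in target-driven mapping (NOT CMM): 1-D Fisher discriminant."""
    X0, X1 = X[y == 0], X[y == 1]
    # Within-class scatter matrix, with a small ridge for numerical stability.
    Sw = np.cov(X0, rowvar=False) * (len(X0) - 1)
    Sw += np.cov(X1, rowvar=False) * (len(X1) - 1)
    Sw += 1e-6 * np.eye(X.shape[1])
    w = np.linalg.solve(Sw, X1.mean(axis=0) - X0.mean(axis=0))
    return w / np.linalg.norm(w)

def fit_gaussians(z, y):
    """Per-class (mean, std, prior) in the low-dimensional space."""
    return [(z[y == c].mean(), z[y == c].std(ddof=1), np.mean(y == c))
            for c in (0, 1)]

def normal_pdf(z, mu, sigma):
    return np.exp(-0.5 * ((z - mu) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))

def assess(z, params, density_floor, margin):
    """Bayes posterior for class 1 plus two applicability-domain flags."""
    joint = np.array([prior * normal_pdf(z, mu, sigma)
                      for mu, sigma, prior in params])
    evidence = joint.sum(axis=0)  # p(z): low values mean sparse training coverage
    posterior1 = joint[1] / np.maximum(evidence, 1e-300)
    extrapolation = evidence < density_floor
    interpolation = ~extrapolation & (np.abs(posterior1 - 0.5) < margin)
    return posterior1, extrapolation, interpolation

# Synthetic "descriptors" (assumption): 10 columns, only the first is informative.
rng = np.random.default_rng(0)
y = np.repeat([0, 1], 500)
X = rng.normal(size=(1000, 10))
X[:, 0] += np.where(y == 1, 2.0, -2.0)

w = fisher_direction(X, y)
params = fit_gaussians(X @ w, y)

# Three hypothetical query compounds: typical class 1, between classes, far away.
queries = np.zeros((3, 10))
queries[0, 0] = 2.0
queries[2, 0] = 50.0
posterior1, extrapolation, interpolation = assess(
    queries @ w, params, density_floor=0.01, margin=0.3)

print("P(class 1):   ", np.round(posterior1, 3))
print("extrapolation:", extrapolation)
print("interpolation:", interpolation)
```

In this sketch a query is flagged as an extrapolation problem when its estimated density under the training data falls below `density_floor`, and as an interpolation problem when it lies in a populated region but its class posterior stays within `margin` of 0.5. Both thresholds are arbitrary here and would need to be calibrated on real data.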