4.7 Article

Graph kernels based on tree patterns for molecules

Journal

MACHINE LEARNING
Volume 75, Issue 1, Pages 3-35

Publisher

SPRINGER
DOI: 10.1007/s10994-008-5086-2

Keywords

Graph kernels; Support vector machines; Chemoinformatics

Ask authors/readers for more resources

Motivated by chemical applications, we revisit and extend a family of positive definite kernels for graphs based on the detection of common subtrees, initially proposed by Ramon and Gartner (Proceedings of the first international workshop on mining graphs, trees and sequences, pp. 65-74, 2003). We propose new kernels with a parameter to control the complexity of the subtrees used as features to represent the graphs. This parameter allows to smoothly interpolate between classical graph kernels based on the count of common walks, on the one hand, and kernels that emphasize the detection of large common subtrees, on the other hand. We also propose two modular extensions to this formulation. The first extension increases the number of subtrees that define the feature space, and the second one removes noisy features from the graph representations. We validate experimentally these new kernels on problems of toxicity and anti-cancer activity prediction for small molecules with support vector machines.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available