
A generalized Weisfeiler-Lehman graph kernel

MACHINE LEARNING (2022)

Journal

MACHINE LEARNING

Volume 111, Issue 7, Pages 2601-2629

Publisher

SPRINGER

DOI: 10.1007/s10994-022-06131-w

Keywords

Graph kernel; Weisfeiler-Lehman; Tree edit distance; Wasserstein distance

Funding

Federal Ministry of Education and Research of Germany [01IS18038C]

Automated Summary

Weisfeiler-Lehman graph kernels are still one of the most prevalent graph kernels after more than a decade, thanks to their impressive predictive performance and time complexity. However, their binary comparison based on label equality may be too rigid for certain graph classes. To address this limitation, we propose a generalization of the Weisfeiler-Lehman graph kernels that considers a more natural and fine-grained similarity between labels. We demonstrate that this similarity can be efficiently calculated using the Wasserstein distance between vectors representing the labels. Our generalization outperforms other state-of-the-art graph kernels in terms of predictive performance on datasets with structurally complex graphs.

Abstract

After more than one decade, Weisfeiler-Lehman graph kernels are still among the most prevalent graph kernels due to their remarkable predictive performance and time complexity. They are based on a fast iterative partitioning of vertices, originally designed for deciding graph isomorphism with one-sided error. The Weisfeiler-Lehman graph kernels retain this idea and compare such labels with respect to equality. This binary valued comparison is, however, arguably too rigid for defining suitable graph kernels for certain graph classes. To overcome this limitation, we propose a generalization of Weisfeiler-Lehman graph kernels which takes into account a more natural and finer grade of similarity between Weisfeiler-Lehman labels than equality. We show that the proposed similarity can be calculated efficiently by means of the Wasserstein distance between certain vectors representing Weisfeiler-Lehman labels. This and other facts give rise to the natural choice of partitioning the vertices with the Wasserstein k-means algorithm. We empirically demonstrate on the Weisfeiler-Lehman subtree kernel, which is one of the most prominent Weisfeiler-Lehman graph kernels, that our generalization significantly outperforms this and other state-of-the-art graph kernels in terms of predictive performance on datasets which contain structurally more complex graphs beyond the typically considered molecular graphs.
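To make the equality-based comparison described above concrete, here is a minimal Python sketch of the classic Weisfeiler-Lehman subtree kernel, not of the generalization proposed in the paper. The graph representation (an adjacency dict plus a label dict), the function names, and the two toy graphs are assumptions chosen for illustration only. Labels are kept as nested tuples instead of being compressed to integers, which keeps them comparable across graphs.

```python
from collections import Counter


def wl_label_histogram(graph, iterations):
    """Count every Weisfeiler-Lehman label seen in iterations 0..iterations.

    `graph` is a hypothetical (adjacency, labels) pair: adjacency maps each
    vertex to a list of neighbours, labels maps each vertex to its start label.
    """
    adjacency, labels = graph
    current = dict(labels)
    histogram = Counter(current.values())
    for _ in range(iterations):
        refined = {}
        for vertex, neighbours in adjacency.items():
            # New label = own label plus the sorted multiset of neighbour labels.
            neighbour_labels = tuple(sorted(current[u] for u in neighbours))
            refined[vertex] = (current[vertex], neighbour_labels)
        current = refined
        histogram.update(current.values())
    return histogram


def wl_subtree_kernel(graph_a, graph_b, iterations=2):
    """Dot product of the two label histograms: labels match only if equal."""
    hist_a = wl_label_histogram(graph_a, iterations)
    hist_b = wl_label_histogram(graph_b, iterations)
    return sum(hist_a[label] * hist_b[label] for label in hist_a.keys() & hist_b.keys())


# Toy graphs (assumed for illustration): a path and a triangle, all labelled "A".
PATH = ({0: [1], 1: [0, 2], 2: [1]}, {0: "A", 1: "A", 2: "A"})
TRIANGLE = ({0: [1, 2], 1: [0, 2], 2: [0, 1]}, {0: "A", 1: "A", 2: "A"})

if __name__ == "__main__":
    print(wl_subtree_kernel(PATH, TRIANGLE, iterations=1))
    print(wl_subtree_kernel(PATH, TRIANGLE, iterations=2))
```

In this toy example the second iteration adds nothing to the kernel value: the refined labels of the path and the triangle are no longer exactly equal, so they count as entirely different, however similar the underlying neighbourhoods are. This is the binary comparison that the abstract calls too rigid.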
