3.8 Proceedings Paper

Statistical Network Similarity

Publisher

SPRINGER INTERNATIONAL PUBLISHING AG
DOI: 10.1007/978-3-031-21131-7_25

Keywords

-

Ask authors/readers for more resources

Graph isomorphism is an intractable problem, and computing graph similarity metrics is NP-hard. However, assessing (dis)similarity between networks is crucial in various fields. This article proposes a statistical approach to answer questions about network similarity and difference, using distance matrices and probability distributions. The comparison focuses on connectivity and community structure rather than observable graph characteristics. Experimental results with synthetic and real-world graphs validate the effectiveness and accuracy of the proposed technique.
Graph isomorphism is a problem for which there is no known polynomialtime solution. The more general problem of computing graph similarity metrics, graph edit distance or maximum common subgraph, is NP-hard. Nevertheless, assessing (dis)similarity between two or more networks is a key task in many areas, such as image recognition, biology, chemistry, computer and social networks. In this article, we offer a statistical answer to the following questions: (a) Are networks G1 and G2 similar?, (b) How different are the networks G1 and G2? and (c) Is G3 more similar to G1 or G2?. Our comparisons begin with the transformation of each graph into an all-pairs distance matrix. Our node-node distance, Jaccard distance, has been shown to offer an accurate reflection of the graph's connectivity structure. We then model these distances as probability distributions. Finally, we use well-established statistical tools to gauge the (dis)similarities in terms of probability distribution (dis)similarity. This comparison procedure aims to detect (dis)similarities in connectivity structure and community structure in particular, not in easily observable graph characteristics, such as degrees, edge counts or density. We validate our hypothesis that graphs can be meaningfully summarized and compared via their node-node distance distributions, using several synthetic and real-world graphs. Empirical results demonstrate its validity and the accuracy of our comparison technique.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

3.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available