3.8 Proceedings Paper

Statistical Network Similarity

出版社

SPRINGER INTERNATIONAL PUBLISHING AG
DOI: 10.1007/978-3-031-21131-7_25

关键词

-

向作者/读者索取更多资源

Graph isomorphism is an intractable problem, and computing graph similarity metrics is NP-hard. However, assessing (dis)similarity between networks is crucial in various fields. This article proposes a statistical approach to answer questions about network similarity and difference, using distance matrices and probability distributions. The comparison focuses on connectivity and community structure rather than observable graph characteristics. Experimental results with synthetic and real-world graphs validate the effectiveness and accuracy of the proposed technique.
Graph isomorphism is a problem for which there is no known polynomialtime solution. The more general problem of computing graph similarity metrics, graph edit distance or maximum common subgraph, is NP-hard. Nevertheless, assessing (dis)similarity between two or more networks is a key task in many areas, such as image recognition, biology, chemistry, computer and social networks. In this article, we offer a statistical answer to the following questions: (a) Are networks G1 and G2 similar?, (b) How different are the networks G1 and G2? and (c) Is G3 more similar to G1 or G2?. Our comparisons begin with the transformation of each graph into an all-pairs distance matrix. Our node-node distance, Jaccard distance, has been shown to offer an accurate reflection of the graph's connectivity structure. We then model these distances as probability distributions. Finally, we use well-established statistical tools to gauge the (dis)similarities in terms of probability distribution (dis)similarity. This comparison procedure aims to detect (dis)similarities in connectivity structure and community structure in particular, not in easily observable graph characteristics, such as degrees, edge counts or density. We validate our hypothesis that graphs can be meaningfully summarized and compared via their node-node distance distributions, using several synthetic and real-world graphs. Empirical results demonstrate its validity and the accuracy of our comparison technique.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据