☆ 4.7 Article

Finding Route Hotspots in Large Labeled Networks

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2021)

期刊

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING

卷 33, 期 6, 页码 2479-2492

出版社

IEEE COMPUTER SOC

DOI: 10.1109/TKDE.2019.2956924

关键词

Computer hacking; Communication networks; Collaboration; Indexes; Trojan horses; Feature extraction; Graphs; sequential pattern; community detection; indexing

类别

Computer Science, Artificial Intelligence Computer Science, Information Systems Engineering, Electrical & Electronic

资金

National Key Research and Development Program of China [2017YFB0803301]
Natural Science Foundation of China [61976026, U1836215]
111 Project [B18008]
US National Science Foundation [III-1526499, III-1763325, III-1909323, CNS-1930941]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This article explores the problem of finding route hotspots in large labeled networks, proposes a scalable algorithm FastRH, and designs an RH-Index index structure for storing hotspot and pattern information. Experimental results demonstrate the effectiveness and scalability of these methods on real-world datasets.

In many advanced network analysis applications, like social networks, e-commerce, and network security, hotspots are generally considered as a group of vertices that are tightly connected owing to the similar characteristics, such as common habits and location proximity. In this article, we investigate the formation of hotspots from an alternative perspective that considers the routes along the network paths as the auxiliary information, and attempt to find the route hotspots in large labeled networks. A route hotspot is a cohesive subgraph that is covered by a set of routes, and these routes correspond to the same sequential pattern consisting of vertices' labels. To the best of our knowledge, the problem of Finding Route Hotspots in Large Labeled Networks has not been tackled in the literature. However, it is challenging as counting the number of hotspots in a network is #P-hard. Inspired by the observation that the sizes of hotspots decrease with the increasing lengths of patterns, we prove several anti-monotonicity properties of hotspots, and then develop a scalable algorithm called FastRH that can use these properties to effectively prune the patterns that cannot form any hotspots. In addition, to avoid the duplicate computation overhead, we judiciously design an effective index structure called RH-Index for storing the hotspot and pattern information collectively, which also enables incremental updating and efficient query processing. Our experimental results on real-world datasets clearly demonstrate the effectiveness and scalability of our proposed methods.

Finding Route Hotspots in Large Labeled Networks

期刊

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING

出版社

IEEE COMPUTER SOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Finding Route Hotspots in Large Labeled Networks

期刊

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING

出版社

IEEE COMPUTER SOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文