Article

Network Sampling: From Static to Streaming Graphs

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA (2014)

Journal

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA

Volume 8, Issue 2

Publisher

ASSOC COMPUTING MACHINERY

DOI: 10.1145/2601438

Keywords

Network sampling; social network analysis; graph streams; relational classification

Funding

ARO
NSF [W911NF-08-1-0238, IIS-1017898, IIS-1149789, IIS-1219015]
National Science Foundation, Directorate for Computer & Information Science & Engineering, Division of Information & Intelligent Systems [1219015]

Abstract

Network sampling is integral to the analysis of social, information, and biological networks. Since many real-world networks are massive, continuously evolving, and/or distributed in nature, the network structure is often sampled in order to facilitate study. For these reasons, a more thorough and complete understanding of network sampling is critical to support the field of network science. In this paper, we outline a framework for the general problem of network sampling by highlighting the different objectives, population and units of interest, and classes of network sampling methods. In addition, we propose a spectrum of computational models for network sampling methods, ranging from the traditionally studied model based on the assumption of a static domain to a more challenging model that is appropriate for streaming domains. We design a family of sampling methods based on the concept of graph induction that generalize across the full spectrum of computational models (from static to streaming) while efficiently preserving many of the topological properties of the input graphs. Furthermore, we demonstrate how traditional static sampling algorithms can be modified for graph streams for each of the three main classes of sampling methods: node, edge, and topology-based sampling. Experimental results indicate that our proposed family of sampling methods preserves the underlying properties of the graph more accurately in both static and streaming domains. Finally, we study the impact of network sampling algorithms on the parameter estimation and performance evaluation of relational classification algorithms.
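To make the idea of edge sampling combined with graph induction over a stream more concrete, the following is a minimal illustrative sketch. It is not the authors' algorithm: the function name `sample_stream_with_induction`, the sampling probability `p`, and the single-pass rule (keep an arriving edge if both endpoints were already sampled, otherwise sample it with probability `p`) are assumptions chosen for illustration only.

```python
import random


def sample_stream_with_induction(edge_stream, p, seed=None):
    """Single pass over an edge stream (hypothetical sketch, not the paper's method).

    An arriving edge is kept if both endpoints are already in the sampled
    node set (the induction step); otherwise it is sampled with probability p,
    and its endpoints join the sampled node set.
    """
    rng = random.Random(seed)
    nodes = set()   # sampled nodes seen so far
    edges = set()   # sampled edges
    for u, v in edge_stream:
        if u in nodes and v in nodes:
            # Induction step: both endpoints were sampled earlier in the stream.
            edges.add((u, v))
        elif rng.random() < p:
            # Plain edge sampling: keep the edge and both of its endpoints.
            nodes.update((u, v))
            edges.add((u, v))
    return nodes, edges


if __name__ == "__main__":
    stream = [(1, 2), (2, 3), (1, 3), (3, 4), (4, 5), (2, 4)]
    sampled_nodes, sampled_edges = sample_stream_with_induction(stream, p=0.5, seed=0)
    print(sorted(sampled_nodes), sorted(sampled_edges))
```

Because the stream is read only once, an edge between two sampled nodes is captured only if it arrives after both endpoints have been sampled, so the induced subgraph in this sketch is partial rather than complete.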
