4.5 Article

Sampling Based Estimation of In-Degree Distribution for Directed Complex Networks

Journal

JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS
Volume 30, Issue 4, Pages 863-876

Publisher

TAYLOR & FRANCIS INC
DOI: 10.1080/10618600.2021.1873143

Keywords

Asymptotic approach; Directed network; In-degree; Random walks; Sampling; Statistical inversion

Funding

  1. FCT [CRM:0022222]
  2. NSF [DMS-1712966, DMS-1606839, DMS-1613072]
  3. ARO [W911NF-17-1-0010]

Ask authors/readers for more resources

This work focuses on estimating the in-degree distribution of directed networks from sampling network nodes or edges. Two estimation approaches are proposed, based on inversion and asymptotic methods. The performance of these approaches is tested on synthetic and real networks, showing good results.
The focus of this work is on estimation of the in-degree distribution in directed networks from sampling network nodes or edges. A number of sampling schemes are considered, including random sampling with and without replacement, and several approaches based on random walks with possible jumps. When sampling nodes, it is assumed that only the out-edges of that node are visible, that is, the in-degree of that node is not observed. The suggested estimation of the in-degree distribution is based on two approaches. The inversion approach exploits the relation between the original and sample in-degree distributions, and can estimate the bulk of the in-degree distribution, but not the tail of the distribution. The tail of the in-degree distribution is estimated through an asymptotic approach, which itself has two versions: one assuming a power-law tail and the other for a tail of general form. The two estimation approaches are examined on synthetic and real networks, with good performance results, especially striking for the asymptotic approach. Supplementary files for this article are available online.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available