4.7 Article

Large deviations of semisupervised learning in the stochastic block model

Journal

PHYSICAL REVIEW E
Volume 105, Issue 3, Pages -

Publisher

AMER PHYSICAL SOC
DOI: 10.1103/PhysRevE.105.034108

Keywords

-

Funding

  1. ERC under European Union [714608-SMiLe]

Ask authors/readers for more resources

In semisupervised community detection, knowing the membership of a set of revealed nodes can lead to better inference accuracies. This study focuses on correlated subsets in the dense stochastic block model, showing a nonmonotonic relationship between reconstruction accuracy and free energy. The findings have potential implications for active learning applications in community detection.
In semisupervised community detection, the membership of a set of revealed nodes is known in addition to the graph structure and can be leveraged to achieve better inference accuracies. While previous works investigated the case where the revealed nodes are selected at random, this paper focuses on correlated subsets leading to atypically high accuracies. In the framework of the dense stochastic block model, we employ statistical physics methods to derive a large deviation analysis of the number of these rare subsets, as characterized by their free energy. We find theoretical evidence of a nonmonotonic relationship between reconstruction accuracy and the free energy associated to the posterior measure of the inference problem. We further discuss possible implications for active learning applications in community detection.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available