4.6 Article

Robust graph regularization nonnegative matrix factorization for link prediction in attributed networks

Journal

MULTIMEDIA TOOLS AND APPLICATIONS
Volume 82, Issue 3, Pages 3745-3768

Publisher

SPRINGER
DOI: 10.1007/s11042-022-12943-8

Keywords

Complex network; Link prediction; Nonnegative matrix factorization; Attributed network

Ask authors/readers for more resources

Link prediction is a widely studied problem in complex network analysis. The existing methods often overlook the potential of nodal attributes and only focus on the network's topological structure. To address this limitation, a novel method called RGNMF-AN was proposed, which models both the topological structure and nodal attributes for link prediction. The method combines network topology and nodal attribute information and calculates high-order proximities using the SARWS method. Empirical findings on real-world complex network datasets show that the combination of attributed and topological information significantly improves prediction performance compared to baseline and other NMF-based algorithms.
Link prediction is one of the most widely studied problems in the area of complex network analysis, in which machine learning techniques can be applied to deal with it. The biggest drawback of the existing methods, however, is that in most cases they only consider the topological structure of the network, and therefore completely miss out on the great potential that stems from the nodal attributes. Both topological structure and nodes' attributes are essential in predicting the evolution of attributed networks and can act as complements to each other. To bring out their full potential in solving the link prediction problem, a novel Robust Graph Regularization Nonnegative Matrix Factorization for Attributed Networks (RGNMF-AN) was proposed, which models not only the topology structure of networks but also their node attributes for direct link prediction. This model, in particular, combines two types of information, namely network topology, and nodal attributes information, and calculates high-order proximities between nodes using the Structure-Attribute Random Walk Similarity (SARWS) method. The SARWS score matrix is an indicator structural and attributed matrix that collects more useful attributed information in high-order proximities, whereas graph regularization technology combines the SARWS score matrix with topological and attribute information to collect more valuable attributed information in high-order proximities. Furthermore, the RGNMF-AN employs the l(2,1)-norm to constrain the loss function and regularization terms, effectively removing random noise and spurious links. According to empirical findings on nine real-world complex network datasets, the use of a combination of attributed and topological information in tandem enhances the prediction performance significantly compared to the baseline and other NMF-based algorithms.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available