Article

Large Scale Network Embedding: A Separable Approach

Journal

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TKDE.2020.3002700

Keywords

Network embedding; matrix factorization; separability

Funding

  1. National Natural Science Foundation of China [61876006]

This paper introduces SepNE, a network embedding algorithm that learns representations for different subsets of nodes independently, in separated processes, which makes it scalable to large networks. Its objective function preserves both local and global information, and the paper discusses several methods for incorporating high-order proximities into the embeddings.
Many successful methods have been proposed for learning low-dimensional representations on large-scale networks, but almost all of them are designed as inseparable processes: they learn embeddings for the entire network even when only a small proportion of nodes is of interest. This is a great inconvenience, especially on large-scale or dynamic networks, where these methods become almost impossible to apply. In this paper, we formalize the problem of separated matrix factorization (SMF), based on which we elaborate a novel objective function that preserves both local and global information. We compare our SMF framework with approximate SVD algorithms and demonstrate that SMF can capture more information when factorizing a given matrix. We further propose SepNE, a simple and flexible network embedding algorithm that independently learns representations for different subsets of nodes in separated processes. By implementing separability, our algorithm avoids the redundant effort of embedding irrelevant nodes, yielding scalability to large networks. To further incorporate complex information into SepNE, we discuss several methods that can be used to leverage high-order proximities in large networks. We demonstrate the effectiveness of SepNE on several real-world networks of different scales and subjects. With comparable accuracy, our approach significantly outperforms state-of-the-art baselines in running time on large networks.
