4.4 Article

Relation Strength-Aware Clustering of Heterogeneous Information Networks with Incomplete Attributes

期刊

PROCEEDINGS OF THE VLDB ENDOWMENT
卷 5, 期 5, 页码 394-405

出版社

ASSOC COMPUTING MACHINERY
DOI: 10.14778/2140436.2140437

关键词

-

资金

  1. U.S. Army Research Laboratory [W911NF-09-2-0053]
  2. MIAS
  3. DHS-IDS Center for Multimodal Information Access and Synthesis at UIUC
  4. U.S. National Science Foundation [IIS-0905215]
  5. U.S. Air Force Office of Scientific Research MURI [FA9550-08-1-0265]

向作者/读者索取更多资源

With the rapid development of online social media, online shopping sites and cyber-physical systems, heterogeneous information networks have become increasingly popular and content-rich over time. In many cases, such networks contain multiple types of objects and links, as well as different kinds of attributes. The clustering of these objects can provide useful insights in many applications. However, the clustering of such networks can be challenging since (a) the attribute values of objects are often incomplete, which implies that an object may carry only partial attributes or even no attributes to correctly label itself; and (b) the links of different types may carry different kinds of semantic meanings, and it is a difficult task to determine the nature of their relative importance in helping the clustering for a given purpose. In this paper, we address these challenges by proposing a model-based clustering algorithm. We design a probabilistic model which clusters the objects of different types into a common hidden space, by using a user-specified set of attributes, as well as the links from different relations. The strengths of different types of links are automatically learned, and are determined by the given purpose of clustering. An iterative algorithm is designed for solving the clustering problem, in which the strengths of different types of links and the quality of clustering results mutually enhance each other. Our experimental results on real and synthetic data sets demonstrate the effectiveness and efficiency of the algorithm.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据