Article

HIERARCHICAL RELATIONAL MODELS FOR DOCUMENT NETWORKS

Journal

ANNALS OF APPLIED STATISTICS
Volume 4, Issue 1, Pages 124-150

Publisher

INST MATHEMATICAL STATISTICS
DOI: 10.1214/09-AOAS309

Keywords

Mixed-membership models; variational methods; text analysis; network models

Funding

  1. ONR [175-6343]
  2. NSF [0745520]
  3. Google
  4. Microsoft
  5. Direct For Computer & Info Scie & Enginr
  6. Div Of Information & Intelligent Systems [GRANTS:14026548, 0745520] Funding Source: National Science Foundation

Abstract

We develop the relational topic model (RTM), a hierarchical model of both network structure and node attributes. We focus on document networks, where the attributes of each document are its words, that is, discrete observations taken from a fixed vocabulary. For each pair of documents, the RTM models their link as a binary random variable that is conditioned on their contents. The model can be used to summarize a network of documents, predict links between them, and predict words within them. We derive efficient inference and estimation algorithms based on variational methods that take advantage of sparsity and scale with the number of links. We evaluate the predictive performance of the RTM for large networks of scientific abstracts, web documents, and geographically tagged news.
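
The abstract describes the link between two documents as a binary random variable conditioned on their contents. As a minimal sketch of that idea, and not the paper's inference code, the following assumes each document is summarized by a vector of topic proportions and that the link probability is a logistic function of the weighted element-wise product of the two vectors. The names `link_probability`, `eta`, and `nu`, and all numeric values, are hypothetical and chosen only for illustration.

```python
import math


def link_probability(topics_a, topics_b, eta, nu):
    """Probability of a link between two documents (illustrative sketch).

    topics_a, topics_b: topic proportions of each document (assumed inputs).
    eta: per-topic weights; nu: intercept (both hypothetical parameters).
    """
    if not (len(topics_a) == len(topics_b) == len(eta)):
        raise ValueError("all vectors must have the same length")
    # Weighted element-wise product: large when both documents
    # put weight on the same topics.
    score = sum(e * a * b for e, a, b in zip(eta, topics_a, topics_b)) + nu
    # The logistic function maps the score to a probability in (0, 1).
    return 1.0 / (1.0 + math.exp(-score))


# Toy example with three topics (values made up for illustration).
doc_x = [0.8, 0.1, 0.1]
doc_y = [0.7, 0.2, 0.1]  # shares its dominant topic with doc_x
doc_z = [0.1, 0.1, 0.8]  # dominated by a different topic
eta = [5.0, 5.0, 5.0]
nu = -2.0

p_similar = link_probability(doc_x, doc_y, eta, nu)
p_different = link_probability(doc_x, doc_z, eta, nu)
```

With these toy values, the pair that shares a dominant topic receives the higher link probability, which is the kind of behavior a content-conditioned link model relies on when predicting links.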
