4.4 Article

HIERARCHICAL RELATIONAL MODELS FOR DOCUMENT NETWORKS

Journal

ANNALS OF APPLIED STATISTICS
Volume 4, Issue 1, Pages 124-150

Publisher

INST MATHEMATICAL STATISTICS
DOI: 10.1214/09-AOAS309

Keywords

Mixed-membership models; variational methods; text analysis; network models

Funding

  1. ONR [175-6343]
  2. NSF [0745520]
  3. Google
  4. Microsoft
  5. Direct For Computer & Info Scie & Enginr
  6. Div Of Information & Intelligent Systems [GRANTS:14026548, 0745520] Funding Source: National Science Foundation

Ask authors/readers for more resources

We develop the relational topic model (RTM), a hierarchical model of both network structure and node attributes. We focus on document networks, where the attributes of each document are its words, that is, discrete observations taken from a fixed vocabulary. For each pair of documents, the RTM models their link as a binary random variable that is conditioned on their contents. The model can be used to summarize a network of documents, predict links between them, and predict words within them. We derive efficient inference and estimation algorithms based on variational methods that take advantage of sparsity and scale with the number of links. We evaluate the predictive performance of the RTM for large networks of scientific abstracts, web documents, and geographically tagged news.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available