4.7 Article Proceedings Paper

UTOPIAN: User-Driven Topic Modeling Based on Interactive Nonnegative Matrix Factorization

Journal

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TVCG.2013.212

Keywords

Latent Dirichlet allocation; nonnegative matrix factorization; topic modeling; visual analytics; interactive clustering; text analytics

Funding

  1. National Science Foundation (NSF) [CCF-0808863, IIS-1242304, IIS-1231742]
  2. Defense Advanced Research Projects Agency (DARPA) XDATA program [FA8750-12-2-0309]
  3. Division of Computing and Communication Foundations
  4. Direct For Computer & Info Scie & Enginr [0808863] Funding Source: National Science Foundation
  5. Div Of Information & Intelligent Systems
  6. Direct For Computer & Info Scie & Enginr [1231742, 1242304] Funding Source: National Science Foundation

Ask authors/readers for more resources

Topic modeling has been widely used for analyzing text document collections. Recently, there have been significant advancements in various topic modeling techniques, particularly in the form of probabilistic graphical modeling. State-of-the-art techniques such as Latent Dirichlet Allocation (LDA) have been successfully applied in visual text analytics. However, most of the widely-used methods based on probabilistic modeling have drawbacks in terms of consistency from multiple runs and empirical convergence. Furthermore, due to the complicatedness in the formulation and the algorithm, LDA cannot easily incorporate various types of user feedback. To tackle this problem, we propose a reliable and flexible visual analytics system for topic modeling called UTOPIAN (User-driven Topic modeling based on Inter active Nonnegative Matrix Factorization). Centered around its semi-supervised formulation, UTOPIAN enables users to interact with the topic modeling method and steer the result in a user-driven manner. We demonstrate the capability of UTOPIAN via several usage scenarios with real-world document corpuses such as InfoVis/VAST paper data set and product review data sets.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available