4.7 Article

Online Data Organizer: Micro-Video Categorization by Structure-Guided Multimodal Dictionary Learning

Journal

IEEE TRANSACTIONS ON IMAGE PROCESSING
Volume 28, Issue 3, Pages 1235-1247

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TIP.2018.2875363

Keywords

Micro-video organization; tree-guided constraints; multi-modal dictionary learning; online learning

Funding

  1. National Basic Research Program of China (973) [2015CB352501, 2015CB352502]
  2. National Natural Science Foundation of China [61772310, 61702300, 61702302]
  3. One Thousand Talents Plan of China
  4. Tencent AI Lab Rhino-Bird Joint Research Program [JR201805]
  5. National Science Foundation of China [61429201]
  6. ARO [W911NF-15-1-0290]
  7. Faculty Research Gift Awards by NEC Laboratory of America
  8. Faculty Research Gift Awards by NEC Laboratory of Blippar

Ask authors/readers for more resources

Micro-videos have rapidly become one of the most dominant trends in the era of social media. Accordingly, how to organize them draws our attention. Distinct from the traditional long videos that would have multi-site scenes and tolerate the hysteresis, a micro-video: 1) usually records contents at one specific venue within a few seconds. The venues are structured hierarchically regarding their category granularity. This motivates us to organize the micro-videos via their venue structure. 2) timely circulates over social networks. Thus, the timeliness of micro-videos desires effective online processing. However, only 1.22% of micro-videos are labeled with venue information when uploaded at the mobile end. To address this problem, we present a framework to organize the micro-videos online. In particular, we first build a structure-guided multi-modal dictionary learning model to learn the concept-level micro-video representation by jointly considering their venue structure and modality relatedness. We then develop an online learning algorithm to incrementally and efficiently strengthen our model, as well as categorize the micro-videos into a tree structure. Extensive experiments on a real-world data set validate our model well. In addition, we have released the codes to facilitate the research in the community.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available