☆ 4.7 Article

Online Data Organizer: Micro-Video Categorization by Structure-Guided Multimodal Dictionary Learning

IEEE TRANSACTIONS ON IMAGE PROCESSING (2019)

Journal

IEEE TRANSACTIONS ON IMAGE PROCESSING

Volume 28, Issue 3, Pages 1235-1247

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TIP.2018.2875363

Keywords

Micro-video organization; tree-guided constraints; multi-modal dictionary learning; online learning

Funding

National Basic Research Program of China (973) [2015CB352501, 2015CB352502]
National Natural Science Foundation of China [61772310, 61702300, 61702302]
One Thousand Talents Plan of China
Tencent AI Lab Rhino-Bird Joint Research Program [JR201805]
National Science Foundation of China [61429201]
ARO [W911NF-15-1-0290]
Faculty Research Gift Awards by NEC Laboratory of America
Faculty Research Gift Awards by NEC Laboratory of Blippar

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Micro-videos have rapidly become one of the most dominant trends in the era of social media. Accordingly, how to organize them draws our attention. Distinct from the traditional long videos that would have multi-site scenes and tolerate the hysteresis, a micro-video: 1) usually records contents at one specific venue within a few seconds. The venues are structured hierarchically regarding their category granularity. This motivates us to organize the micro-videos via their venue structure. 2) timely circulates over social networks. Thus, the timeliness of micro-videos desires effective online processing. However, only 1.22% of micro-videos are labeled with venue information when uploaded at the mobile end. To address this problem, we present a framework to organize the micro-videos online. In particular, we first build a structure-guided multi-modal dictionary learning model to learn the concept-level micro-video representation by jointly considering their venue structure and modality relatedness. We then develop an online learning algorithm to incrementally and efficiently strengthen our model, as well as categorize the micro-videos into a tree structure. Extensive experiments on a real-world data set validate our model well. In addition, we have released the codes to facilitate the research in the community.

Online Data Organizer: Micro-Video Categorization by Structure-Guided Multimodal Dictionary Learning

Journal

IEEE TRANSACTIONS ON IMAGE PROCESSING

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Online Data Organizer: Micro-Video Categorization by Structure-Guided Multimodal Dictionary Learning

Journal

IEEE TRANSACTIONS ON IMAGE PROCESSING

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper