☆ 4.6 Article

DC programming for solving a sparse modeling problem of video key frame extraction

DIGITAL SIGNAL PROCESSING (2018)

期刊

DIGITAL SIGNAL PROCESSING

卷 83, 期 -, 页码 214-222

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE

DOI: 10.1016/j.dsp.2018.08.005

关键词

Sparse representation; Key frame; Determinant; DC programming; Convolutional neural network

类别

Engineering, Electrical & Electronic

资金

New Energy and Industrial Technology Development Organization (NEDO), Japan, JST CREST [JPMJCR15E2]
JST SICORP
JSPS KAKENHI [26730130]
Grants-in-Aid for Scientific Research [26730130] Funding Source: KAKEN

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Extracting key frames from a video can reduce redundancies in continuous scenes and pithy represent the entire video. This technique copes with the issue of how to efficiently manage a large amount of video data and has many applications such as video indexing, querying, and browsing. Current key frame extraction methods often utilize sparse modeling, which assumes that each video frame can be expressed as a linear combination of a few representative key frames. Although these methods are successful, they are less aware of the fact that videos have spatio-temporal structure and used l(1) norm based sparsity measure. In this paper, we propose a key frame extraction method that considers spatio-temporal information of videos. We employ a sparsity measure that is based on the determinant of the Gram matrix computed from entire video frames. The determinant sparsity measure is computed for the entire matrix, not for single frames, and thus it reflects the spatial or joint sparseness of the entire video. With this measure, the solutions are sparser than conventional sparseness measures but the cost function is non-convex and optimization becomes harder. Then, utilizing the fact that the proposed cost function is the difference of two convex functions, we employ an efficient algorithm based on the difference-of-convex (DC) programming, which often finds the global solution of the non-convex cost function. Experiments show that the proposed algorithm generates high-quality, high-compression key frames, compared with the existing key frame extraction method with the l(1) norm. The determinant sparsity measure was recently proposed and can be utilized for more video processing applications such as video saliency detection. (C) 2018 Elsevier Inc. All rights reserved.

DC programming for solving a sparse modeling problem of video key frame extraction

期刊

DIGITAL SIGNAL PROCESSING

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

DC programming for solving a sparse modeling problem of video key frame extraction

期刊

DIGITAL SIGNAL PROCESSING

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文