Deep video representation learning: a survey

MULTIMEDIA TOOLS AND APPLICATIONS (2023)

Journal

MULTIMEDIA TOOLS AND APPLICATIONS

Volume -, Issue -, Pages -

Publisher

SPRINGER

DOI: 10.1007/s11042-023-17815-3

Keywords

Video representation learning; Feature modeling; Video feature extraction; Feature learning

Categories

Computer Science, Information Systems; Computer Science, Software Engineering; Computer Science, Theory & Methods; Engineering, Electrical & Electronic

Summary

This paper reviews representation learning for videos, discussing recent spatio-temporal feature learning methods and comparing their advantages and disadvantages for general video analysis. It emphasizes the importance of building effective video features in computer vision tasks and summarizes the effectiveness and challenges of existing spatial and temporal features.

Abstract

This paper provides a review of representation learning for videos. We classify recent spatio-temporal feature learning methods for sequential visual data and compare their pros and cons for general video analysis. Building effective features for videos is a fundamental problem in computer vision tasks involving video analysis and understanding. Existing features can generally be categorized into spatial and temporal features. Their effectiveness under variations in illumination, occlusion, view, and background is discussed. Finally, we discuss the remaining challenges in existing deep video representation learning studies.
