期刊
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP)
卷 -, 期 -, 页码 879-883出版社
IEEE
DOI: 10.1109/icip.2019.8802944
关键词
Video Search; Multi-label; Deep Metric Learning; Feature Composition
资金
- Agency for Science, Technology and Research (A*STAR) under its Hardware-Software Co-optimisation for Deep Learning [A1892b0026]
- Singapore Ministry of Education Tier-2 Fund [MOE2016-T2-2-057(S)]
In this paper, we propose Deep Holographic Networks (DHN) to learn similarity metrics of videos for multi-label video search. DHN introduces a holographic composition layer to explicitly encode similarity metrics at intermediate layer of the network, instead of conventional deep metric learning approaches driven by ranking losses. The holographic composition layer is parameter-free and enables less memory footprint compared with state-of-the-art. Towards multi-label video search at large scale, we present a new video benchmark built upon the YouTube-8M dataset. Extensive evaluations on this dataset demonstrate that DHN performs better than traditional deep metric learning approaches as well as other compositional networks.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据