4.7 Article

Interactive Video Retrieval in the Age of Deep Learning - Detailed Evaluation of VBS 2019

期刊

IEEE TRANSACTIONS ON MULTIMEDIA
卷 23, 期 -, 页码 243-256

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TMM.2020.2980944

关键词

Task analysis; Visualization; Browsers; Annotations; Deep learning; Semantics; Tools; Interactive video retrieval; video browsing; video content analysis; content-based retrieval; evaluations

资金

  1. Czech Science Foundation (GACR) [19-22071Y]
  2. European Unions Horizon 2020 research and innovation programme: V4Design [779962]
  3. MARCONI [761802]

向作者/读者索取更多资源

This study provides a detailed analysis of the results and outcomes of the 8th iteration of the Video Browser Showdown (VBS), focusing on the new dataset, task performances, and search system features of six international teams. The analysis delves into per-team success ratio, search strategies, and popular features, revealing valuable insights such as the significance of textual search and the trend towards deep learning features.
Despite the fact that automatic content analysis has made remarkable progress over the last decade - mainly due to significant advances in machine learning - interactive video retrieval is still a very challenging problem, with an increasing relevance in practical applications. The Video Browser Showdown (VBS) is an annual evaluation competition that pushes the limits of interactive video retrieval with state-of-the-art tools, tasks, data, and evaluation metrics. In this paper, we analyse the results and outcome of the 8th iteration of the VBS in detail. We first give an overview of the novel and considerably larger V3C1 dataset and the tasks that were performed during VBS 2019. We then go on to describe the search systems of the six international teams in terms of features and performance. And finally, we perform an in-depth analysis of the per-team success ratio and relate this to the search strategies that were applied, the most popular features, and problems that were experienced. A large part of this analysis was conducted based on logs that were collected during the competition itself. This analysis gives further insights into the typical search behavior and differences between expert and novice users. Our evaluation shows that textual search and content browsing are the most important aspects in terms of logged user interactions. Furthermore, we observe a trend towards deep learning based features, especially in the form of labels generated by artificial neural networks. But nevertheless, for some tasks, very specific content-based search features are still being used. We expect these findings to contribute to future improvements of interactive video search systems.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据