Article

VMLH: Efficient Video Moment Location via Hashing

Related references

Note: Only some of the references are listed.
Article Computer Science, Artificial Intelligence

Distilled Siamese Networks for Visual Tracking

Jianbing Shen et al.

Summary: This paper introduces a distilled Siamese tracking framework that learns small, fast, and accurate trackers from large Siamese trackers through a teacher-student knowledge distillation model. By combining teacher-student distillation with student-student knowledge sharing, the framework achieves high compression rates and frame rates while maintaining tracking accuracy.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)

Article Computer Science, Artificial Intelligence

Segmenting Objects From Relational Visual Data

Xiankai Lu et al.

Summary: In this article, the authors propose an attentive graph neural network (AGNN) for pixelwise object segmentation tasks (i.e., automatic video segmentation, image co-segmentation, and few-shot semantic segmentation) in relational visual data. AGNN effectively captures knowledge from the relational visual data through iterative information fusion, leading to accurate object discovery and segmentation.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)

Article Engineering, Electrical & Electronic

Adaptive Region Proposal With Channel Regularization for Robust Object Tracking

Xiankai Lu et al.

Summary: The paper proposes an adaptive region proposal scheme with feature channel regularization for robust object tracking. By integrating correlation filters and adaptively learned region proposals, an enhanced two-stream tracking framework is presented to address tracking failures and scale estimation problems. Extensive experimental validations demonstrate the effectiveness of the proposed method against state-of-the-art tracking algorithms.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2021)

Article Computer Science, Artificial Intelligence

Video Moment Localization via Deep Cross-Modal Hashing

Yupeng Hu et al.

Summary: Video moment localization, an important branch of video content analysis, has attracted attention from both industry and academia. It faces challenges in temporal context modeling, intelligent moment candidate generation, and efficiency and scalability. To address these challenges, the paper proposes a deep end-to-end cross-modal hashing network and demonstrates its superiority through experimental results.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Language-driven Temporal Activity Localization: A Semantic Matching Reinforcement Learning Model

Weining Wang et al.

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) (2019)

Proceedings Paper Computer Science, Information Systems

Attentive Moment Retrieval in Videos

Meng Liu et al.

ACM/SIGIR PROCEEDINGS 2018 (2018)

Proceedings Paper Computer Science, Artificial Intelligence

Dense-Captioning Events in Videos

Ranjay Krishna et al.

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Localizing Moments in Video with Natural Language

Lisa Anne Hendricks et al.

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

TALL: Temporal Activity Localization via Language Query

Jiyang Gao et al.

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

Joao Carreira et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Article Computer Science, Artificial Intelligence

Sequential Compact Code Learning for Unsupervised Image Hashing

Li Liu et al.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2016)

Proceedings Paper Computer Science, Artificial Intelligence

Fast R-CNN

Ross Girshick

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2015)

Proceedings Paper Computer Science, Artificial Intelligence

Learning Spatiotemporal Features with 3D Convolutional Networks

Du Tran et al.

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2015)