Related references
Note: Only part of the references are listed.Video Description: A Survey of Methods, Datasets, and Evaluation Metrics
Nayyer Aafaq et al.
ACM COMPUTING SURVEYS (2020)
STAT: Spatial-Temporal Attention Mechanism for Video Captioning
Chenggang Yan et al.
IEEE TRANSACTIONS ON MULTIMEDIA (2020)
Video Captioning With Object-Aware Spatio-Temporal Correlation and Aggregation
Junchao Zhang et al.
IEEE TRANSACTIONS ON IMAGE PROCESSING (2020)
Rich Visual and Language Representation with Complementary Semantics for Video Captioning
Pengjie Tang et al.
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS (2019)
MULTI-MODAL REPRESENTATION LEARNING FOR SHORT VIDEO UNDERSTANDING AND RECOMMENDATION
Daya Guo et al.
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW) (2019)
Streamlined Dense Video Captioning
Jonghwan Mun et al.
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) (2019)
LaSO: Label-Set Operations networks for multi-label few-shot learning
Amit Alfassy et al.
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) (2019)
Adversarial Inference for Multi-Sentence Video Description
Jae Sung Park et al.
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) (2019)
Visual to Text: Survey of Image and Video Captioning
Sheng Li et al.
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE (2019)
Image Captioning with Affective Guiding and Selective Attention
Anqi Wang et al.
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS (2018)
Efficient Video Encoding for Automatic Video Analysis in Distributed Wireless Surveillance Systems
Lingchao Kong et al.
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS (2018)
Video Captioning With Attention-Based LSTM and Semantic Consistency
Lianli Gao et al.
IEEE TRANSACTIONS ON MULTIMEDIA (2017)
StyleNet: Generating Attractive Visual Captions with Styles
Chuang Gan et al.
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)
Dense-Captioning Events in Videos
Ranjay Krishna et al.
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)
Semantic Compositional Networks for Visual Captioning
Zhe Gan et al.
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)
Video Captioning with Guidance of Multimodal Latent Topics
Shizhe Chen et al.
PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17) (2017)
Near-Lossless Semantic Video Summarization and Its Applications to Video Analysis
Tao Mei et al.
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS (2013)