相关参考文献
注意:仅列出部分参考文献,下载原文获取全部文献信息。Dual-CNN: A Convolutional language decoder for paragraph image captioning
Ruifan Li et al.
NEUROCOMPUTING (2020)
Spatiotemporal saliency-based multi-stream networks with attention-aware LSTM for action recognition
Zhenbing Liu et al.
NEURAL COMPUTING & APPLICATIONS (2020)
Evolutionary recurrent neural network for image captioning
Hanzhang Wang et al.
NEUROCOMPUTING (2020)
Discriminative deep multi-task learning for facial expression recognition
Hao Zheng et al.
INFORMATION SCIENCES (2020)
Background-foreground interaction for moving object detection in dynamic scenes
Zhe Chen et al.
INFORMATION SCIENCES (2019)
DAA: Dual LSTMs with adaptive attention for image captioning
Fen Xiao et al.
NEUROCOMPUTING (2019)
Recurrent convolutional video captioning with global and local attention
Tao Jin et al.
NEUROCOMPUTING (2019)
Reconstruction Network for Video Captioning
Bairui Wang et al.
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2018)
M3: Multimodal Memory Modelling for Video Captioning
Junbo Wang et al.
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2018)
Video Captioning With Attention-Based LSTM and Semantic Consistency
Lianli Gao et al.
IEEE TRANSACTIONS ON MULTIMEDIA (2017)
Group-Based Alternating Direction Method of Multipliers for Distributed Linear Classification
Huihui Wang et al.
IEEE TRANSACTIONS ON CYBERNETICS (2017)
Dual Learning for Cross-domain Image Captioning
Wei Zhao et al.
CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT (2017)
Deep Dual Learning for Semantic Image Segmentation
Ping Luo et al.
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky et al.
INTERNATIONAL JOURNAL OF COMPUTER VISION (2015)
Feature selection for least squares projection twin support vector machine
Jianhui Guo et al.
NEUROCOMPUTING (2014)
Dynamic multi-objective evolution of classifier ensembles for video face recognition
Jean-Francois Connolly et al.
APPLIED SOFT COMPUTING (2013)
Natural language description of human activities from video images based on concept hierarchy of actions
A Kojima et al.
INTERNATIONAL JOURNAL OF COMPUTER VISION (2002)