4.7 Article

Panel-Page-Aware Comic Genre Understanding

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Computer Science, Information Systems

Instance-sequence reasoning for video question answering

Rui Liu et al.

Summary: Video question answering requires understanding of video content and question language, as well as the integration of textual semantic with visual content. It involves reasoning with causal connections between instances and requires instance grounding and temporal localization. This paper proposes an instance-sequence reasoning network that embeds visual instances and textual representations into graph nodes, and utilizes graph causal convolution for visual grounding and instance-sequence reasoning.

FRONTIERS OF COMPUTER SCIENCE (2022)

Article Computer Science, Artificial Intelligence

Action Keypoint Network for Efficient Video Recognition

Xu Chen et al.

Summary: This paper proposes an Action Keypoint Network (AK-Net) that integrates temporal and spatial selection to improve the efficiency of video recognition models. AK-Net selects informative keypoints from arbitrary-shaped regions and transforms the video recognition into point cloud classification, providing two-fold benefits for efficiency. Experimental results demonstrate that AK-Net consistently improves the efficiency and performance of baseline methods on several video recognition benchmarks.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2022)

Article Computer Science, Information Systems

Multiple knowledge representation for big data artificial intelligence: framework, applications, and case studies

Yi Yang et al.

Summary: The paper introduces a multiple knowledge representation (MKR) framework and discusses its potential in developing big data artificial intelligence (AI) techniques. MKR makes current AI techniques more explainable and generalizable, while also expanding current AI techniques to facilitate the mutual benefits of different representations.

FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING (2021)

Article Computer Science, Information Systems

STAT: Spatial-Temporal Attention Mechanism for Video Captioning

Chenggang Yan et al.

IEEE TRANSACTIONS ON MULTIMEDIA (2020)

Article Automation & Control Systems

Describing Video With Attention-Based Bidirectional LSTM

Yi Bin et al.

IEEE TRANSACTIONS ON CYBERNETICS (2019)

Proceedings Paper Computer Science, Artificial Intelligence

Multi-Label Image Recognition with Graph Convolutional Networks

Zhao-Min Chen et al.

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) (2019)

Article Computer Science, Artificial Intelligence

Recurrent Spatial-Temporal Attention Network for Action Recognition in Videos

Wenbin Du et al.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2018)

Article Computer Science, Software Engineering

What Characterizes Personalities of Graphic Designs?

Nanxuan Zhao et al.

ACM TRANSACTIONS ON GRAPHICS (2018)

Proceedings Paper Computer Science, Artificial Intelligence

Comics Story Representation System Based on Genre

Yuki Daiku et al.

2018 13TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS) (2018)

Proceedings Paper Computer Science, Artificial Intelligence

Densely Connected Convolutional Networks

Gao Huang et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Attention-Based Multimodal Fusion for Video Description

Chiori Hori et al.

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Mask R-CNN

Kaiming He et al.

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

Joao Carreira et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)