4.8 Article

A Comprehensive Survey of Scene Graphs: Generation and Application

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Article Computer Science, Artificial Intelligence

ZeroNAS: Differentiable Generative Adversarial Networks Search for Zero-Shot Learning

Caixia Yan et al.

Summary: This paper proposes a method called ZeroNAS that integrates neural architecture search (NAS) techniques into zero-shot learning (ZSL). It uses adversarial training to search for desirable architectures in a specially designed search space for GANs. Extensive experiments show that ZeroNAS outperforms state-of-the-art ZSL and GZSL approaches.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)

Article Computer Science, Artificial Intelligence

Toward Region-Aware Attention Learning for Scene Graph Generation

An-An Liu et al.

Summary: This article proposes a region-aware attention learning method to explicitly construct the attention space for exploring salient regions with object and predicate inferences, improving upon existing works that mainly focus on coarse-grained features.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2022)

Article Computer Science, Theory & Methods

A Survey of Deep Active Learning

Pengzhen Ren et al.

Summary: Researchers have shown relatively lower interest in active learning compared to deep learning, but with the increasing demand for large-scale high-quality annotated datasets, active learning is receiving more attention. This article provides a comprehensive survey on deep active learning, including a formal classification method, an overview of existing work, and an analysis of developments from an application perspective.

ACM COMPUTING SURVEYS (2022)

Article Computer Science, Artificial Intelligence

Learning to transfer focus of graph neural network for scene graph parsing

Junjie Jiang et al.

Summary: Scene graph parsing is a challenging task in image understanding and pattern recognition. The proposed graphical focal network aims to improve the recognition rate of semantic relationships by capturing dependencies between object and relationship detectors. Through adjusting loss proportions and introducing depth and layout information, the method outperforms competitors in Visual Genome benchmark.

PATTERN RECOGNITION (2021)

Article Computer Science, Artificial Intelligence

Contextual Translation Embedding for Visual Relationship Detection and Scene Graph Generation

Zih-Siou Hung et al.

Summary: The proposed translation embedding model with context augmentation captures both common and rare relations effectively, outperforming previous models in comprehensive evaluations on challenging benchmarks. It achieves promising results for the task of scene graph generation and comes close to or exceeds the state of the art across a range of settings.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2021)

Article Robotics

Kimera: From SLAM to spatial perception with 3D dynamic scene graphs

Antoni Rosinol et al.

Summary: The article highlights the differences in perception between humans and robots, introducing a novel representation method - 3D dynamic scene graph (DSG), and developing Kimera for automatic construction of DSG from visual-inertial data. The research also includes a comprehensive evaluation of Kimera in real-life datasets and simulations, showing its competitive performance in real-time environment reconstruction and path planning.

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH (2021)

Article Computer Science, Artificial Intelligence

Scene Graph Generation With Hierarchical Context

Guanghui Ren et al.

Summary: This paper discusses the importance of enhancing predicate representations for scene graph generation, analyzes the key factors affecting relation detection results, and proposes a hierarchical context network (HCNet) for scene graph generation.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Linguistic Structures as Weak Supervision for Visual Scene Graph Generation

Keren Ye et al.

Summary: This study explores how linguistic structures in captions can benefit scene graph generation. Captions, as a weaker type of supervision than triplets, are more scalable due to the large and diverse sources of multimodal data on the web.

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Energy-Based Learning for Scene Graph Generation

Mohammed Suhail et al.

Summary: A novel energy-based learning framework is proposed for generating scene graphs, leading to significant performance improvements on the Visual Genome and GQA benchmark datasets. The framework efficiently incorporates the structure of scene graphs and allows models to learn efficiently with a small number of labels.

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 (2021)

Proceedings Paper Computer Science, Artificial Intelligence

BGT-Net: Bidirectional GRU Transformer Network for Scene Graph Generation

Naina Dhingra et al.

Summary: In this study, a bidirectional GRU transformer network (BGT-Net) is proposed for scene graph generation in images, which enhances object prediction accuracy through novel object-object communication and information sharing.

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021 (2021)

Article Computer Science, Artificial Intelligence

Visual Social Relationship Recognition

Junnan Li et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2020)

Article Automation & Control Systems

3-D Scene Graph: A Sparse and Semantic Representation of Physical Environments for Intelligent Agents

Ue-Hwan Kim et al.

IEEE TRANSACTIONS ON CYBERNETICS (2020)

Article Computer Science, Artificial Intelligence

A hierarchical recurrent approach to predict scene graphs from a visual-attention-oriented perspective

Wenjing Gao et al.

COMPUTATIONAL INTELLIGENCE (2019)

Article Computer Science, Information Systems

Know More Say Less: Image Captioning Based on Scene Graphs

Xiangyang Li et al.

IEEE TRANSACTIONS ON MULTIMEDIA (2019)

Article Computer Science, Information Systems

Scene graph captioner: Image captioning based on structural visual representation

Ning Xu et al.

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION (2019)

Proceedings Paper Computer Science, Artificial Intelligence

VrR-VG: Refocusing Visually-Relevant Relationships

Yuanzhi Liang et al.

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) (2019)

Proceedings Paper Computer Science, Artificial Intelligence

Exploring Context and Visual Pattern of Relationship for Scene Graph Generation

Wenbin Wang et al.

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) (2019)

Proceedings Paper Computer Science, Artificial Intelligence

Scene Graph Generation with External Knowledge and Image Reconstruction

Jiuxiang Gu et al.

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) (2019)

Proceedings Paper Computer Science, Artificial Intelligence

Attentive Relational Networks for Mapping Images to Scene Graphs

Mengshi Qi et al.

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) (2019)

Proceedings Paper Imaging Science & Photographic Technology

DEEPLY SUPERVISED MULTIMODAL ATTENTIONAL TRANSLATION EMBEDDINGS FOR VISUAL RELATIONSHIP DETECTION

Nikolaos Gkanatsios et al.

2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) (2019)

Proceedings Paper Computer Science, Theory & Methods

Adversarial Adaptation of Scene Graph Models for Understanding Civic Issues

Shanu Kumar et al.

WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019) (2019)

Proceedings Paper Computer Science, Interdisciplinary Applications

Explainable Video Action Reasoning via Prior Knowledge and State Transitions

Tao Zhuo et al.

PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19) (2019)

Proceedings Paper Computer Science, Software Engineering

MULTI-GRANULARITY REASONING FOR SOCIAL RELATION RECOGNITION FROM IMAGES

Meng Zhang et al.

2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME) (2019)

Proceedings Paper Computer Science, Software Engineering

PANET: A CONTEXT BASED PREDICATE ASSOCIATION NETWORK FOR SCENE GRAPH GENERATION

Yunian Chen et al.

2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME) (2019)

Article Engineering, Electrical & Electronic

T-CNN: Tubelets With Convolutional Neural Networks for Object Detection From Videos

Kai Kang et al.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2018)

Article Computer Science, Software Engineering

Narrative Collage of Image Collections by Scene Graph Recombination

Fei Fang et al.

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS (2018)

Proceedings Paper Computer Science, Artificial Intelligence

Tensorize, Factorize and Regularize: Robust Visual Relationship Learning

Seong Jae Hwang et al.

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2018)

Proceedings Paper Computer Science, Artificial Intelligence

Iterative Visual Reasoning Beyond Convolutions

Xinlei Chen et al.

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2018)

Article Computer Science, Artificial Intelligence

Image Understanding using vision and reasoning through Scene Description Graph

Somak Aditya et al.

COMPUTER VISION AND IMAGE UNDERSTANDING (2018)

Article Computer Science, Artificial Intelligence

Weakly Supervised Multimodal Kernel for Categorizing Aerial Photographs

Yingjie Xia et al.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2017)

Article Computer Science, Artificial Intelligence

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Shaoqing Ren et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2017)

Article Computer Science, Artificial Intelligence

Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations

Ranjay Krishna et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2017)

Article Geography, Physical

On support relations and semantic scene graphs

Michael Ying Yang et al.

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Weakly-supervised learning of visual relations

Julia Peyre et al.

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

ViP-CNN: Visual Phrase Guided Convolutional Neural Network

Yikang Li et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Deep Variation-structured Reinforcement Learning for Visual Relationship and Attribute Detection

Xiaodan Liang et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Object Detection in Videos with Tubelet Proposal Networks

Kai Kang et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Detecting Visual Relationships with Deep Relational Networks

Bo Dai et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Visual Translation Embedding Network for Visual Relation Detection

Hanwang Zhang et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Mask R-CNN

Kaiming He et al.

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Object Detection from Video Tubelets with Convolutional Neural Networks

Kai Kang et al.

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2016)

Proceedings Paper Computer Science, Artificial Intelligence

HICO: A Benchmark for Recognizing Human-Object Interactions in Images

Yu-Wei Chao et al.

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2015)

Article Computer Science, Information Systems

Image Re-Attentionizing

Tam V. Nguyen et al.

IEEE TRANSACTIONS ON MULTIMEDIA (2013)

Article Computer Science, Artificial Intelligence

Invariant Scattering Convolution Networks

Joan Bruna et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2013)

Article Computer Science, Artificial Intelligence

The Graph Neural Network Model

Franco Scarselli et al.

IEEE TRANSACTIONS ON NEURAL NETWORKS (2009)