3.8 Proceedings Paper

RegionCLIP: Region-based Language-Image Pretraining

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Proceedings Paper Computer Science, Artificial Intelligence

Learning to Generate Scene Graph from Natural Language Supervision

Yiwu Zhong et al.

Summary: This paper introduces a method that learns from image-sentence pairs to extract a graphical representation of localized objects and their relationships within an image, known as a scene graph. By leveraging an off-the-shelf object detector and designing a Transformer-based model to predict pseudo labels, the model achieves strong results for weakly and fully supervised scene graph generation tasks. The experiment results show a 30% relative gain over the latest method trained with human-annotated unlocalized scene graphs.

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) (2021)

Proceedings Paper Computer Science, Artificial Intelligence

PreDet: Large-scale weakly supervised pre-training for detection

Vignesh Ramanathan et al.

Summary: This study introduces a new large-scale pre-training strategy for object detection, augmenting standard classification pre-training by introducing noisy class labels and a detection-specific pretext task. By redesigning Faster R-CNN modules to efficiently perform this task, significant improvements over existing weakly-supervised and self-supervised pre-training approaches in detection accuracy and fine-tuning speed were shown.

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) (2021)

Article Computer Science, Artificial Intelligence

Zero-Shot Object Detection: Joint Recognition and Localization of Novel Concepts

Shafin Rahman et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2020)

Proceedings Paper Computer Science, Artificial Intelligence

R-FCN-3000 at 30fps: Decoupling Detection and Classification

Bharat Singh et al.

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2018)

Article Computer Science, Artificial Intelligence

Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations

Ranjay Krishna et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Mask R-CNN

Kaiming He et al.

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Webly Supervised Learning of Convolutional Networks

Xinlei Chen et al.

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2015)

Proceedings Paper Computer Science, Artificial Intelligence

Learning Everything about Anything: Webly-Supervised Visual Concept Learning

Santosh K. Divvala et al.

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2014)