Article

Object Detection in 20 Years: A Survey

Related references

Note: Only some of the references are listed.
Article Computer Science, Artificial Intelligence

TransVOD: End-to-End Video Object Detection With Spatial-Temporal Transformers

Qianyu Zhou et al.

Summary: This paper proposes an end-to-end video object detection system called TransVOD based on simple yet effective spatial-temporal Transformer architectures. It streamlines the current video object detection pipeline by eliminating the need for many hand-designed components, and achieves good performance on the ImageNet VID dataset.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Article Automation & Control Systems

Intelligent Small Object Detection for Digital Twin in Smart Manufacturing With Industrial Cyber-Physical Systems

Xiaokang Zhou et al.

Summary: This article focuses on the development of a small object detection model for digital twins, aiming to achieve dynamic synchronization and real-time estimation of environmental parameters. By constructing a hybrid deep neural network model and learning algorithm, efficient multi-type small object detection is achieved to facilitate process modeling, monitoring, and optimization in smart manufacturing.

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS (2022)

Article Computer Science, Artificial Intelligence

Weakly Supervised Object Detection Using Proposal- and Semantic-Level Relationships

Dingwen Zhang et al.

Summary: Weakly supervised object detection has received great attention in recent years in the computer vision community. However, existing approaches mostly focus on visual appearance and ignore the use of context information. This paper proposes a weakly supervised learning framework that incorporates proposal-level and semantic-level context, leading to improved learning performance through deep multiple instance reasoning. Experimental results demonstrate the superior performance of the proposed approach on widely used benchmarks.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)

Article Automation & Control Systems

Enhancing Geometric Factors in Model Learning and Inference for Object Detection and Instance Segmentation

Zhaohui Zheng et al.

Summary: The proposed CIoU loss and Cluster-NMS, both of which incorporate geometric factors, significantly improve average precision and average recall in object detection and instance segmentation without sacrificing inference efficiency.

IEEE TRANSACTIONS ON CYBERNETICS (2022)

Proceedings Paper Computer Science, Artificial Intelligence

PromptDet: Towards Open-Vocabulary Detection Using Uncurated Images

Chengjian Feng et al.

Summary: The goal of this work is to establish a scalable pipeline for expanding an object detector towards novel/unseen categories, using zero manual annotations. The proposed approach includes a two-stage open-vocabulary object detector, regional prompt learning to align visual and textual embeddings, and a self-training framework using online resources. The proposed detector, PromptDet, outperforms existing approaches with fewer training images and no manual annotations.

COMPUTER VISION, ECCV 2022, PT IX (2022)

Proceedings Paper Computer Science, Artificial Intelligence

RegionCLIP: Region-based Language-Image Pretraining

Yiwu Zhong et al.

Summary: Contrastive language-image pretraining (CLIP) has achieved impressive results in image classification tasks. However, directly applying CLIP models to object detection tasks leads to unsatisfactory performance due to domain shift. To address this issue, the paper proposes RegionCLIP, a method that enables fine-grained alignment between image regions and textual concepts.

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Bridged Transformer for Vision and Point Cloud 3D Object Detection

Yikai Wang et al.

Summary: 3D object detection is a crucial research topic in computer vision, and there is a trend of leveraging multiple sources of input data. However, the heterogeneous geometry of 2D and 3D representations prevents the direct application of pre-trained neural networks for multimodal fusion. To address this issue, the Bridged Transformer (BrT) is proposed, an end-to-end architecture that learns to identify 3D and 2D object bounding boxes from both points and image patches.

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2022)

Proceedings Paper Computer Science, Artificial Intelligence

QueryDet: Cascaded Sparse Query for Accelerating High-Resolution Small Object Detection

Chenhongyi Yang et al.

Summary: This paper proposes a query mechanism to speed up inference in feature-pyramid-based object detectors for small object detection. By predicting coarse locations on low-resolution features and computing accurate results on high-resolution features, the proposed method significantly improves both detection performance and inference speed.

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Implicit Motion Handling for Video Camouflaged Object Detection

Xuelian Cheng et al.

Summary: A new video camouflaged object detection framework that utilizes short-term dynamics and long-term temporal consistency is proposed. The method unifies motion estimation and object segmentation within a single optimization framework and improves predictions using a spatio-temporal transformer.

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2022)

Article Computer Science, Artificial Intelligence

CIR-Net: Cross-Modality Interaction and Refinement for RGB-D Salient Object Detection

Runmin Cong et al.

Summary: In this study, a novel convolutional neural network model, CIR-Net, is proposed for the RGB-D salient object detection task. By incorporating cross-modality interaction and refinement, and by inserting a refinement middleware structure between the encoder and decoder, the model effectively improves detection performance.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2022)

Article Computer Science, Artificial Intelligence

Weighted boxes fusion: Ensembling boxes from different object detection models

Roman Solovyev et al.

Summary: This study introduces a novel method, weighted boxes fusion, for combining predictions from different object detection models, significantly improving the quality of the ensemble's predicted rectangles. The method achieved top results on various datasets and challenges, and its 3D version was successfully applied by winning teams in specific competitions.

IMAGE AND VISION COMPUTING (2021)

Article Computer Science, Artificial Intelligence

STDnet-ST: Spatio-temporal ConvNet for small object detection

Brais Bosquet et al.

Summary: Object detection using convolutional neural networks has achieved unprecedented levels of accuracy, but there is still room for improvement in detecting small objects. Utilizing spatial information alongside temporal video data is a new trend that can potentially enhance overall object detection performance. STDnet-ST is an end-to-end spatio-temporal convolutional neural network designed for detecting small objects in video, achieving state-of-the-art results on various video datasets.

PATTERN RECOGNITION (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Domain-Specific Suppression for Adaptive Object Detection

Yu Wang et al.

Summary: This study presents a new perspective on how CNN models gain transferability, distinguishing and suppressing domain-specific directions to optimize domain adaptation in object detection. Experimental results demonstrate that the domain-specific suppression method significantly improves object detection performance, with an mAP increase of 10.2 to 12.2%.

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Informative and Consistent Correspondence Mining for Cross-Domain Weakly Supervised Object Detection

Luwei Hou et al.

Summary: The study proposes a novel cross-domain co-attention scheme for more accurate knowledge transfer through learning pixel-wise cross-domain correspondences.

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 (2021)

Article Computer Science, Artificial Intelligence

Focal Loss for Dense Object Detection

Tsung-Yi Lin et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2020)

Article Computer Science, Artificial Intelligence

PCL: Proposal Cluster Learning for Weakly Supervised Object Detection

Peng Tang et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2020)

Article Computer Science, Artificial Intelligence

The Open Images Dataset V4: Unified Image Classification, Object Detection, and Visual Relationship Detection at Scale

Alina Kuznetsova et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2020)

Article Computer Science, Artificial Intelligence

Self Paced Deep Learning for Weakly Supervised Object Detection

Enver Sangineto et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2019)

Article Computer Science, Artificial Intelligence

Leveraging Prior-Knowledge for Weakly Supervised Object Detection Under a Collaborative Self-Paced Curriculum Learning Framework

Dingwen Zhang et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2019)

Review Computer Science, Artificial Intelligence

Object Detection With Deep Learning: A Review

Zhong-Qiu Zhao et al.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2019)

Article Computer Science, Artificial Intelligence

Learning Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection

Gong Cheng et al.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2019)

Proceedings Paper Computer Science, Artificial Intelligence

Adapting Object Detectors via Selective Cross-Domain Alignment

Xinge Zhu et al.

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) (2019)

Article Engineering, Electrical & Electronic

T-CNN: Tubelets With Convolutional Neural Networks for Object Detection From Videos

Kai Kang et al.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2018)

Article Computer Science, Artificial Intelligence

Crafting GBD-Net for Object Detection

Xingyu Zeng et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2018)

Article Geochemistry & Geophysics

Online Exemplar-Based Fully Convolutional Network for Aircraft Detection in Remote Sensing Images

Bowen Cai et al.

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS (2018)

Article Computer Science, Artificial Intelligence

Image Captioning and Visual Question Answering Based on Attributes and External Knowledge

Qi Wu et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2018)

Proceedings Paper Energy & Fuels

Space Charge Analysis of Polyethylene with Chemical Defects Based on Density Function Theory

Tao Lin et al.

2018 IEEE INTERNATIONAL CONFERENCE ON HIGH VOLTAGE ENGINEERING AND APPLICATION (ICHVE) (2018)

Proceedings Paper Computer Science, Artificial Intelligence

Structure Inference Net: Object Detection Using Scene-Level Context and Instance-Level Relationships

Yong Liu et al.

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2018)

Proceedings Paper Computer Science, Artificial Intelligence

Dynamic Zoom-in Network for Fast Object Detection in Large Images

Mingfei Gao et al.

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2018)

Proceedings Paper Computer Science, Artificial Intelligence

Generative Adversarial Learning Towards Fast Weakly Supervised Detection

Yunhan Shen et al.

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2018)

Proceedings Paper Computer Science, Artificial Intelligence

Real-Time Rotation-Invariant Face Detection with Progressive Calibration Networks

Xuepeng Shi et al.

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2018)

Article Computer Science, Information Systems

Attentive Contexts for Object Detection

Jianan Li et al.

IEEE TRANSACTIONS ON MULTIMEDIA (2017)

Article Computer Science, Artificial Intelligence

Weakly Supervised Object Localization with Multi-Fold Multiple Instance Learning

Ramazan Gokberk Cinbis et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2017)

Article Computer Science, Artificial Intelligence

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Shaoqing Ren et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2017)

Proceedings Paper Computer Science, Artificial Intelligence

FCNN: Fourier Convolutional Neural Networks

Harry Pratt et al.

MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2017, PT I (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Soft Proposal Networks for Weakly Supervised Object Localization

Yi Zhu et al.

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Weakly Supervised Cascaded Convolutional Networks

Ali Diba et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

StuffNet: Using 'Stuff' to Improve Object Detection

Samarth Brahmbhatt et al.

2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Speed/accuracy trade-offs for modern convolutional object detectors

Jonathan Huang et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Learning non-maximum suppression

Jan Hosang et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Mask R-CNN

Kaiming He et al.

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Scale-Aware Face Detection

Zekun Hao et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Spatial Memory for Context Reasoning in Object Detection

Xinlei Chen et al.

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Soft-NMS - Improving Object Detection With One Line of Code

Navaneeth Bodla et al.

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network

Christian Ledig et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Article Engineering, Electrical & Electronic

Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks

Kaipeng Zhang et al.

IEEE SIGNAL PROCESSING LETTERS (2016)

Article Geochemistry & Geophysics

Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images

Gong Cheng et al.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2016)

Article Computer Science, Artificial Intelligence

Accelerating Very Deep Convolutional Networks for Classification and Detection

Xiangyu Zhang et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2016)

Article Computer Science, Artificial Intelligence

What Makes for Effective Detection Proposals?

Jan Hosang et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2016)

Article Computer Science, Artificial Intelligence

Region-Based Convolutional Networks for Accurate Object Detection and Segmentation

Ross Girshick et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2016)

Proceedings Paper Computer Science, Artificial Intelligence

We don't need no bounding-boxes: Training object class detectors using only human verification

Dim P. Papadopoulos et al.

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2016)

Proceedings Paper Computer Science, Artificial Intelligence

Adaptive Object Detection Using Adjacency and Zoom Prediction

Yongxi Lu et al.

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2016)

Proceedings Paper Computer Science, Artificial Intelligence

Is Faster R-CNN Doing Well for Pedestrian Detection?

Liliang Zhang et al.

COMPUTER VISION - ECCV 2016, PT II (2016)

Proceedings Paper Computer Science, Artificial Intelligence

Contextual Priming and Feedback for Faster R-CNN

Abhinav Shrivastava et al.

COMPUTER VISION - ECCV 2016, PT I (2016)

Article Computer Science, Artificial Intelligence

Contextualizing Object Detection and Classification

Qiang Chen et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2015)

Article Computer Science, Artificial Intelligence

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2015)

Review Multidisciplinary Sciences

Deep learning

Yann LeCun et al.

NATURE (2015)

Article Computer Science, Artificial Intelligence

The PASCAL Visual Object Classes Challenge: A Retrospective

Mark Everingham et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2015)

Proceedings Paper Computer Science, Artificial Intelligence

Spatial Semantic Regularisation for Large Scale Object Detection

Damian Mrowca et al.

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2015)

Proceedings Paper Computer Science, Artificial Intelligence

Fast R-CNN

Ross Girshick

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2015)

Proceedings Paper Computer Science, Artificial Intelligence

Object detection via a multi-region & semantic segmentation-aware CNN model

Spyros Gidaris et al.

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2015)

Proceedings Paper Computer Science, Artificial Intelligence

Learning Complexity-Aware Cascades for Deep Pedestrian Detection

Zhaowei Cai et al.

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2015)

Article Geography, Physical

Multi-class geospatial object detection and geographic image classification based on collection of part detectors

Gong Cheng et al.

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING (2014)

Proceedings Paper Computer Science, Artificial Intelligence

Scalable Object Detection using Deep Neural Networks

Dumitru Erhan et al.

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2014)

Proceedings Paper Computer Science, Artificial Intelligence

BING: Binarized Normed Gradients for Objectness Estimation at 300fps

Ming-Ming Cheng et al.

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2014)

Article Computer Science, Artificial Intelligence

Selective Search for Object Recognition

J. R. R. Uijlings et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2013)

Article Computer Science, Artificial Intelligence

Measuring the Objectness of Image Windows

Bogdan Alexe et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2012)

Article Computer Science, Artificial Intelligence

Pedestrian Detection: An Evaluation of the State of the Art

Piotr Dollar et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2012)

Article Computer Science, Artificial Intelligence

Discriminative Models for Multi-Class Object Layout

Chaitanya Desai et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2011)

Article Computer Science, Artificial Intelligence

Object Detection with Discriminatively Trained Part-Based Models

Pedro F. Felzenszwalb et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2010)

Article Computer Science, Artificial Intelligence

Robust real-time face detection

P Viola et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2004)

Article Computer Science, Artificial Intelligence

Shape matching and object recognition using shape contexts

S Belongie et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2002)

Article Computer Science, Artificial Intelligence

Coarse-to-fine face detection

F Fleuret et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2001)

Article Computer Science, Artificial Intelligence

A trainable system for object detection

C Papageorgiou et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2000)