4.7 Article

Semantics-Preserving Graph Propagation for Zero-Shot Object Detection

Journal

IEEE TRANSACTIONS ON IMAGE PROCESSING
Volume 29, Issue -, Pages 8163-8176

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TIP.2020.3011807

Keywords

Semantics; Object detection; Task analysis; Visualization; Motorcycles; Bicycles; Correlation; Zero-shot object detection; semantic embedding; semantic relation; graph propagation

Funding

  1. China Scholarship Council
  2. National Key Research and Development Program of China [2016YFB1000903]
  3. National Nature Science Foundation of China [61872287, 61772411, 61532015]
  4. Innovative Research Group of the National Natural Science Foundation of China [61721002]
  5. Innovation Research Team of Ministry of Education [IRT 17R86]
  6. Project of China Knowledge Center for Engineering Science and Technology

Ask authors/readers for more resources

Most existing object detection models are restricted to detecting objects from previously seen categories, an approach that tends to become infeasible for rare or novel concepts. Accordingly, in this paper, we explore object detection in the context of zero-shot learning, i.e., Zero-Shot Object Detection (ZSD), to concurrently recognize and localize objects from novel concepts. Existing ZSD algorithms are typically based on a strict mapping-transfer strategy that suffers from a significant visual-semantic gap. To bridge the gap, we propose a novel Semantics-Preserving Graph Propagation model for ZSD based on Graph Convolutional Networks (GCN). More specifically, we develop a graph construction module to flexibly build category graphs by leveraging diverse correlations between category nodes; this is followed by two semantics-preserving graph propagation modules that enhance both category and region representations. Benefiting from the multi-step graph propagation process, both the semantic description and structural knowledge exhibited in prior category graphs can be effectively leveraged to boost the generalization capability of the learned projection function. Experiments on existing seen/unseen splits of three popular object detection datasets demonstrate that the proposed approach performs favorably against state-of-the-art ZSD methods.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available