3.8 Proceedings Paper

Spatial Knowledge Distillation to aid Visual Reasoning

Publisher

IEEE
DOI: 10.1109/WACV.2019.00030

Keywords

-

Funding

  1. National Science Foundation under the Robust Intelligence Program [1816039, 1750082]
  2. Direct For Computer & Info Scie & Enginr
  3. Div Of Information & Intelligent Systems [1816039] Funding Source: National Science Foundation
  4. Div Of Information & Intelligent Systems
  5. Direct For Computer & Info Scie & Enginr [1750082] Funding Source: National Science Foundation

Ask authors/readers for more resources

For tasks involving language and vision, the current state-of-the-art methods tend not to leverage any additional information that might be present to gather relevant (commonsense) knowledge. A representative task is Visual Question Answering where large diagnostic datasets have been proposed to test a system's capability of answering questions about images. The training data is often accompanied by annotations of individual object properties and spatial locations. In this work, we take a step towards integrating this additional privileged information in the form of spatial knowledge to aid in visual reasoning. We propose a framework that combines recent advances in knowledge distillation (teacher-student framework), relational reasoning and probabilistic logical languages to incorporate such knowledge in existing neural networks for the task of Visual Question Answering. Specifically, for a question posed against an image, we use a probabilistic logical language to encode the spatial knowledge and the spatial understanding about the question in the form of a mask that is directly provided to the teacher network. The student network learns from the ground-truth information as well as the teachers prediction via distillation. We also demonstrate the impact of predicting such a mask inside the teachers network using attention. Empirically, we show that both the methods improve the test accuracy over a state-of-the-art approach on a publicly available dataset.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

3.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available