3.8 Proceedings Paper

Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning

出版社

IEEE COMPUTER SOC
DOI: 10.1109/WACV51458.2022.00253

关键词

-

资金

  1. CERCA Programme/Generalitat de Catalunya
  2. AGAUR [2019PROD00090]
  3. UAB [B18P0073]
  4. [PID2020-116298GB-I00]

向作者/读者索取更多资源

The article discusses object bias (hallucination) in image captioning and presents three simple yet efficient training augmentation methods to reduce it without the need for new data or increased model size. The proposed methods are shown to significantly decrease object bias in the models based on hallucination metrics, and reduce dependency on visual features through experimental demonstration. All code, configuration files, and model weights are available online.
Explaining an image with missing or non-existent objects is known as object bias (hallucination) in image captioning. This behaviour is quite common in the state-of-the-art captioning models which is not desirable by humans. To decrease the object hallucination in captioning, we propose three simple yet efficient training augmentation method for sentences which requires no new training data or increase in the model size. By extensive analysis, we show that the proposed methods can significantly diminish our models' object bias on hallucination metrics. Moreover, we experimentally demonstrate that our methods decrease the dependency on the visual features. All of our code, configuration files and model weights are available online(1).

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据