☆ 4.6 Article

Image captioning with semantic-enhanced features and extremely hard negative examples

NEUROCOMPUTING (2020)

Journal

NEUROCOMPUTING

Volume 413, Issue -, Pages 31-40

Publisher

ELSEVIER

DOI: 10.1016/j.neucom.2020.06.112

Keywords

Image captioning; Image-text matching; Negative examples

Funding

National Natural Science Foundation of China [61976094]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Image captioning is a task to generate natural descriptions of images. In existing image captioning models, the generated captions usually lack semantic discriminability. Semantic discriminability is difficult as it requires the model to capture detailed differences in images. In this paper, we propose an image captioning framework with semantic-enhanced features and extremely hard negative examples. These two components are combined in a Semantic-Enhanced Module. The semantic-enhanced module consists of an image-text matching sub-network and a Feature Fusion layer which provides semantic-enhanced features of rich semantic information. Moreover, in order to improve the semantic discriminability, we propose an extremely hard negative mining method which utilize the extremely hard negative examples to improve the latent alignment between visual and language information. Experimental results on MSCOCO and Flickr30K show that our proposed framework and training method can simultaneously improve the performance of image-text matching and image captioning, achieving competitive performance against state-of-the-art methods. (C) 2020 Elsevier B.V. All rights reserved.

Image captioning with semantic-enhanced features and extremely hard negative examples

Journal

NEUROCOMPUTING

Publisher

ELSEVIER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Image captioning with semantic-enhanced features and extremely hard negative examples

Journal

NEUROCOMPUTING

Publisher

ELSEVIER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper