☆ 4.6 Article

Generating training images with different angles by GAN for improving grocery product image recognition

NEUROCOMPUTING (2022)

期刊

NEUROCOMPUTING

卷 488, 期 -, 页码 694-705

出版社

ELSEVIER

DOI: 10.1016/j.neucom.2021.11.080

关键词

Grocery product recognition; Data augmentation; Generative adversarial network (GAN); Convolutional neural network (CNN)

类别

Computer Science, Artificial Intelligence

资金

China Scholarship Council (CSC)
CSC

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper proposes a multi-angle Generative Adversarial Network (MAGAN) for data augmentation in grocery product recognition tasks. By generating realistic training images from different angles, the deep learning model can achieve improved accuracy in recognizing grocery products.

Image recognition based on deep learning methods has gained remarkable achievements by feeding with abundant training data. Unfortunately, collecting a tremendous amount of annotated images is time-consuming and expensive, especially in grocery product recognition tasks. It is challenging to recognise grocery products accurately when the deep learning model is trained with insufficient data. This paper proposes multi-angle Generative Adversarial Networks (MAGAN), which can generate realistic training images with different angles for data augmentation. Mutual information is employed in the novel GAN to achieve the learning of angles in an unsupervised manner. This paper aims to create training images containing grocery products from different angles, thus improving grocery product recognition accuracy. We first enlarge the fruit dataset by using MAGAN and the state-of-the-art GAN variants. Then, we compare the top-1 accuracy results from CNN classifiers trained with different data augmentation methods. Finally, our experiments demonstrate that the MAGAN exceeds the existing GANs for grocery product recognition tasks, obtaining a significant increase in the accuracy. (C) 2021 Elsevier B.V. All rights reserved.

Generating training images with different angles by GAN for improving grocery product image recognition

期刊

NEUROCOMPUTING

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Generating training images with different angles by GAN for improving grocery product image recognition

期刊

NEUROCOMPUTING

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文